Data processing: speech signal processing – linguistics – language – Speech signal processing – Application
Reexamination Certificate
1999-04-09
2003-10-21
McFadden, Susan (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Application
C704S270000
Reexamination Certificate
active
06636831
ABSTRACT:
NOTICE
A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
FIELD OF THE INVENTION
The present invention relates in general to voice-controlled devices and, in particular, to systems and processes for voice-controlled information retrieval.
BACKGROUND OF THE INVENTION
There is a continuing challenge in providing access to computational resources to mobile workers. A “mobile worker” performs job duties that require constant physical movement or manual labor, such as performed by a traditional blue-collar worker. Mobile workers typically use their hands in performing their work and do not work at a desk in a traditional office-type setting.
Personal computers and terminals fail to adequately provide computer access to the mobile worker for at least two reasons. First, personal computers and terminals are stationary devices. As a result, mobile workers are forced to alter their work patterns to allow for physical access centered on the stationary personal computer or terminal. Second, personal computers and terminals typically include a display and a keyboard or other tactile input device. Thus, mobile workers must take their eyes off their work to view the display and use their hands to operate the tactile input device. These changes in work patterns are not always practical.
Enterprise resource planning (ERP) systems are one type of computer resource particularly well suited for use by mobile workers. These systems provide an integrated solution by combining traditionally stand-alone legacy systems, such as human resources, sales, marketing and other functionally separate areas, into a unified package. Two companies active in the development of ERP solutions are PeopleSoft and SAP AG.
Moreover, the use of ERP systems opens up a wide range of new possible uses for information stored in corporate databases. For example, previously unavailable engineering plans, such as blueprints, can be made available to assembly line workers. Similarly, an inventory system can be updated on the fly by a packing clerk who works in the shipping department to reflect a change in the inventory of available goods.
Present mobile computing systems suffer from limited available bandwidth with which to send and receive data. This poses a problem with providing mobile workers with access to ERP information. Mobile workers require continuous access to corporate data. The use of visual-based browsers, by way of example, typically require high bandwidth capabilities which are not typically available on mobile computing devices. A speech-based approach is needed.
A prior art, speech only approach to providing voice-controlled access to information retrieval can be found in telephony interactive menu systems or so-called “voice response systems.” These systems are generally used by voice activated menu systems which provide a spoken menu of selections to a user over a telephone. The user indicates an appropriate response, generally corresponding to a number on the telephone keypad. The response can be spoken or keyed into the keypad. Such systems limit responses to a finite set of numeric potential choices. Such systems are further limited in the complexity of any given menu option which generally must be short and easily understandable to be effective.
A prior art, visual/speech approach to providing hands free access to information retrieval is a speech-enabled Web browser, such as described in the commonly assigned U.S. patent application Ser. No. 09/272,892, entitled “Voice-Controlled Web Browser,” pending, filed Mar. 19, 1999, the disclosure of which is incorporated herein by reference. Such speech-enabled Web browsers augment a standard user interface with a microphone and speaker. Hyperlinks are presented visually to the user who responds by voice using the hyperlink's text, or using a visual hint to make a selection. However, the visual nature of the information content itself inherently limits the flexibility of this approach. The voice prompts are driven by the linear arrangement of the Web content which is designed primarily for visual display and is not formatted for access by a speech-enabled browser. Consequently, complex information is not always easily accessible through speech-enabled Web browsers.
Consequently, there is a need for providing mobile workers with voice-controlled access to computer retrievable information without requiring the mobile worker to alter a work pattern through the use of a stationary personal computer or terminal which requires a display and manual tactile input. Such a solution would preferably be mobile in nature, that is, easily wearable or holdable by the mobile worker and operable without the need for a visual display. Alternately, such a solution could be embodied on a conventional client computer or on telephony devices.
SUMMARY OF THE INVENTION
The present invention provides an approach to voice-controlled information retrieval in which information, such as dynamically generated corporate data, can be presented to a mobile worker using a low bandwidth, speech-oriented connection. The approach includes the capability to present closely related, but mostly static, visual information or other high bandwidth information to a mobile worker using a portable or stationary, but locally situated, Web server. The visual information can optionally be displayed on a Web browser running on another client.
One embodiment of the present invention is a system, process and storage medium for voice-controlled information retrieval using a voice transceiver. A voice transceiver executes a conversation template. The conversation template comprises a script of tagged instructions comprising voice prompts and expected user responses. A speech engine processes a voice command identifying information content to be retrieved. The voice transceiver sends a remote method invocation requesting the identified information content to an applet process associated with a Web browser. An applet method retrieves the identified information content on the Web browser responsive to the remote method invocation.
A further embodiment of the present invention is a system, process and storage medium for retrieving Web content onto a browser running on a remote client using a voice transceiver. A storage device stores a conversation template on the server. The conversation template comprises a script including instruction tags for voice commands and voice prompts. A voice transceiver receives the conversation template. A parser parses the instruction tags from the script to form a set of interrelated tokens and instantiates an object corresponding to each token. An interpreter interprets the set of tokens by executing the object instance corresponding to each token. A speech engine receives a voice command on the voice transceiver from a user for Web content. A remote client is interconnected to the server and the voice transceiver via a network. The voice transceiver sends a remote method invocation identifying the Web content. The remote client includes an applet associated with a browser running on the remote client and requests the Web content from the server responsive to the remote method invocation. The browser receives the Web content.
A further embodiment of the present invention is a process and language definition embodied as code stored on a computer-readable storage medium for facilitating speech driven information processing using a voice transceiver. A speech markup document for speech operations interpretable by the voice transceiver is defined. The markup document comprises a set of tags with each such tag comprising a speech instruction and at least one such tag further comprising a remote procedure call. An applet object for information processing operations interpre
Brown N. Gregg
Colombo Lianne M.
Mezey Peter S.
Profit, Jr. Jack H.
Christensen O'Connor Johnson & Kindness PLLC
Inroad, Inc.
McFadden Susan
LandOfFree
System and process for voice-controlled information retrieval does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and process for voice-controlled information retrieval, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and process for voice-controlled information retrieval will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3150278