Accessing network-based electronic information through...

Electrical computers and digital processing systems: multicomput – Remote data accessing – Accessing a remote server

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C709S227000

Reexamination Certificate

active

06513063

ABSTRACT:

BACKGROUND OF THE INVENTION
The present invention relates generally to the navigation of electronic data by means of spoken natural language requests, and to feedback mechanisms and methods for resolving the errors and ambiguities that may be associated with such requests.
As global electronic connectivity continues to grow, and the universe of electronic data potentially available to users continues to expand, there is a growing need for information navigation technology that allows relatively naive users to navigate and access desired data by means of natural language input. In many of the most important markets—including the home entertainment arena, as well as mobile computing—spoken natural language input is highly desirable, if not ideal. As just one example, the proliferation of high-bandwidth communications infrastructure for the home entertainment market (cable, satellite, broadband) enables delivery of movies-on-demand and other interactive multimedia content to the consumer's home television set. For users to take full advantage of this content stream ultimately requires interactive navigation of content databases in a manner that is too complex for user-friendly selection by means of a traditional remote-control clicker. Allowing spoken natural language requests as the input modality for rapidly searching and accessing desired content is an important objective for a successful consumer entertainment product in a context offering a dizzying range of database content choices. As further examples, this same need to drive navigation of (and transaction with) relatively complex data warehouses using spoken natural language requests applies equally to surfing the Internet/Web or other networks for general information, multimedia content, or e-commerce transactions.
In general, the existing navigational systems for browsing electronic databases and data warehouses (search engines, menus, etc.), have been designed without navigation via spoken natural language as a specific goal. So today's world is full of existing electronic data navigation systems that do not assume browsing via natural spoken commands, but rather assume text and mouse-click inputs (or in the case of TV remote controls, even less). Simply recognizing voice commands within an extremely limited vocabulary and grammar—the spoken equivalent of button/click input (e.g., speaking “channel
5
” selects TV channel
5
)—is really not sufficient by itself to satisfy the objectives described above. In order to deliver a true “win” for users, the voice-driven front-end must accept spoken natural language input in a manner that is intuitive to users. For example, the front-end should not require learning a highly specialized command language or format. More fundamentally, the front-end must allow users to speak directly in terms of what the user ultimately wants—e.g., “I'd like to see a Western film directed by Clint Eastwood”—as opposed to speaking in terms of arbitrary navigation structures (e.g., hierarchical layers of menus, commands, etc.) that are essentially artifacts reflecting constraints of the pre-existing text/click navigation system. At the same time, the front-end must recognize and accommodate the reality that a stream of naive spoken natural language input will, over time, typically present a variety of errors and/or ambiguities: e.g., garbled/unrecognized words (did the user say “Eastwood” or “Easter”?) and under-constrained requests (“Show me the Clint Eastwood movie”). An approach is needed for handling and resolving such errors and ambiguities in a rapid, user-friendly, non-frustrating manner.
What is needed is a methodology and apparatus for rapidly constructing a voice-driven front-end atop an existing, non-voice data navigation system, whereby users can interact by means of intuitive natural language input not strictly conforming to the step-by-step browsing architecture of the existing navigation system, and wherein any errors or ambiguities in user input are rapidly and conveniently resolved. The solution to this need should be compatible with the constraints of a multi-user, distributed environment such as the Internet/Web or a proprietary high-bandwidth content delivery network; a solution contemplating one-at-a-time user interactions at a single location is insufficient, for example.
SUMMARY OF THE INVENTION
The present invention addresses the above needs by providing a system, method, and article of manufacture for navigating network-based electronic data sources in response to spoken input requests. When a spoken input request is received from a user, it is interpreted, such as by using a speech recognition engine to extract speech data from acoustic voice signals, and using a language parser to linguistically parse the speech data. The interpretation of the spoken request can be performed on a computing device locally with the user or remotely from the user. The resulting interpretation of the request is thereupon used to automatically construct an operational navigation query to retrieve the desired information from one or more electronic network data sources, which is then transmitted to a client device of the user. If the network data source is a database, the navigation query is constructed in the format of a database query language.
Typically, errors or ambiguities emerge in the interpretation of the spoken request, such that the system cannot instantiate a complete, valid navigational template. This is to be expected occasionally, and one preferred aspect of the invention is the ability to handle such errors and ambiguities in relatively graceful and user-friendly manner. Instead of simply rejecting such input and defaulting to traditional input modes or simply asking the user to try again, a preferred embodiment of the present invention seeks to converge rapidly toward instantiation of a valid navigational template by soliciting additional clarification from the user as necessary, either before or after a navigation of the data source, via multimodal input, i.e., by means of menu selection or other input modalities including and in addition to spoken input. This clarifying, multi-modal dialogue takes advantage of whatever partial navigational information has been gleaned from the initial interpretation of the user's spoken request. This clarification process continues until the system converges toward an adequately instantiated navigational template, which is in turn used to navigate the network-based data and retrieve the user's desired information. The retrieved information is transmitted across the network and presented to the user on a suitable client display device.


REFERENCES:
patent: 5197005 (1993-03-01), Schwartz et al.
patent: 5386556 (1995-01-01), Hedin et al.
patent: 5434777 (1995-07-01), Luciw
patent: 5519608 (1996-05-01), Kupiec
patent: 5608624 (1997-03-01), Luciw
patent: 5721938 (1998-02-01), Stuckey
patent: 5729659 (1998-03-01), Potter
patent: 5748974 (1998-05-01), Johnson
patent: 5774859 (1998-06-01), Houser et al.
patent: 5794050 (1998-08-01), Dahlgren et al.
patent: 5802526 (1998-09-01), Fawcett et al.
patent: 5805775 (1998-09-01), Eberman et al.
patent: 5855002 (1998-12-01), Armstrong
patent: 5890123 (1999-03-01), Brown et al.
patent: 5963940 (1999-10-01), Liddy et al.
patent: 6003072 (1999-12-01), Gerritsen et al.
patent: 6012030 (2000-01-01), French-St. George et al.
patent: 6026388 (2000-02-01), Liddy et al.
patent: 6173279 (2001-01-01), Levin et al.
patent: 6192338 (2001-02-01), Haszto et al.
patent: 0 895 396 (1999-02-01), None
patent: 1 094 406 (2001-04-01), None
http://www.ai.sri.com
atural-language/projects/arpa-sls
at-lang.html “Gemini: A Natural Language System for Spoken-Language Understanding” “Interleaving Syntax and Semantics in an Efficient Bottom-Up Parser,”.
http://www.ai.sri.com
atural-language/projects/arpa-sls/spnl-int.html “Combining Linguistic and Statistical Knowledge Sources in Natural-Language Processing for ATIS”.
http://www.ai.sri.com/~oaa.applications.html “InfoWiz: An Animated Voice Interactive

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Accessing network-based electronic information through... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Accessing network-based electronic information through..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Accessing network-based electronic information through... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3023062

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.