User interface for speech recognition system grammars

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S275000

Reexamination Certificate

active

06456972

ABSTRACT:

TECHNICAL FIELD
The present invention relates to speech recognition systems, and more particularly, to a graphical user interface for interacting with multiple application-associated grammars.
BACKGROUND ART
A computer operating system graphical user interface allows a user to control multiple applications by manually operating a pointing device to position a cursor over application-associated icons. The pointing device is also employed within a specific application to control operation of the application, typically by drop-down menus and dialog boxes.
Speech recognition systems may also perform such application control by spoken input rather than manually operated pointing device. Multiple computer applications may be registered with a speech recognition engine, e.g.,word processing, e-mail, etc. Each such application may have an associated grammar of valid vocabulary and usage. These grammars vary in their complexity. A grammar may be as simple as a list of command words such as “open,” “close,” “save,” “print,” etc. Such a simple grammar may also be visually displayed to a user in the familiar structure of drop-down menus. More sophisticated grammars emulate natural language usage and may parse complex phrases such as: “Italicize the first paragraph.”
A grammar is properly viewed as the specification of word sequence structures permitted in a given language or application. The most common kinds of grammars are known as context-free grammars (CFGs) which contain a set of terminal symbols that appear in final sequences (e.g.,words and punctuation), a set of non-terminal symbols that are expanded into other symbols (e.g., “NP” for a noun phrase, “VP” for a verb phrase), a specific non-terminal designated as the starting symbol (e.g., “s-maj”, for major sentence), and a set of rewrite rules each of which has a single non-terminal symbol on the left-hand side (LHS) and one or more symbols on the right-hand side (RHS). A context-free grammar thus assigns one or more structures to every valid word sequence in a language.
Context-free grammars are often presented in Backus-Naur Form (BNF) notation. In BNF, non-terminal symbols are enclosed in <brackets> and the LHS and RHS of a rewrite rule are separated by an indicator symbol ‘::=’ which may be read as “includes”. In an example of a simple grammar:
<SENTENCE>::=<SUBJECT><VERBPHRASE>
<SUBJECT>::=John|Mary
<VERBPHRASE>::=<VERB><OBJECT>
<VERB>::=eats|drinks
<OBJECTS>::=wine|cheese
A speech recognition system user may have some difficulty operating multiple applications by spoken command when each application has its own different associated grammar of valid vocabulary and usage. As a result, at any given instant, such a user may not know what commands may be spoken and recognized by a given application.
SUMMARY OF THE INVENTION
A preferred embodiment of the present invention provides a graphical user interface of a speech recognition system for user interaction with at least one grammar of valid vocabulary and usage for an application which is accessible to the speech recognition system. The interface includes an interactive dialog box in a portion of a display screen, a grammar identifier displayed within the dialog box which describes an active grammar, a search identifier displayed within the dialog box which describes a grammar search mode for determining valid inputs according to the active grammar, a current input displayed within the dialog box which describes a spoken input, and a valid grammar list displayed within the dialog box which lists examples of valid vocabulary and usage according to the grammar search mode for the active grammar. The dialog box may further include an active grammar display within the dialog box which displays the active grammar, and which may be used to interactively select the current input.
In a further embodiment, the active grammar or the grammar search mode, or both are interactively selectable using the dialog box. Similarly, one of the examples displayed in the valid grammar list may be interactively selectable as an input to the application of the active grammar.
In addition, or alternatively, the grammar search mode may determine all word phrases of at least one word which are valid in the active grammar, or word phrases of at least one word which contain the current input and which are valid in the active grammar. Such word phrases may be further constrained to start with the current input.
A preferred embodiment may also bias towards shortness the examples displayed in the valid grammar list by use of a matrix-based algorithm. For example, the matrix-based algorithm may determine an average generated example length to bias the examples towards shortness.
A preferred embodiment also includes a method of user interaction via a graphical user interface with at least one grammar of valid vocabulary and usage for an application accessible to a speech recognition system. The method includes creating an interactive dialog box in a portion of a display screen; displaying within the dialog box a grammar identifier which describes an active grammar, a search identifier which describes a grammar searching mode for determining valid inputs according to the active grammar, and a current input which describes a spoken input; and generating and displaying within the dialog box a valid grammar list which lists examples of valid vocabulary and usage according to the grammar search mode for the active grammar. The dialog box may further include an active grammar display within the dialog box which displays the active grammar, and which may be used to interactively select the current input.
In a further embodiment, the active grammar or the grammar search mode, or both, are interactively selected using the dialog box. Similarly, one of the examples displayed in the valid grammar list may be selected as an input to the application of the active grammar.
In addition, or alternatively, the grammar search mode may determine all word phrases of at least one word which are valid in the active grammar, or word phrases of at least one word which contain the current input and which are valid in the active grammar. Such word phrases may be further constrained to start with the current input.
A preferred embodiment may also bias towards shortness the examples displayed in the valid grammar list by use of a matrix-based algorithm. For example, the matrix-based algorithm may determine an average generated example length to bias the examples towards shortness.


REFERENCES:
patent: 5774628 (1998-06-01), Hemphill
patent: 5812977 (1998-09-01), Douglas
patent: 5890122 (1999-03-01), Van Kleeck et al.
patent: 5937385 (1999-08-01), Zadrozny et al.
patent: 5983190 (1999-11-01), Trower, II et al.
patent: 6064961 (2000-05-01), Hanson
patent: 6094635 (2000-07-01), Scholz et al.
patent: 6178404 (2001-01-01), Hambleton et al.
patent: 6188985 (2001-02-01), Thrift et al.
patent: 6208971 (2001-03-01), Bellegarda et al.
patent: 6208972 (2001-03-01), Grant et al.
patent: 195 33 541 (1997-03-01), None
patent: 08044384 (1996-02-01), None

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

User interface for speech recognition system grammars does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with User interface for speech recognition system grammars, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and User interface for speech recognition system grammars will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2892492

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.