Data processing: speech signal processing – linguistics – language – Speech signal processing – Application
Reexamination Certificate
1999-02-25
2001-06-19
Dorvil, Richemond (Department: 2741)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Application
C704S275000
Reexamination Certificate
active
06249764
ABSTRACT:
FIELD OF THE INVENTION
The invention relates to a system and method for retrieving speech information and for presenting such information to a user. In particular, the invention relates to a system and method for specifying, retrieving, and presenting the speech information desired by the user.
BACKGROUND OF THE INVENTION
Recently, a large variety of information has become available over the Internet. A user can retrieve and display such information using a personal computer located at various places including the home. The user can use a browser operating through the World-Wide Web that extends throughout the world to retrieve and display the information. Information retrieved from the Internet and displayed to the user includes words or phrases distinguished from the remainder of the information by such devices as underlining or display in a different color. Such words or phrases include concealed links to other World-Wide Web pages containing additional information. The words or phrases distinguished in this way are called hot spots. The user can obtain additional information related to the hot spot by moving the mouse pointer to the hot spot and clicking a button on the mouse. The browser responds by jumping to the web page linked to the word or phrase. This web page provides the additional information.
A terminal, such as a personal computer, that includes a graphics display is required to perform the Web-based information retrieval method just described. If a computer or other graphics display device is not available, information cannot normally be retrieved using the World-Wide Web. Moreover, even if a computer or other graphics display device is available, the user may not be able to retrieve the information if the user is unable to operate a pointing device, such as a mouse, or if the user is unable to see the information displayed on a screen. This may occur, for example, if the user is physically or visually handicapped. Moreover, it may be difficult for a bedridden person to see a graphics display device, to operate a mouse and to type using a keyboard. It has been proposed to solve these problems by using speech to present information in a manner analogous to the way information is presented in Web pages.
Automated speech has been used to present information in a variety of fields as a way of reducing labor costs. Current methods for providing information using automated speech are based on conventional telephone networks and have many operational problems. Such systems are unable to present information using speech in a manner analogous to the way in which information is presented in Web pages. One reason for this is the difficulty in including in speech a feature analogous to the above-described hot spots. Conventional ways of presenting information using speech provide the user with rudimentary choices by presenting the user with multi-item menus to which the user responds by entering a number using the keypad on the telephone.
An example of a conventional way of presenting information using speech will now be described. In this, a recorded or synthesized voice makes the following announcement: “After the beep, please enter your location: 1. Tokyo, 2. Kanagawa, 3. Saitama, 4. Chiba, 5. Yamanashi.” After the beep, the user uses the keypad of the telephone to enter the number corresponding to his or her choice. Information has to be presented in this way because hot spots are not included in the speech information defining the menu. The drawback of this method is that the user has to listen until the whole menu has been presented because the user does not know the whole menu until the whole menu has been presented. In addition, during presentation of the whole menu, the user must remember the number corresponding to the user's choice while continuing to listen for a more appropriate choice. If the menu is long, a memory lapse or confusion on the part of the user may result in the user entering the incorrect number. Conventional systems try to overcome this problem by presenting only simple menus. If many choices are available, several menus may be required. This increases in the time required for the user to retrieve the information he or she desires and increases the possibility of the user making an incorrect choice. Conventional ways of presenting information using speech also have operability problems. For example, such ways may lack the ability to return to a previous menu. Thus, conventional ways of presenting information using speech have problems in function and operation, and may not be able to provide the exact information desired by the user when the user requires extensive, in-depth information.
What is needed, therefore, is an improved system and method for retrieving and presenting information using speech. Such system and method should provide improved convenience and operability in terms of selecting, detecting, retrieving, and presenting the information desired by the user. The system and method should use a simple operating method to present to the user speech information covering a broad range of content and in the depth desired by the user.
What also is needed is a system and method for retrieving and presenting speech information that operates analogously to the way in which a web browser retrieves and presents information using the World-Wide Web.
SUMMARY OF THE INVENTION
The invention provides a method of retrieving and presenting a desired item of speech information. In the method, speech files are provided, and a speech information presentation operation is iteratively performed until the speech information presentation operation present the desired speech information. In the speech information presentation operation, a speech file is retrieved and the speech information represented by the speech file is presented. Each speech file represents an item of speech information. At least one of the speech files is a hyperspeech file that represents an item of speech information and includes a hot spot specification specifying a hot spot in the item of speech information. The hot spot specification comprises a hot spot definition and an identifier. The hot spot definition defines the hot spot in the speech information. During the hot spot, the speech information identifies additional speech information. The identifier identifies another of the speech files that represents the additional speech information. The speech file retrieved first is a hyperspeech file.
In the iteratively-performed speech information presentation operation, a speech file retrieval operation is performed in which one of the speech files is retrieved, any hot spot specification comprised therein is extracted, and a speech signal is generated therefrom. The speech signal includes a distinguishing portion corresponding to any hot spot specified therein. The speech information is presented in response to the speech signal. The distinguishing portion included in the speech signal distinguishes the hot spot from the remainder of the speech information during presentation of the speech information. When the speech information presented is not the desired speech information, a user request signal is provided during the hot spot to request presentation of the item of additional speech information identified by the speech information presented during the hot spot. The identifier included in the hot spot specification is referenced in response to the user request signal. The identifier identifies the speech file that will be retrieved when the speech file retrieval operation is next performed.
The invention also provides a system for retrieving and presenting a desired item of speech information. The system comprises speech files, a hyperspeech file processor and a user interface device linked to the hyperspeech file processor. Each of the speech files represents an item of speech information. At least one of the speech files is a hyperspeech file representing an item of speech information and including a hot spot specification specifying a hot spot in the item of speech inform
Hirayama Makoto
Kamae Takahiko
Dorvil Richemond
Hewlett--Packard Company
LandOfFree
System and method for retrieving and presenting speech... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for retrieving and presenting speech..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for retrieving and presenting speech... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2532310