Speech recognition apparatus and method for matching...



Filing date: 1994-02-14
Issue date: 2001-05-22
Examiner: Dorvil, Richemond (Department: 2741)
Classification: Data processing: speech signal processing, linguistics, language; Speech signal processing; Recognition
Classification codes: C704S238000, C704S239000
Type: Reexamination Certificate
Status: active
Patent number: 06236964

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a speech recognition method and an apparatus therefor and, more particularly, to a method and apparatus for recognizing speech, such as a word, uttered continuously by an unspecified speaker.
2. Description of the Prior Art
Of the various known techniques for recognizing speech from unspecified speakers, the most commonly used recognition system is described below.
FIG. 15 shows the configuration of a recognition system which handles large unspecified vocabularies. Speech input from a speech input unit 1 is sent to a speech analysis unit 2, where either a filter bank output including the power term of the speech or a feature parameter of the input speech, such as the LPC cepstrum, is obtained. The speech analysis unit 2 also compresses the parameters (in the case of the filter bank output, dimension compression by the K-L transform). Since analysis is conducted frame by frame, the compressed feature parameter is hereinafter referred to as a feature vector.
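As a minimal sketch of frame-by-frame analysis, the following splits a sampled signal into overlapping frames and computes one value per frame, the log power. The function name, frame length, and hop size are assumptions made for illustration; a real analysis unit would produce a multi-dimensional vector per frame (filter bank channels or LPC cepstrum coefficients) and then compress it.

```python
import math

def frame_log_power(samples, frame_len, hop):
    """Return the log power of each frame of `samples`.

    Hypothetical helper: this computes only the power term, one number per
    frame. It does not compute a filter bank output or the LPC cepstrum.
    """
    features = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        # Small constant avoids log(0) on silent frames.
        features.append(math.log(power + 1e-10))
    return features

# Toy signal: four silent samples followed by four samples of amplitude 1.0.
signal = [0.0] * 4 + [1.0] * 4
features = frame_log_power(signal, frame_len=4, hop=2)
print(len(features))  # 3 frames, starting at samples 0, 2 and 4
```

Each element of `features` corresponds to one frame, so later stages can refer to positions in the utterance by frame index.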
Next, a phoneme boundary detecting unit 3 determines the phoneme boundaries in the continuously uttered speech. Subsequently, a phoneme discriminating unit 4 determines phonemes by a statistical technique. A reference phoneme pattern storing unit 5 stores reference phoneme patterns created from a large number of phoneme samples. A word discriminating unit 6 outputs a final recognition result from a word dictionary 7, either using the output of the phoneme discriminating unit 4 directly or after modifying the candidate phonemes by means of a modification regulating unit 8. The recognition results are displayed by a recognition result display unit 9.
Generally, the phoneme boundary detecting unit 3 uses discrimination functions or the like. The phoneme discriminating unit 4 also conducts discrimination using such functions. Each of these components outputs the candidates which satisfy a predetermined threshold, so a plurality of phoneme candidates are output for each phoneme boundary. The word discriminating unit 6 therefore narrows the candidates down to a final word using the top-down information stored in the word dictionary 7 and the modification regulating unit 8.
However, since the aforementioned conventional recognition system basically has a bottom-up structure, an error generated at any point in the recognition process readily affects the subsequent processes adversely. For example, when a phoneme boundary is erroneously determined by the phoneme boundary detecting unit 3, the operation of the phoneme discriminating unit 4 or the word discriminating unit 6 may be greatly affected. That is, the errors of the individual processes compound: the final speech recognition rate is limited to roughly the product of the recognition rates of the individual processes. It is therefore impossible to attain a high recognition rate.
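One way to read this compounding effect: if each process is treated as independent and an error in any process cannot be recovered later, the overall success rate is the product of the per-process success rates. The sketch below works through that arithmetic. The three accuracy values are hypothetical and are not taken from the source.

```python
from math import prod

def pipeline_accuracy(stage_accuracies):
    """Overall accuracy of a strictly sequential pipeline.

    Assumes the stages fail independently and that a failure in any stage
    is never corrected downstream (a simplifying assumption).
    """
    return prod(stage_accuracies)

# Hypothetical accuracies for phoneme boundary detection, phoneme
# discrimination and word discrimination (illustrative values only).
stages = [0.95, 0.90, 0.92]
overall = pipeline_accuracy(stages)
print(f"{overall:.4f}")  # 0.7866
```

Even though every stage in this example is at least 90% accurate on its own, the chain as a whole falls below 80%.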
Furthermore, in a recognition apparatus designed for unspecified speakers, setting the threshold value used for the determination made in each process is very difficult. A threshold value which ensures that the correct candidate is contained in the output increases the number of candidates in each process and hence makes accurate narrowing of the plurality of candidate words very difficult. Moreover, when the recognition apparatus is used in an actual environment, a large amount of unsteady-state noise is generated, which lowers the recognition rate even for a recognition apparatus designed to handle a small number of words.
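The threshold trade-off can be illustrated with a toy example. The scores, labels, and threshold values below are invented for illustration. The point is only that a threshold loose enough to keep the correct candidate also lets more competitors through.

```python
def candidates_above(scores, threshold):
    """Return the labels whose score meets the threshold, best first."""
    kept = [(label, s) for label, s in scores.items() if s >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return [label for label, _ in kept]

# Hypothetical similarity scores for one segment; the correct phoneme "e"
# scores poorly, as can happen with an unfamiliar speaker.
scores = {"a": 0.62, "o": 0.58, "e": 0.41, "u": 0.37, "i": 0.15}
correct = "e"

for threshold in (0.6, 0.4, 0.2):
    kept = candidates_above(scores, threshold)
    print(threshold, kept, correct in kept)
# 0.6 ['a'] False
# 0.4 ['a', 'o', 'e'] True
# 0.2 ['a', 'o', 'e', 'u'] True
```

With a strict threshold the correct phoneme is lost. With a loose one it survives, but the later word-level process has to choose among several candidates at every boundary.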
SUMMARY OF THE INVENTION
An object of the present invention is to provide a speech recognition method which is capable of recognizing speech continuously uttered by a given speaker at a high recognition rate, and a speech recognition apparatus therefor.
Another object of the present invention is to provide a speech recognition method, and a speech recognition apparatus therefor, which comprises two stages: the first selects candidate words concurrently with the slicing of the speech section in units of words by spotting, and the second conducts matching in units of phonemes. The selection of candidate words and the slicing of the speech section are thus conducted at the same time, and reducing the number of candidate words is facilitated.
Another object of the present invention is to provide a speech recognition method, and a speech recognition apparatus therefor, in which reference phoneme patterns are prepared for a plurality of environments, so that input speech under a larger number of conditions can be recognized using a smaller amount of data than when reference word patterns are prepared for a plurality of environments.
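As a rough illustration of this data-size argument, the sketch below counts the reference patterns needed under each scheme. The vocabulary size, phoneme inventory size, and number of environments are hypothetical values chosen for illustration. None of them come from the source.

```python
def reference_pattern_count(num_units, num_environments):
    """Patterns needed when one pattern is stored per unit per environment."""
    return num_units * num_environments

# Hypothetical sizes (assumptions, not from the source).
NUM_WORDS = 1000
NUM_PHONEMES = 40
NUM_ENVIRONMENTS = 5

word_patterns = reference_pattern_count(NUM_WORDS, NUM_ENVIRONMENTS)
phoneme_patterns = reference_pattern_count(NUM_PHONEMES, NUM_ENVIRONMENTS)
print(word_patterns, phoneme_patterns)  # 5000 200
```

The count only compares numbers of patterns. Word patterns are also longer than phoneme patterns, so the difference in stored data would be larger still under these assumptions.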
Another object of the present invention is to provide a speech recognition method which performs spotting in units of words in a first stage to obtain the speech section and the candidate words, and which in a second stage compares the candidate words with reference phoneme patterns, a plurality of which are prepared for respective characteristics of speech, so as to achieve more accurate speech recognition and thereby enhance the recognition rate.
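The two-stage idea can be sketched on toy data. Everything below is an illustrative assumption: the one-dimensional "feature vectors", the dynamic time warping (DTW) distance used as a stand-in for whatever matching measure the invention actually uses, the function names, the lexicon, and the threshold. Stage 1 slides whole-word templates over the input to obtain a speech section and candidate words. Stage 2 rebuilds each candidate from phoneme patterns prepared for several conditions and keeps the best match.

```python
def dtw(a, b):
    """Length-normalised DTW distance between two 1-D sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m] / (n + m)

def spot_words(frames, word_templates, threshold):
    """Stage 1: best-matching section per word; keep words under the threshold."""
    hits = []
    for word, template in word_templates.items():
        width = len(template)
        best = None
        for start in range(len(frames) - width + 1):
            dist = dtw(frames[start:start + width], template)
            if best is None or dist < best[3]:
                best = (word, start, start + width, dist)
        if best is not None and best[3] <= threshold:
            hits.append(best)
    return hits

def rescore(frames, hits, lexicon, phoneme_sets):
    """Stage 2: match each candidate's section against phoneme-built models."""
    scored = []
    for word, start, end, _ in hits:
        section = frames[start:end]
        best = min(
            dtw(section, [x for ph in lexicon[word] for x in patterns[ph]])
            for patterns in phoneme_sets.values()
        )
        scored.append((best, word))
    return min(scored)[1] if scored else None

# Hypothetical data: phoneme patterns for two recording conditions.
phoneme_sets = {
    "quiet": {"k": [1.0, 1.0], "a": [5.0, 5.0], "t": [9.0, 9.0], "o": [3.0, 3.0]},
    "noisy": {"k": [1.5, 1.5], "a": [5.5, 5.5], "t": [9.5, 9.5], "o": [3.5, 3.5]},
}
lexicon = {"ka": ["k", "a"], "ta": ["t", "a"], "ko": ["k", "o"]}
word_templates = {
    w: [x for ph in phs for x in phoneme_sets["quiet"][ph]] for w, phs in lexicon.items()
}

# Input: silence, a noisy utterance of "ka", silence.
frames = [0.0, 0.0, 1.6, 1.4, 5.5, 5.6, 0.0, 0.0]
hits = spot_words(frames, word_templates, threshold=1.0)
result = rescore(frames, hits, lexicon, phoneme_sets)
print(sorted(h[0] for h in hits), result)  # ['ka', 'ko'] ka
```

In this toy run, stage 1 discards "ta" and passes two candidates together with their sections, and stage 2 picks "ka" because the "noisy" phoneme patterns fit the input more closely than the "quiet" ones.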
Other objects and advantages of the invention will become apparent from the following discussion of the accompanying drawings.

REFERENCES:
patent: 4349700 (1982-09-01), Pirz et al.
patent: 4489433 (1984-12-01), Suehiro et al.
patent: 4736429 (1988-04-01), Niyada et al.
patent: 4985924 (1991-01-01), Matsuura
patent: 5033087 (1991-07-01), Bahl et al.
patent: 5131043 (1992-07-01), Fujii et al.
patent: 5133012 (1992-07-01), Nitta
patent: 5315689 (1994-05-01), Kanazawa et al.

Inventors: Kosaka Tetsuo; Sakurai Atsushi; Tamura Jun-ichi
Corporate Assignee: Canon Kabushiki Kaisha
Examiner: Dorvil Richemond
Law Firm: Fitzpatrick, Cella, Harper & Scinto
