Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1995-09-08
1998-10-13
Dorvil, Richemond
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704255, 704239, G10L 708
Patent
active
058227288
ABSTRACT:
The multistage word recognizer uses a word reference representation based on reliably detected peaks of phoneme similarity values. The word reference representation captures the basic features of the words by targets that describe the location and shape of stable peaks of phoneme similarity values. The first stage of the word hypothesizer represents each reference word with statistical information on the number of high similarity regions over a predefined number of time intervals. The second stage represents each word by a prototype that consists of a series of phoneme targets and global statistics, namely the average word duration and average match rate. These represent the degree of fit of the word prototype to its training data. Word recognition scores generated in the two stages are converted to dimensionless normalized values and combined by averaging for use in selecting the most probable word candidates.
REFERENCES:
patent: 3770892 (1973-11-01), Clapper
patent: 4481593 (1984-11-01), Bahler
patent: 4489434 (1984-12-01), Moshier
patent: 4489435 (1984-12-01), Moshier
patent: 4528688 (1985-07-01), Ichikawa et al.
patent: 4559602 (1985-12-01), Bates, Jr.
patent: 4718094 (1988-01-01), Bahl et al.
patent: 4723290 (1988-02-01), Watanabe et al.
patent: 4742547 (1988-05-01), Watanabe
patent: 4748670 (1988-05-01), Bahl et al.
patent: 4780906 (1988-10-01), Rajasekaran et al.
patent: 4803729 (1989-02-01), Baker
patent: 4820059 (1989-04-01), Miller et al.
patent: 4888823 (1989-12-01), Nitta et al.
patent: 4905287 (1990-02-01), Degawa
patent: 4907274 (1990-03-01), Nomura et al.
patent: 4908865 (1990-03-01), Doddington et al.
patent: 4924518 (1990-05-01), Ukita
patent: 4937871 (1990-06-01), Hattori
patent: 4987596 (1991-01-01), Ukita
patent: 5027408 (1991-06-01), Kroeker et al.
patent: 5129001 (1992-07-01), Bahl et al.
patent: 5131043 (1992-07-01), Fujii et al.
patent: 5133012 (1992-07-01), Nitta
patent: 5195167 (1993-03-01), Bahl et al.
patent: 5195168 (1993-03-01), Yong
patent: 5197113 (1993-03-01), Mumolo
patent: 5218668 (1993-06-01), Higgins et al.
patent: 5233681 (1993-08-01), Bahl et al.
patent: 5241619 (1993-08-01), Schwartz et al.
patent: 5268990 (1993-12-01), Cohen et al.
patent: 5309547 (1994-05-01), Niyada et al.
patent: 5345536 (1994-09-01), Hoshimi et al.
patent: 5349645 (1994-09-01), Zhao
patent: 5369727 (1994-11-01), Nomura et al.
patent: 5369728 (1994-11-01), Kosaka et al.
patent: 5390278 (1995-02-01), Gupta et al.
patent: 5528728 (1996-06-01), Matsuura et al.
Ronald Cole, Krist Roginski and Mark Fanty, "English Alphabet Recognition With Telephone Speech"; Eurospeech 91, Sep. 91, pp. 479-482 of 4 vol.
Climent Nadeu and Biing-Hwang Juang, "Filtering of Spectral Parameters for Speech Recognition", pp. S31-24.1-S31-24.3, Sep. 1994.
Cole, Fanty, Gopalakrishnan and Janssen, "Speaker-Independent Name Retrieval From Spellings Using a Database of 50,000 Names", pp. 325-328, 1991, ICASSP-91, May 1991.
Phillippe Morin, Jean-Claude Junqua, "Habitable Interaction in Goal-Oriented Multimodal Dialogue Systems", pp. 1669-1672, Speech Technology Lab. May 1994.
Hoshimi, Miyata, Kiroaka and Niyada, "Speaker Independent Speech Recognition Method Using Training Speech From a Small Number of Speakers", pp. I-469-I-472, ICASSP-92 Mar. 92.
Yifan Gong and Jean-Paul Haton, "Plausibility Functions in Continuous Speech Recognition: The VINICS System", pp. 187-195, 1993; Speech Communication Oct., 1993.
Applebaum Ted H.
Morin Philippe R.
Dorvil Richemond
Matsushita Electric - Industrial Co., Ltd.
LandOfFree
Multistage word recognizer based on reliably detected phoneme si does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Multistage word recognizer based on reliably detected phoneme si, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multistage word recognizer based on reliably detected phoneme si will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-326880