Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1997-06-26
1999-06-01
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704255, 704231, G10L 506
Patent
active
059096667
ABSTRACT:
A computerized speech recognition system creates acoustic models of phrases by concatenating acoustic models for individual words. The system stores an acoustic word model and spelling for each of its vocabulary words. When it receives the spelling of a multi-word phrase to be treated as a new vocabulary word, it stores that multi-word spelling as the spelling of the new vocabulary word, and a new acoustic model created by concatenating the acoustic word models of previous vocabulary words whose spellings correspond to words in the multi-word spelling as the acoustic model for the new word. The system can then perform speech recognition by comparing acoustic signals against the word models of stored vocabulary words, including those representing such multi-word phrases. Preferably when a multi-word model is formed, the individual acoustic models concatenated are modified to represent the coarticulation which takes place between words spoken continuously. This can be done by representing word models as sequences of individual phonemes, individual phonemes by phoneme-in-context models, and coarticulation by modifying the phoneme-in-context models of phonemes adjacent the boundary between concatenated words models to reflect the context of the phonemes on the other side of such word boundaries. The system can be a discrete word recognizer. The multi-word phrases can be selected by a user or be obtained from other programs running on the same computer as the speech recognizer, such as from one or more commands available in another program.
REFERENCES:
patent: 4297528 (1981-10-01), Beno
patent: 4336421 (1982-06-01), Welch et al.
patent: 4349700 (1982-09-01), Pirz et al.
patent: 4394538 (1983-07-01), Warren et al.
patent: 4439161 (1984-03-01), Wiggins et al.
patent: 4509133 (1985-04-01), Monbaron et al.
patent: 4624008 (1986-11-01), Vensko et al.
patent: 4651289 (1987-03-01), Maeda et al.
patent: 4677673 (1987-06-01), Ukita et al.
patent: 4720863 (1988-01-01), Li et al.
patent: 4731845 (1988-03-01), Matsuki et al.
patent: 4751737 (1988-06-01), Gerson et al.
patent: 4776016 (1988-10-01), Hansen
patent: 4783803 (1988-11-01), Baker et al.
patent: 4819271 (1989-04-01), Bahl et al.
patent: 4829575 (1989-05-01), Lloyd
patent: 4829576 (1989-05-01), Porter
patent: 4831653 (1989-05-01), Katayama
patent: 4833713 (1989-05-01), Muroi et al.
patent: 4837830 (1989-06-01), Wrench, Jr. et al.
patent: 4837831 (1989-06-01), Gillick et al.
patent: 4866778 (1989-09-01), Baker
patent: 4903305 (1990-02-01), Gillick et al.
patent: 4903306 (1990-02-01), Nakamura
patent: 4964077 (1990-10-01), Eisen et al.
patent: 4975959 (1990-12-01), Benbassat
patent: 4977599 (1990-12-01), Bahl et al.
patent: 4979213 (1990-12-01), Nitta
patent: 4994983 (1991-02-01), Landell et al.
patent: 5003603 (1991-03-01), Searcy et al.
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5036539 (1991-07-01), Wrench, Jr. et al.
patent: 5065431 (1991-11-01), Rollett
patent: 5097509 (1992-03-01), Lennig
patent: 5122972 (1992-06-01), Richards et al.
patent: 5123086 (1992-06-01), Tanaka et al.
patent: 5136654 (1992-08-01), Ganong, III et al.
patent: 5146503 (1992-09-01), Cameron et al.
patent: 5231670 (1993-07-01), Goldhor et al.
patent: 5377303 (1994-12-01), Firman
patent: 5384892 (1995-01-01), Strong
patent: 5386492 (1995-01-01), Wilson et al.
patent: 5386494 (1995-01-01), White
patent: 5390279 (1995-02-01), Strong
patent: 5425128 (1995-06-01), Morrison
patent: 5428707 (1995-06-01), Gould et al.
patent: 5465317 (1995-11-01), Epstein
patent: 5513289 (1996-04-01), Stanford et al.
patent: 5632002 (1997-05-01), Hashimoto et al.
patent: 5640490 (1997-06-01), Hansen et al.
patent: 5642519 (1997-06-01), Martin
patent: 5677991 (1997-10-01), Hsu et al.
Roszkiewicz, "Back Talk: Lip Service," A+ Magazine, pp. 60-61, date Feb. 1984.
Schmandt, "Augmenting a Window System with Speech Input," Computer, pp. 50-56, date Aug. 1990.
Tough, Carol, "The Design of an Intelligent Transparent Speech Interface," IEE Colloquium on "Systems and Applications of Man-Machine Interaction Using Speech I/O)" (Digest No. 066), pp. 2/1-4, date Mar. 1991.
Gliedman, "Turning Talk Into Action: Voice-Dictation and Voice-Command Systems," Computer Shopper, pp. 780-781, date Sep. 1994.
Lane, "Expert's Toolbox: Store-Bought Recognition Tools," Al Expert, pp. 11-12, date Oct. 1994.
Kurzweil Voice User's Guide, Release 1.0, Cover and Copyright pages and pp. 60-61, copyright 1994.
Gould Joel M.
McGrath Frank J.
Parke Joel W.
Roberts Jed M.
Squires Steven D.
Dragon Systems, Inc.
Hudspeth David R.
Porter Edward W.
Wieland Susan
LandOfFree
Speech recognition system which creates acoustic models by conca does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition system which creates acoustic models by conca, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition system which creates acoustic models by conca will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-962419