Attribute-based word modeling

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S255000, C704S240000, C704S244000, C704S257000

Reexamination Certificate

active

06963837

ABSTRACT:
An attribute-based speech recognition system is described. A speech pre-processor receives input speech and produces a sequence of acoustic observations representative of the input speech. A database of context-dependent acoustic models characterize a probability of a given sequence of sounds producing the sequence of acoustic observations. Each acoustic model includes phonetic attributes and suprasegmental non-phonetic attributes. A finite state language model characterizes a probability of a given sequence of words being spoken. A one-pass decoder compares the sequence of acoustic observations to the acoustic models and the language model, and outputs at least one word sequence representative of the input speech.

REFERENCES:
patent: 5689616 (1997-11-01), Li
patent: 5745649 (1998-04-01), Lubensky
patent: 5758023 (1998-05-01), Bordeaux
patent: 5805772 (1998-09-01), Chou et al.
patent: 5870709 (1999-02-01), Bernstein
patent: 5940794 (1999-08-01), Abe
patent: 5983180 (1999-11-01), Robinson
patent: 6567776 (2003-05-01), Chang et al.
Alleva, Fil, et al, “Improvements on the Pronunciation Prefix Tree Search Organization”, ICASSP 1996, pp. 133-136.
Anastasakos, Anastasios, et al, “Duration Modeling in Large Vocabulary Speech Recognition”,International Conference on Acoustics, Speech and Signal Processing,vol. 1, May 9, 1995, pp. 628-631.
Byrne, W., et al, “Pronunciation Modelling Using a Hand-Labelled Corpus for Conversational Speech Recognition”, IEEE, vol. Conf. 23, May 12, 1998, pp. 313-316.
Delmonte, R., “Linguistic Tools for Speech Recognition and Understanding”,Database Inspec On Line,Institute of Electrical Engineers, Stevenage, GB, Database accessin No. 4199465 XP002159657, abstract and Speech Recognition and Understanding Recent Advances, Trends and Applications,Proceedings of the NATO Advanced Study Institute,Cetraro, Italy, Jul. 1-13, 1990, pp. 481-485, Berlin, Germany, ISBN: 3-540-54032-6.
Erler, Kevin, et al, “HMM Representation of Quantized Articulatory Features for Recognition of Highly Confusible Words”,Proceedings of the International Conference on Acoustics, Speech and Signal Processing,USA, NY,IEEE,vol. Conf. 17, Mar. 23, 1992, pp. 545-548.
Finke, Michael, et al, “Flexible Transcription Alignment”,Proceedings. ASRU '97,Santa Barbara, USA, Dec. 1997.
Finke, Michael, et al, “Speaking Mode Dependent Pronunciation Modeling In Large Vocabulary Conversational Speech Recognition”,Proceedings of Eurospeech-97, Sep. 1997.
Fritsch, J., et al, “The Bucket Box Intersection (BBI) Algorithm for Fast Approximative Evaluation of Diagonal Mixture Gaussians”,IEEE International Conference on Acoustics, Speech and Signal Processing Conference Proceedings,1996, (Cat. No. 96CH35903), vol. 2, pp. 837-840.
Hwang, Jenq-Neng, et al, “Dynamic Frame-Skipping in Video Transcoding”,IEEE Second Workshop on Multimedia Signal Processing(Cat. No. 98EX175), Dec. 7-9, 1998, pp. 616-621.
Koo, Myoung-Wan, et al, “A New Decoder Based on a Generalized Confidence Score”,International Conference on Acoustics, Speech and Signal Processing,vol. 1, 1998.
Llorens, D., et al, “Acoustic and Syntactical Modeling in the Atros System”,IEEE, US,Mar. 15-19, 1999, pp. 641-644.
Mergel, D., et al, “Construction of Language Models for Spoken Database Queries”,IEEE International Conference on Acoustics, Speech&Signal Processing,vol. Conf. 12, Apr. 1, 1987, pp. 844-847.
Ney, H., et al, “Dynamic Programming Search for Continuous Speech Recognition”,IEEE Signal Processing Magazine,Sep. 1999, vol. 16, No. 5, pp. 64-83.
Ney, H., et al, “Improvements in Beam Search for 10000-Word Continuous Speech Recognition”,IEEE,Sep. 1992, pp. 9-12.
Ortmanns, Stefan, et al, “Look-Ahead Techniques for Fast Beam Search”,Proceedings of the ICASSP'97,Munich, (Germany), 1997, pp. 1783-1786.
Ostendorf, M., et al, “Modeling Systematic Variations in Pronunciation via a Language-Dependent Hidden Speaking Mode”, Proc. ICSLP, 1996, pp. 1-20.
Renals, S., et al, “Start-Synchronous Search for Large Vocabulary Continuous Speech Recognition”,IEEE Transactions on Speech and Audio Processing,USA, Sep. 1999, vol. 7, No. 5, pp. 542-553.
Suaudeau, Nelly, et al, “An Efficient Combination of Acoustic and Supra-Segmental Informations in a Speech Recognition System”,ICASSP-94, IEEE International Conference on Acoustics, Speech and Signal Processing,USA, vol. 1, Apr. 1994, pp. I/65-68.
Wagner, M., “Speaker Characteristics in Speech and Speaker Recognition”,Proceedings of IEEE Telcon '97, IEEE Region 10 Annual Conference, Speech and Image Technologies for Computing and Telecommunications,vol. 2, , pp. 626, abstract.
Wang, H., et al., “Complete Recognition of Continuous Mandarin Speech for Chinese Language With Very Large Vocabulary but Limited Training Data”,Proceedings of the International Conference on Acoustics, Speech and Signal Processing(ICASSP),IEEE,USA, May 9, 1995, pp. 61-64.
Finke, M., et al “Modeling and Efficient Decoding of Large Vocabulary Conversational Speech”,Eurospeech '99,vol. 1, Sep. 5-9, 1999, pp. 467-470, XP002168070, Budapest, Hungary.
EPO International Search Report dated Jun. 29, 2001 for PCT/IB00/01539.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Attribute-based word modeling does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Attribute-based word modeling, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Attribute-based word modeling will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3482469

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.