Patent
1994-07-07
1999-03-16
Dorvil, Richemond
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704207, 704 56, G10L 900
active
058842616
ABSTRACT:
Tone-sensitive acoustic models are generated by first generating acoustic vectors which represent the input data. The input data is separated into multiple frames and an acoustic vector is generated for each frame which represents the input data over its corresponding frame. A tone-sensitive parameter is then generated for each of the frames which indicates the tone of the input data at its corresponding frame. Tone-sensitive parameters are generated in accordance with two embodiments. First, a pitch detector may be used to calculate a pitch for each of the frames. If a pitch cannot be detected for a particular frame, then a pitch is created for that frame based on the pitch values of surrounding frames. Second, the cross covariance between the autocorrelation coefficients for each frame and its successive frame may be generated and used as the tone-sensitive parameter. Feature vectors are then created for each frame by appending the tone-sensitive parameter for a frame to the acoustic vector for the same frame. Then, using these feature vectors, acoustic models are created which represent the input data.
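The two tone-sensitive parameters described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not specify the pitch detector, how a missing pitch is derived from surrounding frames (linear interpolation is assumed here), the exact cross-covariance definition, or the frame sizes. All function names (`split_frames`, `autocorr`, `fill_missing_pitch`, `cross_covariance_param`, `append_tone`) are hypothetical.

```python
import numpy as np

def split_frames(signal, frame_len, hop):
    """Split a 1-D signal into overlapping frames, one row per frame."""
    n = 1 + (len(signal) - frame_len) // hop
    return np.stack([signal[i * hop : i * hop + frame_len] for i in range(n)])

def autocorr(frame, order):
    """Autocorrelation coefficients r[0..order] of a single frame."""
    return np.array([np.dot(frame[: len(frame) - k], frame[k:])
                     for k in range(order + 1)])

def fill_missing_pitch(pitch):
    """First embodiment (sketch): frames with no detected pitch are marked
    NaN and filled from surrounding frames. Linear interpolation is an
    assumption; the abstract only says surrounding pitch values are used."""
    pitch = np.asarray(pitch, dtype=float)
    voiced = ~np.isnan(pitch)
    if not voiced.any():
        return np.zeros_like(pitch)  # nothing to interpolate from
    idx = np.arange(len(pitch))
    return np.interp(idx, idx[voiced], pitch[voiced])

def cross_covariance_param(frames, order):
    """Second embodiment (sketch): cross covariance between the
    autocorrelation coefficients of each frame and its successor.
    The last frame is paired with itself (an assumption)."""
    r = np.stack([autocorr(f, order) for f in frames])
    r_next = np.vstack([r[1:], r[-1:]])
    dev = r - r.mean(axis=1, keepdims=True)
    dev_next = r_next - r_next.mean(axis=1, keepdims=True)
    return (dev * dev_next).mean(axis=1)

def append_tone(acoustic_vectors, tone_param):
    """Build feature vectors by appending one tone parameter per frame."""
    return np.hstack([acoustic_vectors, np.asarray(tone_param)[:, None]])

# Toy data: a short ramp stands in for real speech samples, and a zero
# matrix stands in for the per-frame acoustic vectors.
frames = split_frames(np.arange(10.0), frame_len=4, hop=2)
tone = cross_covariance_param(frames, order=2)
features = append_tone(np.zeros((len(frames), 3)), tone)
filled = fill_missing_pitch([100.0, np.nan, np.nan, 130.0])
```

Either parameter yields one scalar per frame, so the resulting feature vector is one element longer than the acoustic vector. Model training on these feature vectors is outside the scope of this sketch.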
REFERENCES:
patent: 4819271 (1989-04-01), Bahl et al.
Alex Waibel, et al., "Readings in Speech Recognition", pp. 308-319, 332-339 and 507-514, Morgan Publishers, Inc. 1990.
Lalit R. Bahl, et al., "A Maximum Likelihood Approach to Continuous Speech Recognition", IEEE, Mar. 1983.
Lalit R. Bahl, et al., "Speech Recognition With Continuous-Parameter Hidden Markov Models", IEEE, Sep./Dec. 1987.
Lalit R. Bahl, et al. "A Tree-Based Statistical Language Model for Natural Language Speech Recognition", IEEE, Jul. 1989.
Hsiao-Wuen Hon, et al., "CMU Robust Vocabulary-Independent Speech Recognition System", IEEE 1991, pp. 889-892.
L. R. Bahl, et al., "Acoustic Markov Models Used in the Tangora Speech Recognition System", IEEE 1988, pp. 497-500.
Chih-Heng Lin, et al., "A New Framework for Recognition of Mandarin Syllables With Tones Using Sub-syllabic Units", IEEE, Apr. 1993, pp. 227-230.
Lin-shan Lee, et al., "Golden Mandarin (II) - An Improved Single-Chip Real-Time Mandarin Dictation Machine for Chinese Language With Very Large Vocabulary", IEEE, Apr. 1993, pp. 503-506.
International Conference on Computer Processing of Chinese and Oriental Languages, vol. 5, No. 3-4, Aug. 1991, Taipei, TW, Lee, et al., "System Description of Golden Mandarin 1 Voice Input System For Unlimited Chinese Characters", pp. 314-326, Aug. 1991.
International Conference on Acoustics, Speech and Signal Processing, 1990, vol. 1, 3-6 Apr. 1990 Albuquerque, NM, Lee, et al., "A Real-time Mandarin Dictation Machine For Chinese Language With Unlimited Texts and Very Large Vocabulary", pp. 65-68, Apr. 1990.
International Conference on Speech, Image Processing and Neural Networks, 1994, vol. 2, 13-16 Apr. 1994 Hong Kong, HK, Tianying, et al., "A Method for Chinese Syllables Recognition Based Upon Sub-syllable Hidden Markov Model", pp. 730-733, Apr. 1994.
Proceedings Tencon '93, IEEE Region 10 Conference on Computer, Communication, Control and Power Engineering, IEEE Region 10 International Conference on Computers, Communications and Automation, vol. 3, 19-21 Oct. 1993 Beijing, CH, Chang, et al., "A Segment-based Speech Recognition System For Isolated Mandarin Syllables", pp. 317-320, Oct. 1993.
De Souza Peter V.
Fineberg Adam B.
Hon Hsiao-Wuen
Yuan Baosheng
Apple Computer Inc.
Dorvil Richemond