Preprocessing system for speech recognition

Electrical audio signal processing systems and devices – One-way audio signal program distribution – Public address system

Patent

Details

381/49, G10L 9/04, G10L 5/06

Patent

active

050540858

ABSTRACT:
The present invention processes an independent body of speech during an enrollment process and creates a set of speaker-specific enrollment parameters for normalizing analysis parameters, including the speaker's pitch, the frequency spectrum of the speech as a function of time, and certain time-domain measurements of the speech signal. A particular objective of the invention is to make these analysis parameters have the same meaning from speaker to speaker. Thus, after the pre-processing performed by this invention, the parameters for a given word look much the same regardless of who speaks it. In this manner, variations in the speech signal caused by the physical makeup of a speaker's throat, mouth, lips, teeth, and nasal cavity are reduced, at least in part, by the pre-processing.
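
The abstract does not say how the enrollment parameters are computed or applied, so the following is only a minimal sketch of the general idea, not the patented method. It assumes (hypothetically) that enrollment stores a speaker's mean and spread of pitch plus a mean energy per frequency band, and that normalization subtracts or scales by those statistics. The names `EnrollmentParams`, `enroll`, and `normalize_frame` are illustrative and do not come from the patent.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import List, Sequence, Tuple


@dataclass
class EnrollmentParams:
    """Hypothetical per-speaker statistics gathered during enrollment."""
    pitch_mean: float
    pitch_std: float
    band_means: List[float]


def enroll(pitch_hz: Sequence[float],
           band_energies_db: Sequence[Sequence[float]]) -> EnrollmentParams:
    """Estimate speaker-specific parameters from a body of enrollment speech.

    pitch_hz: pitch estimates (Hz) for the voiced frames.
    band_energies_db: one sequence of per-band energies (dB) per frame.
    """
    n_bands = len(band_energies_db[0])
    band_means = [mean(frame[b] for frame in band_energies_db)
                  for b in range(n_bands)]
    # Guard against a zero spread (e.g. a single enrollment frame).
    spread = pstdev(pitch_hz) or 1.0
    return EnrollmentParams(mean(pitch_hz), spread, band_means)


def normalize_frame(pitch_hz: float,
                    band_energies_db: Sequence[float],
                    params: EnrollmentParams) -> Tuple[float, List[float]]:
    """Express one analysis frame relative to the speaker's own statistics."""
    pitch_z = (pitch_hz - params.pitch_mean) / params.pitch_std
    bands = [e - m for e, m in zip(band_energies_db, params.band_means)]
    return pitch_z, bands


# Two made-up speakers producing the same relative contour at different
# absolute pitch and loudness levels.
low = enroll([100.0, 120.0, 140.0], [[10.0, 20.0], [14.0, 24.0]])
high = enroll([200.0, 240.0, 280.0], [[30.0, 40.0], [34.0, 44.0]])

frame_low = normalize_frame(140.0, [14.0, 24.0], low)
frame_high = normalize_frame(280.0, [34.0, 44.0], high)
```

In this toy case the two speakers' frames map to the same normalized values, which is the "same meaning from speaker to speaker" property the abstract describes. A real system would need far richer parameters than per-speaker means and spreads.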

REFERENCES:
patent: 3649765 (1972-03-01), Rabiner et al.
patent: 3679830 (1972-07-01), Uffelman et al.
patent: 3752929 (1973-08-01), Fletcher
patent: 4039754 (1977-08-01), Lokerson
patent: 4058676 (1977-11-01), Wilkes et al.
patent: 4394538 (1983-07-01), Warren et al.
patent: 4516259 (1985-05-01), Yato et al.
patent: 4718096 (1988-01-01), Meisel
Jaschul, "An Approach to Speaker Normalization for Automatic Speech Recognition", IEEE ICASSP 79, Apr. 1979.
Silverman et al., "A Parametrically Controlled Spectral Analysis System for Speech", IEEE Trans., ASSP-22, No. 5, Oct. 1974, pp. 362-381.
Gold et al., "Parallel Processing Techniques for Estimating Pitch Periods of Speech in the Time Domain", The Journal of the Acoustical Society of America, vol. 46, 1969.
Clapper, IBM Technical Disclosure, "Word Recognizers with Filters Automatically Adjusted to Speaker", vol. 13, No. 3, Aug. 1970.
Heisey et al., Journal of the Acoustical Society of America, Abstract: "A Time/Speaker Normalization Technique for Word Verification", vol. 63, No. 1, Spring 1978.
Jaschul, "Speaker Adaption by a Linear Transformation with Optimised Parameters", ICASSP 82, IEEE, vol. 3 of 3, May 1982.
Lea, "Prosodic Aids to Speech Recognition", Trends in Speech Recognition, Prentice Hall, 1980, pp. 166-205.
Baker, "A New Time-Domain Analysis of Human Speech and Other Complex Waveforms", Carnegie-Mellon Univ., 1975.
Zwicker et al., "Analytical Expressions for Critical-Band Rate and Critical Bandwidth as a Function of Frequency", Journal of the Acoustical Society of America, vol. 68, No. 5, pp. 1523-1525.
Burg, "A New Analysis Technique for Time Series Data", NATO Advanced Study Institute on Signal Processing with Emphasis on Underwater Acoustics, Aug. 12-13, 1968, pp. 42-48.
