Electrical audio signal processing systems and devices – One-way audio signal program distribution – Public address system
Patent
1990-11-19
1991-10-01
Harkcom, Gary V.
Electrical audio signal processing systems and devices
One-way audio signal program distribution
Public address system
381 49, G10L 904, G10L 506
Patent
active
050540858
ABSTRACT:
The present invention processes an independent body of speech during an enrollment process and creates a set of speaker specific enrollment parameters for normalizing analysis parameters including the speaker's pitch, the frequency spectrum of the speech as a function of time, and certain measurements of the speech signal in the time-domain. A particular objective of the invention is to make these analysis parameters have the same meaning from speaker to speaker. Thus after the pre-processing performed by this invention, the parameters would look much the same for the same word independent of speaker. In this manner, variations in the speech signal caused by the physical makeup of a speaker's throat, mouth, lips, teeth, and nasal cavity would be, at least in part, reduced by the pre-processing.
REFERENCES:
patent: 3649765 (1972-03-01), Rabiner et al.
patent: 3679830 (1972-07-01), Uffelman et al.
patent: 3752929 (1973-08-01), Fletcher
patent: 4039754 (1977-08-01), Lokerson
patent: 4058676 (1977-11-01), Wilkes et al.
patent: 4394538 (1983-07-01), Warren et al.
patent: 4516259 (1985-05-01), Yato et al.
patent: 4718096 (1988-01-01), Meisel
Jaschul, "An Approach to Speaker Normalization for Automatic Speech Recognition", IEEE ICASSP 79, Apr. 1979.
Silverman et al., "A Parametrically Controlled Spectral Analysis System for Speech", IEEE Trans., ASSP-22, No. 5, Oct. 1974, pp. 362-381.
Gold et al., "Parallel Processing Techniques for Estimating Pitch Periods of Speech in the Time Domain", The Journal of the Acoustical Society of America, vol. 46, 1969.
Clapper, IBM Technical Disclosure, "Word Recognizers with Filters Automatically Adjusted to Speaker", vol. 13, No. 3, Aug. 1970.
Heisey et al., Journal of the Acoustical Society of America, Abstract: "A Time/Speaker Normalization Technique for Word Verification", vol. 63, No. 1, Spring 1978.
Jaschul, "Speaker Adaption by a Linear Transformation with Optimised Parameters", ICASSP 82, IEEE, vol. 3 of 3, May 1982.
Lea, "Prosodic Aids to Speech Recognition", Trends in Speech Recognition, Prentice Hall, 1980, pp. 166-205.
Baker, "A New Time-Domain Analysis of Human Speech and Other Complex Waveforms", Canegie-Mellon Univ., 1975.
Zwicker et al., "Analytical Expressions for Critical-Band Rate and Critical Bandwidth as a Function of Frequency", Journal of the Acoustic Society of America, vol. 68, No. 5, pp. 1523-1525.
Bug, "A New Analysis Technique for Time Series Data", NATO Advanced Study Institute on Signal Processing with Emphasis on Underwater Acoustics, Aug. 12-13, 1968, pp. 42-48.
Meisel William S.
Wittenstein W. Andreas
Harkcom Gary V.
Knepper David D.
Speech Systems, Inc.
LandOfFree
Preprocessing system for speech recognition does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Preprocessing system for speech recognition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Preprocessing system for speech recognition will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1761307