Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1996-12-24
1999-03-30
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704256, 704203, 704204, 606204, G10L 302
Patent
active
058901116
ABSTRACT:
Injection noise and silence are detected in an input speech signal and an external amplifier is switched on or off based on the detected injection noise or silence. The input speech signal is digitized and a first copy of the digitized signal is preemphasized. After the input speech signal is preemphasized, a predetermined number of Mel-frequency cepstral coefficients (MFCCs) and difference cepstra are calculated for each window of the speech signal. A measure of signal energy and a measure of the rate of change of the signal energy is computed. A second copy of the digitized input speech signal is processed using amplitude summation or by differencing a center-clipped signal. The measures of signal energy, rate of change of the signal energy, the Mel coefficients, difference cepstra, and either the amplitude summation value or the differenced value are combined to form an observation vector. Hidden Markov Model (HMM) based decoding is used on the observation vector to detect the occurrence of injection noise or silence. A gain switch on an external speech amplifier is turned on after an occurrence of injection noise and remains on for the duration of speech and the amplifier is turned off when an occurrence of silence is detected.
REFERENCES:
patent: 4308861 (1982-01-01), Kelly
patent: 4439872 (1984-04-01), Henley-Cohn et al.
patent: 4489440 (1984-12-01), Chaoui
patent: 4502150 (1985-02-01), Katz et al.
patent: 4589136 (1986-05-01), Poldy et al.
patent: 4627095 (1986-12-01), Thompson
patent: 4669643 (1987-06-01), Ley
patent: 4718099 (1988-01-01), Hotvet
patent: 4736432 (1988-04-01), Cantrell
patent: 4837832 (1989-06-01), Fanshel
patent: 4862506 (1989-08-01), Landgarten et al.
patent: 4896358 (1990-01-01), Bahler et al.
patent: 5123922 (1992-06-01), Berg
patent: 5157653 (1992-10-01), Genter
patent: 5326349 (1994-07-01), Baraff
patent: 5359663 (1994-10-01), Katz
patent: 5511009 (1996-04-01), Pastor
patent: 5649055 (1997-07-01), Gupta et al.
patent: 5684921 (1997-11-01), Bayya et al.
patent: 5706392 (1998-01-01), Goldberg et al.
Thomas Parsons; Voice and Speech Processing; McGraw-Hill; p. 73, Jan. 1987.
Article by Frederick L. Jelinek, entitled "Continuous Speech Recognition by Statistical Methods" published Apr. 4, 1976, pp. 532-558, Proceedings of the IEEE, vol. 64, No. 4.
Galler Michael
Javkin Hector Raul
Niedzielski Nancy
Azad Abul K.
Hudspeth David R.
Technology Research Association of Medical & Welfare Apparatus
LandOfFree
Enhancement of esophageal speech by injection noise rejection does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Enhancement of esophageal speech by injection noise rejection, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Enhancement of esophageal speech by injection noise rejection will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1225242