Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2000-06-02
2003-12-02
Dorvil, Richemond (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S226000
Reexamination Certificate
active
06658380
ABSTRACT:
BACKGROUND OF THE INVENTION
The present invention relates to digital speech signal processing techniques. It relates more particularly to techniques which detect vocal activity to perform different processing according to whether the signal is supporting vocal activity or not.
The digital techniques in question relate to various domains: coding of speech for transmission or storage, speech recognition, noise reduction, echo cancellation, etc.
The main difficulty with vocal activity detection methods is distinguishing vocal activity from the accompanying noise. A conventional noise suppression technique cannot solve this problem because these techniques themselves use estimates of the noise which depend on the degree of vocal activity of the signal.
A main object of the present invention is to make vocal activity detection methods more robust to noise.
SUMMARY OF THE INVENTION
The invention therefore proposes a method of detecting vocal activity in a digital speech signal processed by successive frames, in which method the speech signal is subjected to noise suppression taking account of estimates of the noise included in the signal, updated for each frame in a manner dependent on at least one degree of vocal activity determined for said frame. According to the invention, a priori noise suppression is applied to the speech signal of each frame on the basis of estimates of the noise obtained on processing at least one preceding frame, and the energy variations of the a priori noise-suppressed signal are analyzed to detect the degree of vocal activity of said frame.
Detecting vocal activity (as a general rule by any method known in the art) on the basis of a noise-suppressed signal a priori significantly improves the performance of detection if the level of surrounding noise is relatively high.
In the remainder of the present description, the vocal activity detection method of the invention is illustrated within a system for eliminating noise from a speech signal. Clearly the method can find applications in many other types of digital speech processing requiring information on the degree of vocal activity of the processed signal: coding, recognition, echo cancellation, etc.
REFERENCES:
patent: 3840708 (1974-10-01), Clark
patent: 4277645 (1981-07-01), May, Jr.
patent: 4281218 (1981-07-01), Chuang et al.
patent: 5212764 (1993-05-01), Ariyoshi
patent: 5228088 (1993-07-01), Kane et al.
patent: 5469087 (1995-11-01), Eatwell
patent: 5555190 (1996-09-01), Derby et al.
patent: 5657422 (1997-08-01), Janiszewski et al.
patent: 5659622 (1997-08-01), Ashley
patent: 5732390 (1998-03-01), Katayanagi et al.
patent: 5742927 (1998-04-01), Crozier et al.
patent: 5839101 (1998-11-01), Vahatalo et al.
patent: 5890108 (1999-03-01), Yeldener
patent: 40 12 349 (1990-10-01), None
patent: 0 438 174 (1991-07-01), None
Cavallaro et al., “A fuzzy logic-based speech detection algorithm for communications in noisy environments,” Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing, May 12-15, 1998, vol. 1, pp. 565 to 568.*
Nishiguchi Masayuki et al., <<Voice Signal Transmitter-Receiver>>, Sony Corp., Mar. 1995, vol. 095, No. 006, Abstract.
R Le Bouquin et al., <<Enhancement of Noisy Speech Signals: Application to Mobile Radio Communications>>, Speech Communication, Jan. 1996, vol. 18, No. 1, pp. 3-19.
S Nandkumar et al., <<Speech Enhancement Based on a New Set of Auditaury Constrained Parameters>>, Proceedings of the International Conference on Acoustics, Speech, Signal Processing, ICASSP 1994, Apr. 1994, vol. 1, pp. 1-4.
P Lockwood et al., <<Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the Projection, for Robust Speech Recognition in Cars>>, Speech Communication, Jun. 1992, vol. 11, No. 2/3, pp. 215-228.
Lockwood Philip
Lubiarz Stéphane
Dorvil Richemond
Lerner Martin
Matra Nortel Communications
Trop Pruner & Hu P.C.
LandOfFree
Method for detecting speech activity does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for detecting speech activity, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for detecting speech activity will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3148488