Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-01-30
2007-01-30
Dorvil, Richemond (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S208000, C704S214000, C704S205000
Reexamination Certificate
active
09813525
ABSTRACT:
A voice activity detector (100) filters (204) out noise energy and then computes a high-frequency (2400 Hz to 4000 Hz) versus low-frequency (100 Hz to 2400 Hz) signal energy ratio (224), total voiceband (100 Hz to 4000 Hz) signal energy (214), and signal periodicity (208) on successive frames of signal samples. Signal periodicity is determined by estimating the pitch period (206) of the signal, determining a gain value of the signal over the pitch period as a function of the estimated pitch period, and estimating a periodicity of the signal over the pitch period as a function of the estimated pitch period and the gain value. Voice is detected (230–232) in a segment if either (a) the difference between the average high-frequency versus low-frequency signal energy ratio and the present segment's high-frequency versus low-frequency energy ratio either exceeds (310) a high threshold value or is exceeded (312) by a low threshold value, or (b) the average periodicity of the signal is lower (306) than a low threshold value, or (c) the difference between the average total signal energy and the present segment's total energy exceeds (304) a threshold value and the average periodicity of the signal is lower (304) than a high threshold value, or (d) the average total signal energy exceeds (412) a minimum average total signal energy by a threshold value and voice has been detected (410) in the preceding segment.
REFERENCES:
patent: 4074069 (1978-02-01), Tokura et al.
patent: 6275794 (2001-08-01), Benyassine et al.
patent: 6453291 (2002-09-01), Ashley
patent: 6456964 (2002-09-01), Manjunath et al.
patent: 6504838 (2003-01-01), Kwan
patent: 6687668 (2004-02-01), Kim et al.
International Telecommunication Union, G.727, A Silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70, Annex B (Nov. 1996), pp. Title-16.
K. El-Maleh et al., “Comparison Of Voice Activity Detection Algorithms For Wireless Personal Communications Systems”,Proc. IEEE Canadian Conference on Electrical and Computer Engineering(St. John's, Nfld.), May 1997 pp. 470-473.
D.K. Freeman et al., “The Voice Activity Detector For The Pan-European Digital Cellular Mobile Telephone Service”, British Telecom Research Laboratories, 1989 IEEE, CH2673-2/89/0000-0369, pp. 369-372.
K. Srinivasan et al., “Voice Activity Detection For Cellular Networks”, Center For Information Processing Research, pp. 85-86.
International Telecommunication Union, G.729, A Silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70, Annex B (Nov. 1996), pp. Title-16.
Nikos Doukas et al., “Voice Activity Detection Using Source Separation Techniques”, Signal Processing Section, Dept. of Electrical Engineering, Imperial College, UK, four (4) pages.
R. Tucker, “Voice activity detection using a periodicity measure”IEE Proceedings-I,vol. 139, No. 4, Aug. 1992, pp. 377-380.
M. Rangoussi et al. “Higher Order Statistics Based Gaussianity Test Applied To On-Line Speech Procesing [sic]”, 1995 IEEE 1058-6393/95, pp. 303-307.
L.R. Rabiner and R.W. Schafer, “Digital Processing of Speech Signals”, pp. 149-150.
R. Steele, “Analysis-By-Synthesis Predictive Coding”, pp. 244-253.
L.A. Tucker, et al. “Frequency-Domain Post-Filtering Voice-Activity Detector”, U.S. Appl. No. 09/770,922, filed Jan. 26, 2001.
Avaya Technology Corp.
Dorvil Richemond
Vo Huyen X.
Volejnicek David
LandOfFree
Voice-activity detection using energy ratios and periodicity does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Voice-activity detection using energy ratios and periodicity, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Voice-activity detection using energy ratios and periodicity will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3790351