Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2006-12-05
2006-12-05
Chawan, Vijay B. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S247000, C381S094300, C381S056000, C381S110000, C379S406040
Reexamination Certificate
active
07146315
ABSTRACT:
A multichannel source activity detection system, e.g., a voice activity detection (VAD) system, and method that exploits spatial localization of a target audio source is provided. The method includes the steps of receiving a mixed sound signal by at least two microphones; Fast Fourier transforming each received mixed sound signal into the frequency domain; filtering the transformed signals to output a signal corresponding to a spatial signature of a source; summing an absolute value squared of the filtered signal over a predetermined range of frequencies; and comparing the sum to a threshold to determine if a voice is present. Additionally, the filtering step includes multiplying the transformed signals by an inverse of a noise spectral power matrix, a vector of channel transfer function ratios, and a source signal spectral power.
REFERENCES:
patent: 5012519 (1991-04-01), Adlersberg et al.
patent: 5276765 (1994-01-01), Freeman et al.
patent: 5550924 (1996-08-01), Helf et al.
patent: 5563944 (1996-10-01), Hasegawa
patent: 5839101 (1998-11-01), Vahatalo et al.
patent: 6011853 (2000-01-01), Koski et al.
patent: 6070140 (2000-05-01), Tran
patent: 6088668 (2000-07-01), Zack
patent: 6097820 (2000-08-01), Turner
patent: 6141426 (2000-10-01), Stobba et al.
patent: 6363345 (2002-03-01), Marash et al.
patent: 6377637 (2002-04-01), Berdugo
patent: 2003/0004720 (2003-01-01), Garudadri et al.
patent: 1081985 (2001-07-01), None
Rosca et al.: “Multichannel voice detection in adverse environments” XI European Signal Processing Conference EUSIPCO Sep. 2, 2002, XP008025382.
Aalburg et al.: “Single-and two-channel noise reduction for robust speech recognition in car” ISCA Workshop Multi-Modal Dialogue in Mobile Environments Jun. 2002 XP002264041.
Balan R et al.: “Microphone array speech enhancement by Bayesian estimation of spectral amplitude and phase” Aug. 2002 pp. 209-213, XP010635740.
Philippe Renevey et al.: “Entropy Based Voice Activity Detection in very noisy conditions” Eurospeech 2001 Proceedings vol. 3, Sep. 2001 pp. 1887-1890 XP007004739.
Srinivasan K et al.: “Voice activity detection for cellular networks” Proceedings of the IEEE Workshop on Speech Coding for Telecommunications Oct. 1993 pp. 85-86 XP002204645.
International Search Report.
Balan Radu Victor
Beaugeant Christophe
Rosca Justinian
Chawan Vijay B.
F. Chau & Associates LLC.
Paschburg Donald B.
Siemens Corporate Research Inc.
LandOfFree
Multichannel voice detection in adverse environments does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Multichannel voice detection in adverse environments, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multichannel voice detection in adverse environments will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3664404