Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2011-04-12
2011-11-01
Albertalli, Brian (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S206000, C704S253000, C381S110000
Reexamination Certificate
active
08050916
ABSTRACT:
A signal classifying method and apparatus are disclosed. The signal classifying method includes: obtaining a spectrum fluctuation parameter of a current signal frame determined as a foreground frame, and buffering the spectrum fluctuation parameter; obtaining a spectrum fluctuation variance of the current signal frame according to spectrum fluctuation parameters of all buffered signal frames, and buffering the spectrum fluctuation variance; and calculating a ratio of signal frames whose spectrum fluctuation variance is above or equal to a first threshold to all the buffered signal frames, and determining the current signal frame as a speech frame if the ratio is above or equal to a second threshold or determining the current signal frame as a music frame if the ratio is below the second threshold. In the embodiments of the present invention, the spectrum fluctuation variance of the signal is used as a parameter for classifying the signals, and a local statistical method is applied to decide the type of the signal. Therefore, the signals are classified with few parameters, simple logical relations and low complexity.
REFERENCES:
patent: 5712953 (1998-01-01), Langs
patent: 5732392 (1998-03-01), Mizuno et al.
patent: 6570991 (2003-05-01), Scheirer et al.
patent: 6785645 (2004-08-01), Khalil et al.
patent: 7179980 (2007-02-01), Kirkeby et al.
patent: 7328149 (2008-02-01), Jiang et al.
patent: 7346516 (2008-03-01), Sall et al.
patent: 7809560 (2010-10-01), Yen et al.
patent: 7844452 (2010-11-01), Takeuchi et al.
patent: 7858868 (2010-12-01), Kemp et al.
patent: 7864967 (2011-01-01), Takeuchi et al.
patent: 2002/0172372 (2002-11-01), Tagawa et al.
patent: 2003/0101050 (2003-05-01), Khalil et al.
patent: 2007/0136053 (2007-06-01), Ebenezer
patent: 2008/0082323 (2008-04-01), Bai et al.
patent: 1354455 (2002-06-01), None
patent: 1815550 (2006-08-01), None
patent: 101256772 (2008-09-01), None
patent: 0764937 (1997-03-01), None
patent: 1244093 (2002-03-01), None
patent: 2007106384 (2007-09-01), None
patent: 2008106852 (2008-09-01), None
Foreign communication from a counterpart application, PCT application PCT/CN2010/076499, International Search Report and Written Opinion dated Oct. 15, 2009.
Foreign communication from a counterpart application, PCT application PCT/CN2010/076499, Partial English Translation Written Opinion dated Oct. 15, 2009.
Jia, Lan-Ian, “A Fast and Robust Speech/Music Discrimination Approach,” Information and Electronic Egineering, vol. 6, No. 4, Aug. 2008.
Foreign communication from a counterpart application, Chinese application CN200910110798.4, Partial English Translation Office Action dated Jul. 8, 2011, 1 page.
Foreign communication from a counterpart application, European application 10790605.9, Extended European Search Report dated Aug. 18, 2011, 9 pages.
Foreign communication from a counterpart application, Chinese application CN200910110798.4, Office Action dated Jul. 8, 2011, 3 pages.
Huang, et al., “Advances in Unsupervised Audio Classification and Segmentation for the Broadcast News and NGSW Corpora”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, No. 3, May 1, 2006, pp. 907-919.
Wang Zhe, Proposed Text for Draft new ITU-T Recommendation G.GSAD a Generic Sound Activity Detectora; C 348:, ITU-T Drafts; Study Period 2009-2012, Oct. 18, 2009, pp. 1-14.
“3rd Generation Partnership Project; Technical Specification Services and System Aspects; Mandatory Speech Codec Speech Processing Functions; Adaptive Multi-Rate (AMR) Speech Codec; Voice Activity Detector (VAD)” (Release 8), Dec. 2008, 25 pgs.
Itut, “Series G: Transmission Systems and Media, Digital Systems and Networks, Digital Terminal Equipments-Coding of Voice and Audio Signals, Generic Sound Activity Detector; (GSAD)”, G720.1, Jan. 2010, 26 pages.
Liu Yuanyuan
Shlomot Eyal
Wang Zhe
Albertalli Brian
Conley & Rose, P.C.
Huawei Technologies Co. Ltd.
Rodolph Grant
LandOfFree
Signal classifying method and apparatus does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Signal classifying method and apparatus, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Signal classifying method and apparatus will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4294750