Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2007-07-24
2007-07-24
Opsasnick, Michael N. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
Reexamination Certificate
active
11276419
ABSTRACT:
A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.
REFERENCES:
patent: 4559602 (1985-12-01), Bates, Jr.
patent: 4933973 (1990-06-01), Porter
patent: 5152007 (1992-09-01), Uribe
patent: 5307441 (1994-04-01), Tzeng
patent: 5473727 (1995-12-01), Nishiguchi et al.
patent: 5596680 (1997-01-01), Chow et al.
patent: 5630012 (1997-05-01), Nishiguchi et al.
patent: 5664052 (1997-09-01), Nishiguchi et al.
patent: 5809455 (1998-09-01), Nishiguchi et al.
patent: 5828996 (1998-10-01), Iijima et al.
patent: 5848347 (1998-12-01), Kuo et al.
patent: 5878388 (1999-03-01), Nishiguchi et al.
patent: 5911128 (1999-06-01), DeJaco
patent: 5960388 (1999-09-01), Nishiguchi et al.
patent: 6054646 (2000-04-01), Pal et al.
patent: 6078880 (2000-06-01), Zinser, Jr. et al.
patent: 6456964 (2002-09-01), Manjunath et al.
patent: 6493665 (2002-12-01), Su et al.
patent: 6507814 (2003-01-01), Gao
patent: 6694293 (2004-02-01), Benyassine et al.
“Acoustic Segmentation for Audio Browsers” Proc. Interface Conference Sydney Australia Jul. 1996.
“Real-Time Discrimination of Broadcast Speech/Music” Sanders A Lockheed Martin Co. Nashua NH 1996 IEEE pp. 993-996.
“Speaker Recognition: A Tutorial” Proceedings of the IEEE vol. 85 No. 9 Sep. 1997 pp. 1437-1462.
“Real-time Discimination of Broadcast Speech/Music”JASSP 1996 pp. 993-996.
“Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator” 1997 IEEE pp. 1331-1334.
“Heuristic Approach for Generic Audi Data Segmentation and Annotation” ACM Multimedia Conference Orland FL Nov. 1999 pp. 67-76.
Jiang Hao
Zhang Hong-Jiang
LandOfFree
Classification of audio as speech or non-speech using... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Classification of audio as speech or non-speech using..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Classification of audio as speech or non-speech using... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3742378