Reexamination Certificate
2005-05-31
Ometz, David L. (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S208000, C704S233000
active
06901362
ABSTRACT:
A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.
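The frame, feature, and rule pipeline described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names, the frame length, the log-spectrum definition of spectrum flux, the two thresholds, and the toy rule that maps high flux to "speech" and low flux to "music". Only two features (frame energy and spectrum flux) are shown; line spectrum pairs, noise frame ratio, and band periodicity are omitted.

```python
import cmath
import math


def split_frames(samples, frame_len):
    """Split samples into non-overlapping frames, dropping any remainder."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def spectrum_flux(spectra):
    """Average squared log-spectrum difference between adjacent frames.

    This particular definition is an assumption made for the sketch.
    """
    if len(spectra) < 2:
        return 0.0
    eps = 1e-10  # avoids log(0) for empty bins
    total = 0.0
    for prev, cur in zip(spectra, spectra[1:]):
        total += sum((math.log(c + eps) - math.log(p + eps)) ** 2
                     for p, c in zip(prev, cur)) / len(cur)
    return total / (len(spectra) - 1)


def classify_portion(samples, frame_len=64,
                     silence_energy=1e-4, flux_threshold=1.0):
    """Toy rule set; both thresholds are hypothetical, untuned values."""
    frames = split_frames(samples, frame_len)
    if not frames:
        raise ValueError("portion is shorter than one frame")
    mean_energy = sum(frame_energy(f) for f in frames) / len(frames)
    if mean_energy < silence_energy:
        return "silence"
    flux = spectrum_flux([magnitude_spectrum(f) for f in frames])
    # Illustrative rule only: rapidly changing spectra are labeled speech.
    return "speech" if flux > flux_threshold else "music"
```

Calling `classify_portion(samples)` on a list of floating-point samples returns one label for the whole portion. A practical system would combine more features with tuned rules, and would replace the naive DFT with an FFT for realistic frame sizes.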
REFERENCES:
patent: 4559602 (1985-12-01), Bates, Jr.
patent: 5473727 (1995-12-01), Nishiguchi et al.
patent: 5630012 (1997-05-01), Nishiguchi et al.
patent: 5664052 (1997-09-01), Nishiguchi et al.
patent: 5809455 (1998-09-01), Nishiguchi et al.
patent: 6054646 (2000-04-01), Pal et al.
patent: 6493665 (2002-12-01), Su et al.
patent: 6507814 (2003-01-01), Gao
patent: 6694293 (2004-02-01), Benyassine et al.
Scheirer et al., “Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator”, 1997, IEEE, pp. 1331-1334.*
Saunders, “Real-time Discrimination of Broadcast Speech/Music”, JASSP, 1996, pp. 993-996.*
Tong Zhang and C.-C. Jay Kuo, “Heuristic Approach for Generic Audio Data Segmentation and Annotation,” ACM Multimedia Conference, Orlando, Florida, Nov. 1999, pp. 67-76.
Don Kimber and Lynn Wilcox, “Acoustic Segmentation for Audio Browsers,” Proc. Interface Conference, Sydney, Australia, Jul. 1996.
Joseph P. Campbell, Jr., “Speaker Recognition: A Tutorial,” Proceedings of the IEEE, vol. 85, No. 9, Sep. 1997, pp. 1437-1462.
John Saunders, “Real-Time Discrimination of Broadcast Speech/Music,” Sanders, A Lockheed Martin Co., Nashua, NH, 1996 IEEE, pp. 993-996.
Jiang Hao
Zhang Hongjiang
Ometz David L.
Opsasnick Michael N