Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2006-07-18
2006-07-18
Dorvil, Richemond (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S228000
Reexamination Certificate
active
07080008
ABSTRACT:
A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.
REFERENCES:
patent: 4559602 (1985-12-01), Bates, Jr.
patent: 5307441 (1994-04-01), Tzeng
patent: 5473727 (1995-12-01), Nishiguchi et al.
patent: 5630012 (1997-05-01), Nishiguchi et al.
patent: 5664052 (1997-09-01), Nishiguchi et al.
patent: 5809455 (1998-09-01), Nishiguchi et al.
patent: 5848347 (1998-12-01), Kuo et al.
patent: 5878388 (1999-03-01), Nishiguchi et al.
patent: 5960388 (1999-09-01), Nishiguchi et al.
patent: 6054646 (2000-04-01), Pal et al.
patent: 6078880 (2000-06-01), Zinser et al.
patent: 6456964 (2002-09-01), Manjunath et al.
patent: 6493665 (2002-12-01), Su et al.
patent: 6507814 (2003-01-01), Gao
patent: 6694293 (2004-02-01), Benyassine et al.
Tong Zhang and C.-C. Jay Kuo, “Heuristic Approach for Generic Audio Data Segmentation and Annotation,” ACM Multimedia Conference, Orlando, Florida, Nov., 1999, pp. 67-76.
Don Kimber and Lynn Wilcox, “Acoustic Segmentation for Audio Browsers,” Proc. Interface Confernce, Sydney, Australia, Jul. 1996.
Joseph P. Campbell, Jr., “Speaker Recognition: A Tutorial,” Proceedings of the IEEE, vol. 85, No. 9, Sep. 1997, pp. 1437-1462.
John Saunders, “Real-Time Discrimination of Broadcast Speech/Music,” Sanders, A Lockheed Martin Co., Nashua, NH, 1996 IEEE, pp. 993-996.
Scheirer et al, “Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator”, 1997, IEEE, pp. 1331-1334.
Saunders, “Real-time Discrimination of Broadcast Speech/Music”, 1996, pp. 993-996.
Jiang Hao
Zhang Hong-Jiang
Dorvil Richemond
Lee & Hayes PLLC
Microsoft Corporation
Opsasnick Michael N.
LandOfFree
Audio segmentation and classification using threshold values does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio segmentation and classification using threshold values, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio segmentation and classification using threshold values will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3554132