Audio segmentation and classification using threshold values

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S228000

Reexamination Certificate

active

07080008

ABSTRACT:
A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.

REFERENCES:
patent: 4559602 (1985-12-01), Bates, Jr.
patent: 5307441 (1994-04-01), Tzeng
patent: 5473727 (1995-12-01), Nishiguchi et al.
patent: 5630012 (1997-05-01), Nishiguchi et al.
patent: 5664052 (1997-09-01), Nishiguchi et al.
patent: 5809455 (1998-09-01), Nishiguchi et al.
patent: 5848347 (1998-12-01), Kuo et al.
patent: 5878388 (1999-03-01), Nishiguchi et al.
patent: 5960388 (1999-09-01), Nishiguchi et al.
patent: 6054646 (2000-04-01), Pal et al.
patent: 6078880 (2000-06-01), Zinser et al.
patent: 6456964 (2002-09-01), Manjunath et al.
patent: 6493665 (2002-12-01), Su et al.
patent: 6507814 (2003-01-01), Gao
patent: 6694293 (2004-02-01), Benyassine et al.
Tong Zhang and C.-C. Jay Kuo, “Heuristic Approach for Generic Audio Data Segmentation and Annotation,” ACM Multimedia Conference, Orlando, Florida, Nov., 1999, pp. 67-76.
Don Kimber and Lynn Wilcox, “Acoustic Segmentation for Audio Browsers,” Proc. Interface Confernce, Sydney, Australia, Jul. 1996.
Joseph P. Campbell, Jr., “Speaker Recognition: A Tutorial,” Proceedings of the IEEE, vol. 85, No. 9, Sep. 1997, pp. 1437-1462.
John Saunders, “Real-Time Discrimination of Broadcast Speech/Music,” Sanders, A Lockheed Martin Co., Nashua, NH, 1996 IEEE, pp. 993-996.
Scheirer et al, “Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator”, 1997, IEEE, pp. 1331-1334.
Saunders, “Real-time Discrimination of Broadcast Speech/Music”, 1996, pp. 993-996.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Audio segmentation and classification using threshold values does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Audio segmentation and classification using threshold values, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio segmentation and classification using threshold values will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3554132

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.