Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2006-04-25
2006-04-25
Chawan, Vijay (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S208000, C704S233000
Reexamination Certificate
active
07035793
ABSTRACT:
A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.
REFERENCES:
patent: 4559602 (1985-12-01), Bates, Jr.
patent: 5473727 (1995-12-01), Nishiguchi et al.
patent: 5630012 (1997-05-01), Nishiguchi et al.
patent: 5664052 (1997-09-01), Nishiguchi et al.
patent: 5809455 (1998-09-01), Nishiguchi et al.
patent: 6054646 (2000-04-01), Pal et al.
patent: 6493665 (2002-12-01), Su et al.
patent: 6507814 (2003-01-01), Gao
patent: 6694293 (2004-02-01), Benyassine et al.
Don Kimber and Lynn Wilcox, “Acoustic Segmentation for Audio Browsers,” Proc. Interface Conference, Sydney, Australia, Jul. 1996.
John Saunders, “Real-Time Discrimination of Broadcast Speech/Music,” Sanders, A Lockheed Martin Co., Nashua NH, 1966 IEEE, pp. 993-996.
Joseph P. Campbell, Jr., “Speaker Recognition: A Tutorial,” Proceedings of the IEEE, vol. 85, No. 9, Sep. 1997, pp. 1437-1462.
Saunders, “Real-time Discrimination of Broadcast Speech/Music,”JASSP 1996, pp. 993-996.
Scheirer et al., “Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator,” 1997 IEEE, pp. 1331-1334.
Zhang, Tong and Kuo, C.-C. Jay, “Heuristic Approach for Generic Audi Data Segmentation and Annotation,” ACM Multimedia Conference, Orlando FL, Nov. 1999, pp. 67-76.
Jiang Hao
Zhang Hongjiang
Chawan Vijay
Lee & Hayes PLLC
Microsoft Corporation
Opsasnick Michael N
LandOfFree
Audio segmentation and classification does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio segmentation and classification, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio segmentation and classification will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3596686