Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2007-07-10
2007-07-10
{hacek over (S)}mits, Talivaldis Ivars (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S208000
Reexamination Certificate
active
10279720
ABSTRACT:
A method (200) and apparatus (100) for segmenting a sequence of audio samples into homogeneous segments (550and555) are disclosed. The method (200) forms a sequence of frames (701to704) along the sequence of audio samples, and extracts, for each frame, a data feature. The data features form a sequence of data features. Transition points in the sequence of data features are thin detected by applying the Bayesian Information Criterion to the sequence of data features. The transition points define the homogeneous segments (550and555). Preferably the data feature is single-dimensional and a leptokurtic distribution is used as an event model in the Bayesian Information Criterion.
REFERENCES:
patent: 6140874 (2000-10-01), French et al.
patent: 6424946 (2002-07-01), Tritschler et al.
patent: 7006568 (2006-02-01), Gu et al.
patent: 2003/0231775 (2003-12-01), Wark
Tritschler et al. “Improved speaker segmentation and segments clustering using the Bayesian Information Criterion,” in Proc. EUROSPEECH, Budapest, Hungary, 1999, vol. 2, pp. 679-682.
Sivakumaran, et al. “On the use of the Bayesian Information Criterion in multiple speaker detection,” in Proc. EUROSPEECH, Aalborg, Denmark, 2001, vol. 2, pp. 795-798.
Zhang et al. “Statistical modelling of speech signals,” Proceedings of the Sixth International Conference on Signal Processing ICSP 2002, Beijing, China, vol. 1, pp. 480-483, Aug. 2002.
Matthew Harris, et al., “A Study Of Broadcast News Audio Stream Segmentation And Segment Clustering”, Philips Research Laboratories.
Bowen Zhou, et al., “Unsupervised Audio Stream Segmentation And Clustering Via The Bayesian Information Criterion”, Robust Speech Processing Laboratory, The Center for Spoken Language Research, University of Colorado at Boulder.
Scott Shaobing Chen, et al., “Speaker, Environment And Channel Change Detection And Clustering Via The Bayesian Information Criterion”, IBM T.J. Watson Research Center.
Javier Ferreiros, et al., “Acoustic Change Detection And Clustering On Broadcast News”, International Computer Science Institute, pp. 1-22 (Mar. 2000).
Ng Eunice
{hacek over (S)}mits Talivaldis Ivars
LandOfFree
Audio segmentation with energy-weighted bandwidth bias does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio segmentation with energy-weighted bandwidth bias, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio segmentation with energy-weighted bandwidth bias will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3779846