Patent
1996-09-09
1997-08-19
MacDonald, Allen R.
395 252, 395 253, 395 264, G10L 900
Patent
active
056596626
ABSTRACT:
A system and method for unsupervised clustering of audio data segments in an audio data recording containing speech from multiple speakers including the steps of: 1) providing a portion of the audio data containing speech from all of the speakers; 2) forming initial clusters by dividing the portion of the audio data into segments, each of which includes an ordered data set; 3) computing the pairwise distance between each pair of clusters using a likelihood ration independent of the order of data within the segments; and 4) combining into a new cluster the two clusters with a minimum pairwise distance. These steps are repeated until a number of clusters equal to the number of speakers is obtained.
REFERENCES:
patent: 4802224 (1989-01-01), Shiraki et al.
Levinson et al, "Interactive Clustering Techniques for Selecting Speaker-Independent Ref Templates for Isolated Word Recognition," IEEE Trans. Acoustics, Speech and Signal Processing, Apr. 1979, vol. ASSP-27, No. 2, pp. 134-141.
Parsons, Voice and Speech Processing, McGraw Hill Book Co, 1986, pp. 188-191.
Rabiner et al, "HMM Clustering for connected word recognition", 1989, ICASSP-89, vol. 1, pp. 405-408.
Gish et al., "Segregation of Speakers for Speech Recognition and Speaker Identification," Proc. Int. Conf. Acoustics, Speech and Signal Processing, May 1991, vol. 2 pp. 873-876.
Siu et al., "An Unsupervised Sequential Learning Algorithm for the Segmentation of Speech Waveforms with Multiple Speakers," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Mar. 1992, vol. 2 pp. 189-192.
Sugiyama et al., "Speech Segmentation and Clustering Based on Speaker Features," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Apr. 1993, vol. 2, pp. 395-398.
Matsui et al., "Comparison of Text-Independent Speaker Recognition Methods Using VQ-Distortion and Discrete/Continuous HMMs," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Mar. 1992, vol. 2, pp. 157-160.
Kimber Donald G.
Wilcox Lynn D.
Chawan Vijay B.
Hurt Tracy L.
Jacobs R. Christine
MacDonald Allen R.
Xerox Corporation
LandOfFree
Unsupervised speaker clustering for automatic speaker indexing o does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Unsupervised speaker clustering for automatic speaker indexing o, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Unsupervised speaker clustering for automatic speaker indexing o will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1111291