Patent
1994-04-12
1997-02-25
MacDonald, Allen R.
395 254, G01L 900
Patent
active
056066430
ABSTRACT:
A processor controlled system for correlating an electronic index according to speaker for audio data being recorded in real time. The system includes a source of training data for each of the plurality of individual speakers and audio input system for providing real time audio data including speech for the individual speakers. The audio data is converted into spectral feature data by an audio processor, and is simultaneously recorded on a storage medium by a recording device. A system processor accepts the training data to create individual speaker models, which are combined in parallel to form a speaker network. The system processor then accepts the spectral feature data of the audio data and, using the speaker network, determines segments in the audio data corresponding to each speaker.
REFERENCES:
patent: 4783804 (1988-11-01), Juang et al.
patent: 4837830 (1989-06-01), Wrench et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5202952 (1993-04-01), Gillick et al.
patent: 5271088 (1993-12-01), Bahler
patent: 5406634 (1995-04-01), Anderson et al.
patent: 5473728 (1995-12-01), Luginbuhl et al.
Euler et al., "Statistical segmentation and word modeling techniques in isolated word recognition", ICASSP'90: Acoustics, speech & Signal Processing Conference, pp. 745-748.
Ostendorf et al., "The stochastic segment model for continuous speech recognition", 1991, Signals, Systems & computers, 1991 Asilomar Conference, pp. 964-968.
Iwasaki et al., "A real time Speaker-independent continuous speech recognition system", 1992, Patter Recognition, 1992 11th International Conference, pp. 663-666.
Russell, "A segmental HMM for speech pattern modeling", ICASSP'93:Acoustics, Speech & Signal Processing Conference, vol. II, pp. 499-502.
Wilcox et al., "Segmentation of speech using speaker identification", ICASSP'94: Acoustics, Speech & Signal Processing Conference, vol. I, pp. 161-164.
Gish et al., "Segregation of Speakers for Speech Recognition and Speaker Identification," Proc. Int. Conf. Acoustics, Speech and Signal Processing, May 1991, vol. 2 pp. 873-876.
Siu et al., "An Unsupervised Sequential Learning Algorithm for the Segmentation of Speech Waveforms with Multiple Speakers," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Mar. 1992, vol. 2 pp. 189-192.
Sugiyama et al., "Speech Segmentation and Clustering Based on Speaker Features," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Apr. 1993, vol. 2, pp. 395-398.
Matsui et al., "Comparison of Text-Independent Speaker Recognition Methods Using VQ-Distortion and Discrete/Continuous HMMs," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Mar. 1992, vol. 2, pp. 157-160.
Balasubramanian Vijay
Chen Francine R.
Chou Philip A.
Kimber Donald G.
Poon Alex D.
Chawan Vijay B.
Hurt Tracy L.
Jacobs R. Christine
MacDonald Allen R.
Xerox Corporation
LandOfFree
Real-time audio recording system for automatic speaker indexing does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Real-time audio recording system for automatic speaker indexing, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Real-time audio recording system for automatic speaker indexing will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1979911