Real-time audio recording system for automatic speaker indexing

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

395 254, G01L 900

Patent

active

056066430

ABSTRACT:
A processor controlled system for correlating an electronic index according to speaker for audio data being recorded in real time. The system includes a source of training data for each of the plurality of individual speakers and audio input system for providing real time audio data including speech for the individual speakers. The audio data is converted into spectral feature data by an audio processor, and is simultaneously recorded on a storage medium by a recording device. A system processor accepts the training data to create individual speaker models, which are combined in parallel to form a speaker network. The system processor then accepts the spectral feature data of the audio data and, using the speaker network, determines segments in the audio data corresponding to each speaker.

REFERENCES:
patent: 4783804 (1988-11-01), Juang et al.
patent: 4837830 (1989-06-01), Wrench et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5202952 (1993-04-01), Gillick et al.
patent: 5271088 (1993-12-01), Bahler
patent: 5406634 (1995-04-01), Anderson et al.
patent: 5473728 (1995-12-01), Luginbuhl et al.
Euler et al., "Statistical segmentation and word modeling techniques in isolated word recognition", ICASSP'90: Acoustics, speech & Signal Processing Conference, pp. 745-748.
Ostendorf et al., "The stochastic segment model for continuous speech recognition", 1991, Signals, Systems & computers, 1991 Asilomar Conference, pp. 964-968.
Iwasaki et al., "A real time Speaker-independent continuous speech recognition system", 1992, Patter Recognition, 1992 11th International Conference, pp. 663-666.
Russell, "A segmental HMM for speech pattern modeling", ICASSP'93:Acoustics, Speech & Signal Processing Conference, vol. II, pp. 499-502.
Wilcox et al., "Segmentation of speech using speaker identification", ICASSP'94: Acoustics, Speech & Signal Processing Conference, vol. I, pp. 161-164.
Gish et al., "Segregation of Speakers for Speech Recognition and Speaker Identification," Proc. Int. Conf. Acoustics, Speech and Signal Processing, May 1991, vol. 2 pp. 873-876.
Siu et al., "An Unsupervised Sequential Learning Algorithm for the Segmentation of Speech Waveforms with Multiple Speakers," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Mar. 1992, vol. 2 pp. 189-192.
Sugiyama et al., "Speech Segmentation and Clustering Based on Speaker Features," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Apr. 1993, vol. 2, pp. 395-398.
Matsui et al., "Comparison of Text-Independent Speaker Recognition Methods Using VQ-Distortion and Discrete/Continuous HMMs," Proc. Int. Conf. Acoustics, Speech and Signal Processing, Mar. 1992, vol. 2, pp. 157-160.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Real-time audio recording system for automatic speaker indexing does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Real-time audio recording system for automatic speaker indexing, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Real-time audio recording system for automatic speaker indexing will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1979911

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.