Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1996-02-02
2000-05-23
Dorvil, Richemond
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704242, G10L 1520
Patent
active
060675179
ABSTRACT:
A technique to improve the recognition accuracy when transcribing speech data that contains data from a wide range of environments. Input data in many situations contains data from a variety of sources in different environments. Such classes include: clean speech, speech corrupted by noise (e.g., music), non-speech (e.g., pure music with no speech), telephone speech, and the identity of a speaker. A technique is described whereby the different classes of data are first automatically identified, and then each class is transcribed by a system that is made specifically for it. The invention also describes a segmentation algorithm that is based on making up an acoustic model that characterizes the data in each class, and then using a dynamic programming algorithm (the viterbi algorithm) to automatically identify segments that belong to each class. The acoustic models are made in a certain feature space, and the invention also describes different feature spaces for use with different classes.
REFERENCES:
patent: 4430726 (1984-02-01), Kasday
patent: 5333275 (1994-07-01), Wheatley et al.
patent: 5579436 (1996-11-01), Chou et al.
Bahl Lalit Rai
Gopalakrishnan Ponani
Gopinath Ramesh Ambat
Maes Stephane Herman
Panmanabhan Mukund
Dorvil Richemond
International Business Machines - Corporation
Otterstedt Paul J.
LandOfFree
Transcription of speech data with segments from acoustically dis does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Transcription of speech data with segments from acoustically dis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Transcription of speech data with segments from acoustically dis will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1843741