Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1998-01-15
1999-12-28
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704240, 704255, G01L 900
Patent
active
060093927
ABSTRACT:
A method is provided which trains acoustic models in an automatic speech recognizer ("ASR") without explicitly matching decoded scripts with correct scripts from which acoustic training data is generated. In the method, audio data is input and segmented to produce audio segments. The audio segments are clustered into groups of clustered audio segments such that the clustered audio segments in each of the groups have similar characteristics. Also, the groups respectively form audio similarity classes. Then, audio segment probability distributions for the clustered audio segments in the audio similarity classes are calculated, and audio segment frequencies for the clustered audio segments are determined based on the audio segment probability distributions. The audio segment frequencies are matched to known audio segment frequencies for at least one of letters, combination of letters, and words to determine frequency matches, and a textual corpus of words is formed based on the frequency matches. Then, acoustic models of the automatic speech recognizer are trained based on the textual corpus. In addition, the method may receive and cluster video or biometric data, and match such data to the audio data to more accurately cluster the audio segments into the groups of audio segments. Also, an apparatus for performing the method is provided.
REFERENCES:
patent: 5122951 (1992-06-01), Kayima
patent: 5625748 (1997-04-01), McDonough et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5659662 (1997-08-01), Wilcox et al.
Kanevsky Dimitri
Zadrozny Wlodek Wlodzimierz
Hudspeth David R.
International Business Machines - Corporation
Storm Donald L.
LandOfFree
Training speech recognition by matching audio segment frequency does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Training speech recognition by matching audio segment frequency , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Training speech recognition by matching audio segment frequency will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2389484