Training speech recognition by matching audio segment frequency

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Patent

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Training speech recognition by matching audio segment frequency Training speech recognition by matching audio segment frequency

: 1998-01-15
: 1999-12-28
: Hudspeth, David R.
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: Recognition

: 704240, 704255, G01L 900
: Patent
: active
: 060093927
: ABSTRACT:
A method is provided which trains acoustic models in an automatic speech recognizer ("ASR") without explicitly matching decoded scripts with correct scripts from which acoustic training data is generated. In the method, audio data is input and segmented to produce audio segments. The audio segments are clustered into groups of clustered audio segments such that the clustered audio segments in each of the groups have similar characteristics. Also, the groups respectively form audio similarity classes. Then, audio segment probability distributions for the clustered audio segments in the audio similarity classes are calculated, and audio segment frequencies for the clustered audio segments are determined based on the audio segment probability distributions. The audio segment frequencies are matched to known audio segment frequencies for at least one of letters, combination of letters, and words to determine frequency matches, and a textual corpus of words is formed based on the frequency matches. Then, acoustic models of the automatic speech recognizer are trained based on the textual corpus. In addition, the method may receive and cluster video or biometric data, and match such data to the audio data to more accurately cluster the audio segments into the groups of audio segments. Also, an apparatus for performing the method is provided.

REFERENCES:
patent: 5122951 (1992-06-01), Kayima
patent: 5625748 (1997-04-01), McDonough et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5659662 (1997-08-01), Wilcox et al.

Affiliated with

Kanevsky Dimitri

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Zadrozny Wlodek Wlodzimierz

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Hudspeth David R.

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

International Business Machines - Corporation

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Storm Donald L.

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Training speech recognition by matching audio segment frequency does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Training speech recognition by matching audio segment frequency , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Training speech recognition by matching audio segment frequency will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-2389484

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure