Synthesizing word baseforms used in speech recognition

Electrical audio signal processing systems and devices – Monitoring/measuring of audio devices – Loudspeaker operation

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

381 43, 381 36, G10L 500

Patent

active

048827592

ABSTRACT:
Apparatus and method for synthesizing word baseforms for words not spoken during a training session, wherein each synthesized baseform represents a series of models from a first set of models, which include: (a) uttering speech during a training session and representing the uttered speech as a sequence of models from a second set of models; (b) for each of at least some of the second set models spoken in a given phonetic model context during the training session, storing a respective string of first set models; and (c) constructing a word baseform of first set models for a word not spoken during the training session, including the step of representing each piece of a word that corresponds to a second set model in a given context by the stored respective string, if any, corresponding thereto.

REFERENCES:
patent: 4181821 (1980-01-01), Pirz et al.
patent: 4513436 (1985-04-01), Nose et al.
patent: 4587670 (1986-05-01), Levinson et al.
patent: 4593367 (1986-06-01), Slack et al.
ICASSP 84 Proceedings of the IEEE International Conf on ASSP, Mar. 1984, vol. 3, pp. 35.6.1-35.6.4 "Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition" by R. Schwartz.
ICASSP 84 Proceedings of the IEEE International Conf on ASSP, Mar. 1984 pp. 42.5.1-42.5.4 "An Information Theoretic Approach to the Automatic Determination of Phonemic Baseforms" by J. M. Lucassen.
IEEE Trans on Acoustics, Speech and Signal Processing, vol. ASSP-28, No. 2, Apr. 1980, pp. 129-136, "A Training Procedure for Isolated Word Recognition Systems" by S. Furui.
Research Disclosure, nr. 256, Aug. 1985, p. 418, Abstract No. 25649, Emsworth, Hampshire, GB; "Composite Fenemic Phones".
IBM Technical Disclosure Bulletin, vol. 24, No. 4, Sep. 1981, pp. 2042-2043, New York; L. R. Bahl et al "Automatic Selection of Speech Prototypes".
M. Cravero et al, "Phonetic Units for Hidden Markov Models", CSELT Technical Report, vol. 14, No. 2, Oct. 7, 1986, pp. 121-125.
B. H. Juang et al, "Recent Developments in the Application of Hidden Markov Models to Speaker-Independent Isolated Word Recognition", IEEE, 1985, pp. 1.3.1-1.3.4.
H. Bourlard et al, "Speaker Dependent Connected Speech Recognition via Phonemic Markov Models", IEEE, 1985, pp. 1213-1216.
Y. Kamp et al, "State Reduction in Hidden Markov Chains Used for Speech Recognition", IEEE, 1985, pp. 1138-1145.
R. Schwartz et al, "Context-Dependent Modeling for Acoustic-Phonetic Recognition of Continuous Speech", IEEE 1985, pp. 1205-1208.
J. P. Mari et al, "Speaker Independent Connected Digit Recognition Using Hidden Markov Models", Speech Tech '85, vol. 1, No. 2, pp. 127-132.
S. Levinson et al, "Speaker Independent Isolated Digit Recognition Using Hidden Markov Models", IEEE, 1983, pp. 1049-1052.
D. M. Choy et al, "Speech Compression by Phoneme Recognition", IBM TDB, vol. 25, No. 6, Nov. 1982, pp. 2884-2886.
R. Bakis et al, "Continuous Speech Recognition via Centisecond Acoustic States", Research Report, 1978, pp. 1-9.
R. Bakis, "Spoken Word Spotting via Centisecond Acoustic States", IBM TDB vol. 18, No. 10, Mar. 1976, pp. 3479-3481.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Synthesizing word baseforms used in speech recognition does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Synthesizing word baseforms used in speech recognition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Synthesizing word baseforms used in speech recognition will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1430242

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.