Computer system and computer-implemented process for phonology-b

Patent

Details

395 264, 395 24, G10L 506, G10L 900

active

056236093

ABSTRACT:
The present invention is based on the use of linguistic, especially phonological, knowledge to guide the speech recognition process. A speech signal containing an utterance is received and linguistic cues in the speech signal are detected. From these detected linguistic cues, a symbolic representation of the contents of the speech signal is generated. This symbolic representation comprises at least one word division, wherein each word division consists of an onset-rhyme pair and associated phonological elements. These phonological elements are univalent, may appear in all languages, and are distinguishable from each other and directly interpretable in the speech signal. A lexicon of predetermined symbolic representations is provided for words in a particular language. A best match to the generated symbolic representation is found in the lexicon, thereby recognizing the spoken word.
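
As an illustration only, the lexicon-matching step described in the abstract might be sketched as follows. The element labels ("A", "I", "U", "H", "?"), the toy lexicon entries, the helper names, and the distance measure (a count of elements that differ between two representations) are all assumptions made for this sketch; the abstract does not specify any of them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OnsetRhyme:
    """One word division: an onset-rhyme pair with its univalent elements."""
    onset: frozenset
    rhyme: frozenset


def pair(onset, rhyme):
    # Hypothetical convenience constructor for the sketch.
    return OnsetRhyme(frozenset(onset), frozenset(rhyme))


def pair_distance(a, b):
    # Elements are univalent (present or absent), so count the elements
    # that appear in exactly one of the two representations.
    return len(a.onset ^ b.onset) + len(a.rhyme ^ b.rhyme)


def word_distance(a, b):
    # Assumption: words with a different number of onset-rhyme pairs
    # are treated as non-matching.
    if len(a) != len(b):
        return float("inf")
    return sum(pair_distance(x, y) for x, y in zip(a, b))


def best_match(candidate, lexicon):
    # Return the lexicon key whose stored representation is closest.
    return min(lexicon, key=lambda word: word_distance(candidate, lexicon[word]))


# Hypothetical lexicon: placeholder entries, not real phonological analyses.
lexicon = {
    "word_a": (pair({"H"}, {"A"}),),
    "word_b": (pair({"?", "U"}, {"I"}),),
    "word_c": (pair({"H"}, {"A"}), pair({"?"}, {"U"})),
}

# Representation generated from detected cues, with one element missed.
heard = (pair({"?"}, {"I"}),)

print(best_match(heard, lexicon))  # prints "word_b"
```

A set-based distance is used here because the abstract describes the elements as univalent; a real system would likely weight elements and positions differently.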

REFERENCES:
patent: 3679830 (1972-07-01), Uffelman et al.
patent: 4087632 (1978-05-01), Hafer
patent: 4670851 (1987-06-01), Murakami et al.
patent: 4783804 (1988-11-01), Juang et al.
patent: 4811399 (1989-03-01), Landell et al.
patent: 4813076 (1989-03-01), Miller
patent: 4815134 (1989-03-01), Picone et al.
patent: 4820059 (1989-04-01), Miller et al.
patent: 4868867 (1989-09-01), Davidson et al.
patent: 4885791 (1989-12-01), Fujii et al.
patent: 4896358 (1990-01-01), Bahler et al.
patent: 4899385 (1990-02-01), Ketchum et al.
patent: 4910781 (1990-03-01), Ketchum et al.
patent: 4933973 (1990-06-01), Porter
patent: 4965580 (1990-10-01), Tasaki et al.
patent: 5054085 (1991-10-01), Meisel et al.
patent: 5129001 (1992-07-01), Bahl et al.
ISBN 4-88552-072-X, In Japanese with translation, pp. 26-60, 19988.
Kaye et al., "Constituent Structure and Government in Phonology," Phonology, vol. 7, 1990, pp. 193-231.
Eide et al., "A Linguistic Feature Representation of the Speech Waveform," International Conf. on Acoustics, Speech and Signal Processing, vol. 2, Apr. 27, 1993-Apr. 30, 1993, pp. 483-486.
Datta, "Manner-based Labelling of Speech Signal Using Total Energy Profile," EPO Conf. on Speech Comm. and Technology, vol. 2, Sep. 26, 1989, pp. 100-107.
Giordana et al., "Use of Lexical Constraints in Continuous Speech Understanding," Proceedings of the Int'l Conf. on Systems, Man and Cybernetics, vol. 1, Dec. 29, 1983-Jan. 7, 1984, pp. 319-322.
Weintraub, "The GRASP Sound Separation System," Proceedings of the IEEE Int'l Conf. on Acoustics Speech and Signal Proc., vol. 2, Mar. 19, 1984-Mar. 21, 1984, pp. 18A.6.1-18A.6.4.
D. Touretzky and D. Wheeler, "Exploiting Syllable Structure in a Connectionist Phonology Model," pp. 612-618.
Ken-ichi Iso and T. Watanabe, "Speech Recognition Using Demi-Syllable Neural Prediction Model," pp. 227-233.
B. Dresher and J. Kaye, "A computational learning model for metrical phonology," Elsevier Science Publishers B.V., Cognition, vol. 34, (1990) pp. 137-195.
G. Williams and W. Brockhaus, "Automatic Speech Recognition: A Principle-Based Approach," SOAS Working Papers in Linguistics, vol. 2, pp. 293-313 (1992).
Lyons, J., Introduction to Theoretical Linguistics, (Cambridge: Camb. Univ. Press, 1968, 1991).
J. Kaye, J. Lowenstamm, and J. Vergnaud, "The internal structure of phonological elements: a theory of charm and government," Phonology Yearbook 2, (1985), pp. 305-328.
J. Harris and J. Kaye, "A Tale of Two Cities: London Glottalling and New York City Tapping," The Linguistic Review, vol. 7, pp. 251-274, 1990.
J. Kaye, "`Coda` Licensing," Phonology 7 (1990), pp. 301-330.
M. Randolph, "Syllable-based Constraints on Properties of English Sounds," Sep. 1989, Ph.D. Thesis, Massachusetts Institute of Technology.
Speech Recognition Update, "News and analysis of speech recognition markets, companies, and technology, vol. 2," Mar. 1993, pp. 1-24.
Lyn Frazier, "Structure in auditory word recognition," in Spoken Word Recognition, Frauenfelder et al., eds., Cognition 25, 1987, pp. 7-187.
F. Jelinek, "Continuous Speech Recognition by Statistical Methods," Proceedings of the IEEE, vol. 64, No. 4, Apr. 1976, pp. 532-556.
L.R. Rabiner, B.H. Juang, S.E. Levinson, and M.M. Sondhi, "Some Properties of Continuous Hidden Markov Model Representations," AT&T Tech. Journal, vol. 64, No. 6, Jul.-Aug. 1985, pp. 1251-1270.
S. Roucos, A. Wilgus, and W. Russell, "A Segment Vocoder Algorithm for Real-Time Implementation," Proceedings of the IEEE, 1987, pp. 1949-1952, 45.7.1-45.7.4.
C. Tsao and R. Gray, "Shape-Gain Matrix Quantizers for LPC Speech," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-34, No. 6, Dec. 1986, pp. 1427-1439.
Y. Shiraki and M. Honda, "LPC Speech Coding Based on Variable-Length Segment Quantization," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 36, No. 9, Sep. 1988, pp. 1437-1444.
L. Bahl, P. Brown, P. deSouza and R. Mercer, "Speech Recognition with Continuous-Parameter Hidden Markov Models," IEEE Acoustic Speech and Signal Processing Society, 1988 Int'l Conf., pp. 40-43.
L.R. Rabiner, B.H. Juang, S.E. Levinson, and M.M. Sondhi, "Recognition of Isolated Digits Using Hidden Markov Models with Continuous Mixture Densities," AT&T Technical Journal, vol. 64, No. 6, Jul.-Aug. 1985, pp. 1211-1234.
L.R. Bahl, P.F. Brown, P.V.deSouza, and R.L. Mercer, "A New Algorithm for the Estimation for Hidden Markov Model Parameters," IEEE Acoustics, Speech, and Signal Processing Society, 1988 Int'l Conf. vol. 1, pp. 493-496.
M. Fanty and R. Cole, "Spoken Letter Recognition," pp. 220-226.
J. Aitchison, Words in the Mind--An Introduction to the Mental Lexicon--, Blackwell: Oxford, pp. 120-125, 1987.
S. Blumstein, "Acoustic Invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants," J. Acoust. Soc. Am. 66(4), Oct. 1979, pp. 1001-1017.
W. Brockhaus, "Colourful Leagues: A Government Phonology Approach to Final Obstruent Devoicing in German," UCL Working Papers In Linguistics, vol. 2, pp. 270-297, 1990.
W. Brockhaus, "Skeleton and supra segmental structure within Government Phonology," (dated 18 Jan. 1993), pp. 1-49.
S. Blumstein, "Perceptual invariance and onset spectra for stop consonants in different vowel environments," J. Acoust. Soc. Am. 67(2), Feb. 1980, pp. 648-661.
C. Browman and L. Goldstein, "Articulatory gestures as phonological units," Phonology 6 (1989), pp. 201-251.
M. Charette, "License to govern," Phonology 7 (1990), pp. 233-253.
M. Charette, "Mongolian and Polish meet Government Licensing," pp. 275-291, SOAS Working Papers in Linguistics, vol. 2, pp. 275-292 (1991/2).
K. Church, "Phonological parsing and lexical retrieval," in Spoken Word Recognition, Cognition 25, 1987, (Frauenfelder et al., eds.).
Altman, Gerry T., ed., Cognitive Models of Speech Processing, (Cambridge, Mass.: MIT Press, 1990), pp. 1-23, 211-235 and 263-280.
J. Harrington, "Acoustic Cues for Automatic Recognition of English Consonants," pp. 69-143.
John Harris, "Segmental complexity and phonological government," Phonology 7 (1990), pp. 255-300.
G. Lindsey and J. Harris, "Phonetic Interpretation in Generative Grammar," UCL Working Papers in Linguistics, vol. 2, pp. 355-369, 1990.
J. Harris and G. Lindsey, "The elements of phonological representation," Mar. 1993, in New Frontiers in Phonology, (Durand and Katamba, eds.) pp. 1-51 (Harlow; Essex: Longman).
J. Harris, "Licensing Inheritance," pp. 359-406, UCL Working Papers in Linguistics, vol. 4, 1992.
C. Hoequist, Jr., "Phonological Rules and Speech Recognition," pp. Hoequist 1 through Hoequist 8, Cambridge Papers in Phonetics and Linguistics, vol. 2, 1986.
R. Jakobson, C. Fant, M. Halle, Preliminaries to Speech Analysis, The MIT Press, Sep. 1976, pp. i-43.
Jonathan Kaye, "Do you believe in magic? the story of s+C sequences," pp. 1-21, SOAS Working Papers in Linguistics, vol. 2: 293-314, 1992.
D. Klatt, "Lexical Representations for Speech Production and Perception," in The Cognitive Representation of Speech (Myers et al., eds.), 1981, pp. 11-31 (North Holland Publishing Co.).
L.R. Rabiner and M.R. Sambur, "An Algorithm for Determining the Endpoints of Isolated Utterances," (Jun. 1
