User independent, real-time speech recognition system and method

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

G10L 506

Patent

active

056404905

ABSTRACT:
A system and method for identifying the phoneme sound types that are contained within an audio speech signal is disclosed. The system includes a microphone and associated conditioning circuitry, for receiving an audio speech signal and converting it to a representative electrical signal. The electrical signal is then sampled and converted to a digital audio signal with a digital-to-analog converter. The digital audio signal is input to a programmable digital sound processor, which digitally processes the sound so as to extract various time domain and frequency domain sound characteristics. These characteristics are input to a programmable host sound processor which compares the sound characteristics to standard sound data. Based on this comparison, the host sound processor identifies the specific phoneme sounds that are contained within the audio speech signal. The programmable host sound processor further includes linguistic processing program methods to convert the phoneme sounds into English words or other natural language words. These words are input to a host processor, which then utilizes the words as either data or commands.

REFERENCES:
patent: 3581192 (1971-05-01), Miura et al.
patent: 3703609 (1972-11-01), Gluth
patent: 3838217 (1974-09-01), Dreyfus
patent: 3938394 (1976-02-01), Morrow et al.
patent: 3940565 (1976-02-01), Lindenberg
patent: 3969972 (1976-07-01), Bryant
patent: 4181813 (1980-01-01), Marley
patent: 4452079 (1984-06-01), Tiller
patent: 4658252 (1987-04-01), Rowe
patent: 4780906 (1988-10-01), Rajasekaran et al.
patent: 4817154 (1989-03-01), Hoyer
patent: 4852170 (1989-07-01), Bordeaux
patent: 4862503 (1989-08-01), Rothenberg
patent: 4975957 (1990-12-01), Ichikawa
patent: 4991216 (1991-02-01), Fujii et al.
patent: 4998280 (1991-03-01), Amano et al.
patent: 5027410 (1991-06-01), Williamson et al.
patent: 5065432 (1991-11-01), Sasaki et al.
patent: 5068900 (1991-11-01), Searcy et al.
patent: 5091948 (1992-02-01), Kametani
patent: 5121434 (1992-06-01), Mrayati et al.
patent: 5166981 (1992-11-01), Iwahashi et al.
patent: 5202926 (1993-04-01), Miki
patent: 5299125 (1994-03-01), Baker et al.
patent: 5321608 (1994-06-01), Namba et al.
Quenot, Gauvain, Gangolf & Mariani, A Dynamic Programming Processor for Speech Recognition, IEEE Journal of Solid-State Circuits, vol. 24, No. 2, pp. 349-357, Apr. 1989.
Wang, Wu, Chang & Lee, A Hierarchical Neural Network Model Based on C/V Segmentation Algorithm for Isolated Mandarin Speech Recognition, IEEE Transactions on Signal Processing, vol. 39, No. 9, pp. 2141-2147, Sep. 1991. Takahashi, Hamauchi, Tansho & Kimura, A Modularized Processor LSI with a Highly Parallel Structure for Continuous Speech Recognition, IEEE Journal of Solid-State Circuits, vol. 26, No. 6, pp. 833-843, Jun. 1991.
Elman, A Personal Computer-based Speech Analysis and Synthesis System, IEEE MICRO, pp. 4-21, June 1987.
Levinson & Roe, A Perspective on Speech Recognition--IEEE Communications Magazine, pp. 28-34, Jan. 1990.
Krubsack & Niederjohn, An Autocorrelation Pitch Detector and Voicing Decision with Confidence Measures Developed for Noise-Corrupted Speech, IEEE Transactions on Signal Processing, vol. 39, No. 2 pp. 319-329, Feb. 1991.
Hurst & Brodersen, An MOS-LSI Autocorrelator for Linear Prediction of Speech, IEEE Journal of Solid-State Circuits, vol. sc-19, No. 6, pp. 1022-1029, Dec. 1984.
Zhao, Atlas & Zhuang, Application of the Gibbs Distribution to Hidden Markov Modeling in Speaker Indepdent Isolated Word Recognition, IEEE Transactions on Signal Processing, vol. 39, No. 6, pp. 1291-1299, Jun. 1991.
Drews, Laroia, Pandel, Schumacher & Stolzle, CMOS Processor for Template-Based Speech-Recognition System, IEE Proceedings, vol. 136, Pt. 1, No. 2, pp. 155-161, Apr. 1989.
Young, Competitive Training: A Connectionist Approach to the Discriminative Training of Hidden Markov Models, IEE Proceedings-1, vol. 1338, No. 1, pp. 61-68, Feb. 1991.
Young Designing a Conversational Speech Interface, IEE Proceedings, vol. 133, Pt. E, No. 6, pp. 305-311, Nov. 1986.
Kong & Kosko, Differential Competitive Learning for Centroid Estimation and Phoneme Recognition, IEEE Transactions on Neural Networks, vol. 2, No. 1, pp. 118-124, Jan. 1991.
Ney, Dynamic Programming Parsing for Context-Free Grammars in Continuous Speech Recognition, IEEE Transactions on Signal Processing, vol. 39, No. 2, pp. 336-340, Feb. 1991.
Murano, Unagami & Amano, Echo Cancellation and Applications, IEEE Communications Magazine, pp. 49-55, Jan. 1990.
Jack, Laver & Blauert, Editorial: Speech Technology, IEE Proceedings, vol. 136, Pt. 1, No. 2, p. 109, Apr. 1989.
Chen & Pan, Fast Search Algorithm for VQ-Based Recognition of Isolated Words, IEE Proceedings, Vol. 136, Pt 1, No. 6, pp. 391-396, Dec. 1989.
Bengio, De Mori, Flammia & Kompe, Global Optimization of Neural Network-Hidden Markov Model Hybird, IEEE Transactions on Neural Networks, vol. 3, No. 2, pp. 253-259, Mar. 1992.
Mariani, Hamlet A Prototype of a Voice-Activated Typewriter, IEE Proceedings, vol. 136, Pt. 1, No. 2, pp.162-166, Apr. 1989.
Trancoso & Tribolet, Harmonic Postprocessing Speech Synthesized by Stochastic Coders, IEE Proceedings, vol. 136, Pt. 1, No. 2, pp. 141-144, Apr. 1989.
Martinelli, Orlandi, Ricotti & Ragazzini, Identification of Stable Nonstationary Lattice Predictors by Linear Programming, IEE Proceedings, vol. 74, No. 5, pp. 759-776, May 1986.
Sutherland, Jack & Laver, Improved Pitch Detection Algorithm Employing Temporal Structure Investigation of the Speech Waveform, IEEE Proceedings, vol. 135, Pt. F, No. 2, pp. 169-174, Apr. 1988.
Lee, Information-Theoretic Distortion measures for Speech Recognition, IEEE Transactions on Signal Processing, Vol. 39, No. 2, pp. 330-335, Feb. 1991.
Yuhas, Goldstein & Sejnowski, Integration of Acoustic and Visual Speech Signals Using Neural Networks, IEEE Communications Magazine, pp. 65-71, Nov. 1989.
Erell, Orgad & Goldstein, JND's in the LPC Poles of Speech and Their Application to Quantization of the LPC Filter, IEEE Transactions on Signal Processing, vol. 39, No. 2, pp. 308-318, Feb. 1991.
Liu, Lee, Wang & Chang, Layered Neutral Nets Applied in the Recognition of Voiceless Unaspirated Stops, IEE Proceedings, vol. 136, Pt. 1,No. 2, pp. 69-75, Apr. 1989.
De Mori, Lam & Gilloux, Learning and Plan Refinement in a Knowledge-Based System for Automatic Speech Recognition, IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. PAMI-9, No. 2 pp. 289-305, Mar. 1987.
Schroeder, Linear Predictive Coding of Speech: Review and Current Directions, IEEE Communications Magazine, pp. 54-61, Aug. 1985.
Pal & Mirta, Multilayer Perception, Fuzzy Sets, and Classification, IEEE Transactions on Neural Networks, vol. 3, No. 5, pp. 683-697, Sep. 1992.
Yuhas,Goldstein, Sejnowski & Jenkins, Neural Network Models of Sensory Integration for Improved Vowe Recognition, IEEE Proceedings, vol. 78, No. 10, pp. 1658-1668, Oct. 1990.
Rashwan & Fahmy, New Technique for Speaker-Independent Isolated-Work Recognition, IEEE Proceedings, vol. 135, Pt. F, No. 3, pp. 251-546, Jun. 1988.
Brieseman, Thorpe & Bates, Nontactile Estimation of Glottal Excitation Characteristics of Voiced Speech, IEEE Proceedings, vol. 134, Pt. A, No. 10, pp. 807-813, Dec. 1987.
Chou, Optimal Partitioning for Classification and Regression Trees, IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 13, No. 4, pp. 340-355, Apr. 1991.
Lowe & Webb, Optimized Feature Extraction and the Bayes Decision in Feed-Forward Classifier Networks, IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 13, No. 4, pp. 355-364, Apr. 1991.
Lippmann, Pattern Classification Using Neural Networks, IEEE Communications Magazine, pp. 47-64, Nov. 1989.
Pisoni, Nusbaum & Greene, Perception of Synthetic Speech Generated by Rule, IEE Proceedings, Vol. 73, No. 11, pp. 1665-, Nov. 1985.
Mohan & Komandur, Performance of a Multiprocessor-Based Parallel Stack Algorithm Speech Encoder, IEEE, pp. 463-467, 1987.
Barnard, Cole, Vea & Alleva, Pitch Detection with a Neural-Net Classifier, IEEE Transactions on Signal Processing, vol. 39, No. 2, pp. 298-307, F

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

User independent, real-time speech recognition system and method does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with User independent, real-time speech recognition system and method, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and User independent, real-time speech recognition system and method will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2164362

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.