Data processing: speech signal processing – linguistics – language – Speech signal processing – Application
Patent
1996-08-20
2000-05-30
Zele, Krista
Data processing: speech signal processing, linguistics, language
Speech signal processing
Application
704270, G10L 1528
Patent
active
060701390
ABSTRACT:
Bifurcated speaker specific and non-speaker specific method and apparatus is provided for enabling speech-based remote control and for recognizing the speech of an unspecified speaker at extremely high recognition rates regardless of the speaker's age, sex, or individual speech mannerisms. A device main unit is provided with a speech recognition processor for recognizing speech and taking an appropriate action, and with a user terminal containing specific speaker capture and/or preprocessing capabilities. The user terminal exchanges data with the speech recognition processor using radio transmission. The user terminal may be provided with a conversion rule generator that compares the speech of a user with previously compiled standard speech feature data and, based on this comparison result, generates a conversion rule for converting the speaker's speech feature parameters to corresponding standard speaker's feature information. The speech recognition processor, in turn, may reference the conversion rule developed in the user terminal and perform speech recognition based on the input speech feature parameters that have been converted above.
REFERENCES:
patent: 2338551 (1944-01-01), Stanko
patent: 3610831 (1971-10-01), Moshier
patent: 4052568 (1977-10-01), Jankowski
patent: 4305131 (1981-12-01), Best
patent: 4319085 (1982-03-01), Welch et al.
patent: 4336421 (1982-06-01), Welch et al.
patent: 4947482 (1990-08-01), Brown
patent: 4984177 (1991-01-01), Rondel et al.
patent: 5014219 (1991-05-01), White
patent: 5040215 (1991-08-01), Amano et al.
patent: 5046019 (1991-09-01), Basehore
patent: 5054082 (1991-10-01), Smith et al.
patent: 5086385 (1992-02-01), Launey et al.
patent: 5093899 (1992-03-01), Hiraiwa
patent: 5119469 (1992-06-01), Alkon et al.
patent: 5150323 (1992-09-01), Castelaz
patent: 5163111 (1992-11-01), Baji et al.
patent: 5175794 (1992-12-01), Tattersall
patent: 5182794 (1993-01-01), Gasperi et al.
patent: 5185848 (1993-02-01), Aritsuka et al.
patent: 5218668 (1993-06-01), Higgins et al.
patent: 5247584 (1993-09-01), Krogmann
patent: 5255342 (1993-10-01), Nitta
patent: 5263097 (1993-11-01), Katz et al.
patent: 5278944 (1994-01-01), Sasaki et al.
patent: 5285522 (1994-02-01), Mueller
patent: 5285523 (1994-02-01), Takahashi
patent: 5297237 (1994-03-01), Masuoka et al.
patent: 5307444 (1994-04-01), Tsuboka
patent: 5357596 (1994-10-01), Takebayashi et al.
patent: 5377301 (1994-12-01), Rosenberg et al.
patent: 5377305 (1994-12-01), Russo
patent: 5384892 (1995-01-01), Strong
patent: 5444673 (1995-08-01), Mathurin
patent: 5459815 (1995-10-01), Aikawa et al.
patent: 5481644 (1996-01-01), Inazumi
patent: 5548681 (1996-08-01), Gleaves et al.
patent: 5562453 (1996-10-01), Wen
patent: 5577164 (1996-11-01), Kaneko et al.
patent: 5664059 (1997-09-01), Zhao
patent: 5668929 (1997-09-01), Foster, Jr.
patent: 5704009 (1997-12-01), Cline et al.
C. Iooss, From Lattices Of Phonemes To Sentences: A Recurrent Neural Network Approach, International Joint Conference on Neural Networks, Jul. 8-14, 1991, vol. 2, pp. 833-838.
A.N. Michel, et al., Associative Memories Via Artificial Neural Networks, IEEE Control Systems Magazine, Apr. 1990, pp. 6-17.
M. Sompolinsky, et al., Chaos In Random Neural Networks, Physical Review Letter, Jul. 18, 1988, pp. 259-262.
M. Palakal, et al., Speaker-Invarient Pheneme Recognition Using Multiple Neural Network Models, 1991 IEEE International Conference on Neural Networks, Jul. 8-11, 1991, pp. 839-844.
J.A. Naylor, et al., The Application Of Neural Networks To Wordspotting, 1992 26.sup.th Asilomar Conference on Signals, Systems & Computers, Oct. 26-28, pp. 1081-1085.
K.P. Li, et al, A Whole Word Recurrent Neural Network For Keyword Spotting, ICASSP, Mar. 1992, pp. II-81 to II-84.
N.Z. Hakim, et al., Cursive Script Online Character Recognition With A Recurrent Neural Network Model, IJCNN, Jun. 1992, pp. III-777 to III-716.
J. Hertz, Introduction To The Theory Of Neural Computation, Santa Fe Institute Studies In The Sciences Of Complexity, Lecture Notes, vol. 1, Addison-Wesley, 1991, pp. 172-187.
J. Hwang, et al., Interactive Query Learning For Isolated Speech Recognition, Sp. 1992, IEEE, pp. 93-94.
A. Hirai, et al., Phoneme-Based Word Recognition By Neural Ntwork--A Step Toward Large Vocabulary Recognition, IEEE, Jun. 1990, pp. III-671, to III-676.
A. Waibel, et al., Modularity And Scaling In Large Phoenemic Neural Networks, IEEE Transactions On Acoustics, Speech And Signal Processing, vol. 37. No. 12, Dec. 1989, pp. 1888-1898.
A. Baloch, et al., A Neural System For Behavioral Conditioning Of Mobile Robots, IJCNN International Joint Conference On Neural Networks, San Diego, Jun. 17-21, 1990, pp. II-723 to II-728.
T. Robinson, et al., A Recurrent Error Propagation Network Speech Recognition System, 8300 Computer Speech & Language 5, (1991), Jul. No. 3, London, GB; pp. 259-274.
Y. Bengio, et al., Learning The Dynamic Nature Of Speech With Back-Propagation For Sequences, 8221 Pattern Recognition Letters 13 (1992) May, No. 5, Amsterdam, NL, pp. 375-385.
T. Tanaka, A Complex Sequence Recognition Model, IJCNN International Joint Conference On Neural Networks, Baltimore, MD, Jun. 7-11, 1992, vol. 4, pp. IV-201 to IV-207.
T. Watanabe, et al., Study Of Learning Methods And Shape Of The Learning Surface For Recurrent Neural Networks, Teses by IEICE, vol. J74D-II, No. 12, Dec. 25, 1991, pp. 1776-1787.
M. Inazumi, et al., Continuous Spoken Digit Recognition By Recurrent Neural Networks, Technical Report of IEICE, SP92-125, Jan. 19, 1993, pp. 17-24.
M. Inazumi, et al., Connected Word Recognition By Recurrent Neural Networks, Technical Report Of IEICE, SP-92-25, Jun. 30, 1992, pp. 9-16.
K. Funahashi, On The Recurring Neural Networks, Technical Research Report By IEICE, SP-92-80, Oct. 21, 1992, pp. 51-58.
Y. Fukuda, et al., Phoeneme Recognition Using Recurrent Neural Networks, Technical Research Report By IEICE, NC92-10, May 8, 1991, pp. 71-78.
T. English, et al., Back-Propagation Training Of A Neural Network For Word Spotting, ICASSP, vol. 2, Mar. 23-26, 1992, San Francisco, CA, US, pp. 357-360.
Z. Zhao, Connectionist Training Of Non-Linear Hidden Markov Models For Speech Recognition, IJCNN, 1991, vol. 2, Nov. 18-21, 1991, Singapore, SG, pp. 1647-1652.
F. Greco, et al., A Recurrent Time-Delay Neural Network For Improved Phoneme Recognition, ICASSP, 1991, vol. 1, May 14-17, 1991, Toronto, Canada, pp. 81-84.
H.U. Bauer, et al., Nonlinear Dynamics Of Feedback Multilayer Perceptrons, Physical Review A, vol. 2, No. 4, Aug. 15, 1990, US, pp. 2401-2408.
T. Robinson, A Real-Time Recurrent Error Propagation Network Word Recognition System, ICASSP-92, vol. 1, Mar. 23-26, 1992, pp. 617-620.
Edatsune Isao
Hasegawa Hiroshi
Inazumi Mitsuhiro
Miyazawa Yasunaga
Urano Osamu
Opsasnick Michael N.
Seiko Epson Corporation
Zele Krista
LandOfFree
Bifurcated speaker specific and non-speaker specific speech reco does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Bifurcated speaker specific and non-speaker specific speech reco, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Bifurcated speaker specific and non-speaker specific speech reco will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1918178