Continuous speech recognition system

Patent

Details

381 43, G10L 500

Patent

active

050401270

DESCRIPTION:

BRIEF SUMMARY
BACKGROUND OF THE INVENTION

The present invention relates to speech recognition, and, more particularly, to speech recognition wherein spoken word end points are not predetermined.
Recognition of isolated words from a given vocabulary for a known speaker has been known for some time. Words of the vocabulary are prestored as individual templates, each template representing the sound pattern for a word in the vocabulary. When an isolated word is spoken, the system compares the word to each individual template representing the vocabulary. This method is commonly referred to as whole-word template matching. Many successful recognition systems use whole-word template matching with dynamic programming to cope with nonlinear time scale variations between the spoken word and the prestored template.
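As a rough illustration of whole-word template matching with dynamic programming, the following sketch aligns a spoken word against each stored template and picks the closest one. It is only a minimal sketch under stated assumptions: each frame is reduced to a single number, the local distance is an absolute difference, and the names `dtw_distance` and `recognize_isolated` are hypothetical. A real recognizer would use feature vectors per frame and would typically constrain the warping path.

```python
from math import inf

def dtw_distance(spoken, template):
    """Accumulated distance of the best nonlinear time alignment (assumed 1-D frames)."""
    n, m = len(spoken), len(template)
    # cost[i][j]: best accumulated distance aligning spoken[:i] with template[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(spoken[i - 1] - template[j - 1])
            # A frame may be stretched, compressed, or matched one-to-one.
            cost[i][j] = local + min(cost[i - 1][j],
                                     cost[i][j - 1],
                                     cost[i - 1][j - 1])
    return cost[n][m]

def recognize_isolated(spoken, templates):
    """Return the vocabulary word whose template is closest to the spoken word."""
    return min(templates, key=lambda word: dtw_distance(spoken, templates[word]))

# Hypothetical two-word vocabulary; the spoken word is a time-stretched "yes".
templates = {"yes": [1, 3, 5, 3, 1], "no": [5, 5, 1, 1]}
print(recognize_isolated([1, 1, 3, 5, 5, 3, 1], templates))
```

The stretched input still aligns with the "yes" template at zero accumulated distance, which is the nonlinear time-scale tolerance the paragraph above refers to.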
Although this technique has been effective for isolated word recognition applications, many practical applications require continuous word recognition. In continuous word recognition, the number of words in a phrase can be unlimited and the identity of the earlier words can be determined before the end of the phrase, whereas in isolated word recognition, delimiters are used to identify the beginning and ending of input patterns and recognition occurs one word at a time. Moreover, a continuous speech recognition system must distinguish an input pattern from other recognizable patterns, background noise, and speaker-induced noise such as breathing, whereas isolated word recognition usually cannot tolerate other recognizable patterns at the beginning or ending of the word.
In "Two level DP Matching--A dynamic programming based pattern matching algorithm for connected word recognition", H. Sakoe, IEEE Trans. Acoustics, Speech and Signal Processing, Vol.ASSP-27, No.6, pp.588-595, Dec. 1979, the method of whole-word template matching has been extended to deal with connected word recognition. The paper suggests a two-pass dynamic programming algorithm to find a sequence of word templates which best matches the whole input pattern. In the first pass, a score is generated which indicates the similarity between every template matched against every possible portion of the input pattern. In the second pass, the score is used to find the best sequence of templates corresponding to the whole input pattern.
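The two passes described above can be sketched roughly as follows. This is a simplified illustration, not the formulation in the cited paper: frames are assumed to be single numbers, the word-level score is an unconstrained dynamic time warping distance, and the names `dtw_distance` and `two_level_match` are hypothetical.

```python
from math import inf

def dtw_distance(segment, template):
    """Word-level score: accumulated distance of the best time alignment."""
    n, m = len(segment), len(template)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(segment[i - 1] - template[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],
                                     cost[i][j - 1],
                                     cost[i - 1][j - 1])
    return cost[n][m]

def two_level_match(pattern, templates):
    n = len(pattern)
    # First pass: score every template against every portion pattern[s:e].
    score = {}
    for s in range(n):
        for e in range(s + 1, n + 1):
            for word, tmpl in templates.items():
                score[(word, s, e)] = dtw_distance(pattern[s:e], tmpl)
    # Second pass: best[e] is the lowest total score of any template
    # sequence that exactly covers pattern[:e].
    best = [inf] * (n + 1)
    back = [None] * (n + 1)
    best[0] = 0.0
    for e in range(1, n + 1):
        for s in range(e):
            for word in templates:
                total = best[s] + score[(word, s, e)]
                if total < best[e]:
                    best[e] = total
                    back[e] = (s, word)
    # Trace back from the end of the pattern to recover the word sequence.
    words = []
    e = n
    while e > 0:
        s, word = back[e]
        words.append(word)
        e = s
    return words[::-1], best[n]

# Hypothetical two-word vocabulary and an input containing "a" then "b".
templates = {"a": [1, 2, 1], "b": [5, 6, 5]}
print(two_level_match([1, 2, 2, 1, 5, 5, 6, 5], templates))
```

Note that the first pass runs one alignment per template for every start and end pair, and the traceback can only begin once the whole pattern is available.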
This extended method has distinct disadvantages. One disadvantage of this technique is the amount of computation time it requires. Depending on the specific design requirements, this limitation may create unwarranted need for an expensive high-speed processor.
Another disadvantage of this method is that the endpoints of the input pattern must be predetermined and the whole input pattern must be stored in the system before any accurate template matching can occur. For an input pattern of any significant length, recognition response time would be substantially degraded. Also, errors in endpoint detection will seriously degrade recognizer performance. Further, the memory required to store this information may become excessive.
In "Partial Traceback and Dynamic Programming", P. Brown, J. Spohrer, P. Hochschild, and J. Baker, IEEE Trans. Acoustics, Speech and Signal Processing, Vol.ASSP-27, No.6, pp.588-595, Dec. 1979, a technique is described which allows for continuous speech recognition of arbitrarily long input patterns without predetermination of endpoints. This is accomplished using a technique called partial traceback. Partial traceback allows recognized words to be output before the input pattern is complete, without sacrificing recognizer performance. However, the partial traceback technique described appears to be burdensome on the processor and cumbersome to implement.
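One common way to picture partial traceback is as a tree of backpointers: words that lie on every still-active path can no longer change and may be output early. The sketch below illustrates only that idea. It is an assumption about the general approach rather than the procedure of the cited paper, and the `Node` class and the `chain` and `settled_words` helpers are hypothetical.

```python
class Node:
    """One recognized-word hypothesis with a backpointer to its predecessor."""
    def __init__(self, word, parent=None):
        self.word = word
        self.parent = parent

def chain(node):
    """Nodes from the root of the traceback tree down to `node`."""
    out = []
    while node is not None:
        out.append(node)
        node = node.parent
    return out[::-1]

def settled_words(active):
    """Words shared by every active path, i.e. safe to output early."""
    chains = [chain(n) for n in active]
    settled = []
    for nodes in zip(*chains):
        # Paths agree only while they pass through the very same node.
        if all(n is nodes[0] for n in nodes):
            settled.append(nodes[0].word)
        else:
            break
    return settled

# Two competing hypotheses that share only their first word.
root = Node("call")
home = Node("home", root)
rome = Node("Rome", root)
now = Node("now", home)
print(settled_words([now, rome]))
```

Here only the first word is common to both surviving paths, so it alone could be output while the remaining words stay undecided.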
Accordingly, there is a need for a continuous speech recognition system which can easily be implemented, yet can operate efficiently and inexpensively in real time.


OBJECTS AND SUMMARY OF THE INVENTION

Accordingly, it is an object of the present invention to provide an arrangement and method of speech recognition which can be implemented for real time appli

REFERENCES:
patent: 4277644 (1981-07-01), Levinson et al.
patent: 4783809 (1988-11-01), Glinski
"Syntax driven recognition of connected word by Markov model", IEEE International Conference on Acoustics, Speech and Signal Processing, San Diego, U.S., Mar. 19-21, 1984, vol. 3, pp. 35.5.1-35.5.4.
H. Sakoe, "Two Level DP Matching-A Dynamic Programming Based Pattern Matching Algorithm for Connected Word Recognition", IEEE Trans. Acoustics, Speech and Signal Processing, vol. ASSP-27, No. 6, pp. 588-595, Dec. 1979.
P. Brown et al., "Partial Traceback and Dynamic Programming", IEEE Trans. Acoustics, Speech and Signal Processing, vol. ASSP-27, No. 6, pp. 588-595, Dec. 1979.
B. A. Dautrich et al., "On the Effects of Varying Filter Bank Parameters on Isolated Word Recognition", IEEE Trans. Acoustics, Speech, and Signal Processing, vol. ASSP-31, pp. 793-806, Aug. 1983.
J. Bridle et al., "An Algorithm for Connected Word Recognition", Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 899-902, 1982.
D. Jouvet, et al., "One-Pass Syntax-Directed Connected Word Recognition in a Time-Sharing Environment", CH1945-5/84/000-0389, 1984 IEEE pp. 35.8.1-35.8.4.
J. G. Ackenhusen, "The CDTWP: A Programmable Processor for Connected Word Recognition", CH1945-5/84/0000-0390, 1984.
H. Ney, "The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition", IEEE Trans. on Acoustics, Speech and Signal Processing, vol. ASSP-32, No. 2, Apr. 1984.
J. K. Baker, "The Dragon System-An Overview", IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP-23, No. 1, Feb. 1975, pp. 24-29.
J. Peckham, et al., "A Real Time Hardware Continuous Speech Recognition System", CH1746-7/82/000-0863, 1982 IEEE.
L. R. Rabiner, et al., "A Simplified, Robust Training Procedure for Speaker Trained, Isolated Word Recognition Systems", J. Acoust. Soc. Am., vol. 68, No. 5, Nov. 1980, pp. 1271-1276.
L. R. Bahl, et al., "Decoding for Channels with Insertions, Deletions, and Substitutions with Applications to Speech Recognition", IEEE Trans. on Information Theory, vol. IT-21, Jul. 1975, pp. 404-411.
J. C. Spohrer et al., "Partial Traceback in Continuous Speech Recognition", Proc. Int. Conf. Cybernetics and Society, Boston, Oct. 1980.
