Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1997-09-04
1999-10-19
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704233, 704244, 704256, G10L 506
Patent
active
059704527
DESCRIPTION:
BRIEF SUMMARY
BACKGROUND OF THE INVENTION
In many technical processes, pattern recognition acquires increased importance, since an increasing degree of automatization can thereby be achieved. Pattern recognition processes can as a rule be reduced to a time-variant measurement signal derived in a suitable way from the patterns to be recognized. However, in the automatic analysis of this measurement signal the problem arises that these measurement signals are not present in pure form, but rather are overlaid with stationary or non-stationary disturbing signals. In the examination of measurement signals derived from naturally uttered speech, these disturbing portions of the measurement signal are for example caused by background noises, breathing noises, machine noises, or also by the recording medium and the transmission path. Since the measurement signal is never present in pure form, it is particularly important to distinguish between the portions of the measurement signal containing the pattern to be recognized and other portions in which no pattern is present. For the better recognition of the patterns, it is thus particularly important to know exactly when patterns are present in the measurement signal and when no patterns, i.e. signals not resulting from the pattern are present as pause signals in the measurement signal.
A pause detection is for example also important in order to achieve a reduction in the quantity of the transmitted data, for example in speech communication channels and also in satellite transmission, for general distinguishing of useful signal from disturbing signal in signal processing, or else to find the end of an expression in the automatic speech recognition system. A robust pause detector thereby serves for the improvement of the efficiency of speech-controlled systems. This holds in particular for speech recognition systems, since what is concerned there is the comparison of a spoken expression as a pattern with an already-existing version. The problematic of pause determination specifically in automatic speech recognition has been described extensively by Rabiner (L. R. Rabiner and M. Sambur (1995), "An Algorithm for Determining the Endpoints of Isolated Utterances", The Bell system Technical Journal, 54(2), pages 297-315). He has also indicated an algorithm for pause detection. There, for pause detection items of information are taken into account that are calculated directly from the sampled time signal (energy, zero crossing rate, etc.). This procedure is common to all known pause detectors (J. H. Hansen, "Speech Enhancement Employing Boundary Detection and Morphological Based Spectral Constraints", IEEE International Conference On Acoustics, Speech and Signal Processing, pages 901-904, Toronto, ICASSP). As a rule, they use a more or less complicated control apparatus to carry out the classification of the pauses from the calculated features. As an alternative, statistical classifiers have also been used (H. Katterfeldt, "Sprachbestimmung mit Polynom Klassifikatoren", Proceedings Mustererkennung 7, DAGM-Symposium, Erlangen, pages 180-184). Due to this procedure, all these methods can operate only up to a certain disturbance level. The limit depends on the type of disturbance. They can no longer be used with small signal-noise ratios, since as a rule pause detectors are threshold-controlled. However, given very low signal to noise ratios, in environments with disturbances the current decision criteria with thresholds fail. In addition, there are non-stationary disturbances with a character similar to a signal, which can hardly be detected.
Previous approaches to the determination of speech pauses use e.g. a local parameter, i.e. one obtained on the basis of a temporal or, respectively, spectral item of frame information, for the detection of signal or, respectively, non-signal regions (S. Boll, (1979), "Suppression of Acoustic Noise In Speech Using Spectral Subtraction", IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. ASS-27, No. 2, pages 113-120; and B. Widrow et al,
REFERENCES:
patent: 4481593 (1984-11-01), Bahler
patent: 4713777 (1987-12-01), Klovstad et al.
patent: 4811399 (1989-03-01), Landell et al.
patent: 4918687 (1990-04-01), Bustini et al.
patent: 5226091 (1993-07-01), Howell et al.
patent: 5293452 (1994-03-01), Picone et al.
patent: 5369728 (1994-11-01), Kosaka et al.
patent: 5465317 (1995-11-01), Epstein
patent: 5611019 (1997-03-01), Nakatoh et al.
Pattern Recognition, vol. 27, No. 10, Oct. 1994, Bose et al, Connected and Degraded Text Recognition Using Hidden Markov Model, pp. 1345-1363.
American Telephone and Telegraph Company, The Bell System Technical Journal, vol. 54, No. 2, Feb. 1975, Rabiner et al, An Algorithm for Determining the Endpoints of Isolated Utterances, pp. 297-315.
IEEE International Conference on Acoustics, Speech and Signal Processing, (1991), J.H. Hansen, Speech Enhancement Employing Adaptive Boundary Detection and Morphological Based Spectral Constraints, pp. 901-904.
DAGM-Symposium, Erlangen, H. Katterfeldt, Sprachbestimmung Mit Polynom Klassifikatoren, pp. 180-184. (In German).
IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP 27, No. 2, Apr. 1979, Steven Boll, Suppression of Acoustic Noise in Speech Using Spectral Subtraction, pp. 113-120.
Proceedings of the IEEE, vol. 63, No. 12, (1975), B. Widrow et al, Adaptive Noise Cancelling: Principles and Applications, pp. 1692-1716.
IEEE Transactions on Acoustics, Speech and Signal Processing, (1986), Rabiner et al, An Introduction to Hidden Markov Models, pp. 4-16.
Aktas Abdulmesih
Zunkler Klaus
Hudspeth David R.
Siemens Aktiengesellschaft
Storm Donald L.
LandOfFree
Method for detecting a signal pause between two patterns which a does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for detecting a signal pause between two patterns which a, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for detecting a signal pause between two patterns which a will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2069268