Speech detection system and method

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S219000

Reexamination Certificate

active

06757651

ABSTRACT:

FIELD OF THE INVENTION
This invention relates generally to user interfaces and, more specifically, to speech detection.
BACKGROUND OF THE INVENTION
In speech detection systems, energy contour of an inputted signal is a major factor when detecting the beginning and ending of speech sequences. This is because the level of the input speech data is often greater than the level of the background noise. An energy contour-based speech detection algorithm (SDA) contains noise evaluation, beginning of speech detection, and end of speech detection.
At the initial second that the system starts, it is assumed that the input signal to a SDA consists only of noise. At this point, the input signal is made equal to the input noise level. If the energy of the current signal rises above the energy of the input noise level, speech is assumed to be included in the current signal. If the energy of the current signal drops a threshold amount below the initial noise level, speech is assumed to not be occurring in the current signal.
The above process works well when the noise stays at a consistent level (i.e., white noise). However, there exist many environments where the noise is not so obliging. For example, if the environment is a vehicle, extraneous noises such as car horns, sirens, passing truck noise, etc. can be included in the input signal to be evaluated by a Speech Recognition Engine (SRE). Absent an appropriate mechanism to adjust for the extraneous noises, the SRE will process the noise as if it were speech, resulting in suboptimal speech recognition. Therefore, there exists a need for better speech detection in a noisy environment.
SUMMARY OF THE INVENTION
The present invention comprises a system, method and computer program product for performing speech detection. The method first receives a sound signal and determines if the energy value of the received sound signal is above a threshold energy value. If the energy level of the received signal is above the threshold energy value, the method determines a predictive signal of the received signal, subtracts the predictive signal from the received signal, and determines if the result of the subtraction indicates the presence of speech. If it is determined that no speech is present, the threshold energy value is set to the energy level of the present received signal. If it is determined that the result of the subtraction indicates the presence of speech, the received signal is sent to a speech recognition engine.
In accordance with further aspects of the invention, the speech recognition engine generates control system commands for controlling one or more system components. The system components are vehicle system components.
As will be readily appreciated from the foregoing summary, the invention provides an improved method for performing preprocessing of sound signals for more efficient use in subsequent speech processing.


REFERENCES:
patent: 4052568 (1977-10-01), Jankowski
patent: 4625083 (1986-11-01), Poikela
patent: 5263181 (1993-11-01), Reed
patent: 5857169 (1999-01-01), Seide
patent: 6064323 (2000-05-01), Ishii et al.
Thomas W. Parsons, Voice and Speech Processing, 1987, McGraw-Hill, Inc., pp. 136-141.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Speech detection system and method does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Speech detection system and method, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech detection system and method will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3337162

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.