Apparatus and methods for detecting onset of a signal

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S233000

Reexamination Certificate

active

06240381

ABSTRACT:

BACKGROUND OF THE INVENTION
Apparatus and methods consistent with the present invention relate generally to detecting onset of a signal event, and in particular to apparatus and methods for detecting onset of a voicing event.
To analyze speech accurately, the point in time at which speech starts must be determined. Previous methods use a set time interval during which data is sampled and averaged over hundreds of data points. This can blur and distort time critical factors.
Raw voice data is very random and only some of the information is valuable for recognizing parts of speech. Several prior art techniques attempt to reduce the amount of randomness by processing the data into a more stable form. Typically, this has involved smoothing algorithms, which involve averaging the data. For example, a data point being analyzed is revalued by averaging the data point being smoothed with the two data points on either side of the data point being smoothed. Thus, the average of five data points is used to create the new value. This averaging, however, causes blurring of the data both in amplitude and in time. In many cases, data only exists for a portion of a millisecond. At 8 kHz sampling rate, which is a very typical sampling rate for many speech applications, the data is blurred over a 1.25 millisecond area. Thus, vital data is being destroyed by the very process of making it more useable for the algorithmic methods used to evaluate the data.
Windowing methods are another very common method of analyzing the data. Large window durations of time are often used, on the order of 25 milliseconds. The data is evaluated and averaged, with the average being calculated every 5 milliseconds. This creates a problem, for example, when analyzing information that has a just noticeable difference of one to two milliseconds. A just noticeable difference is a threshold at which a human is able to detect that a stimulus had changed, which occurs in a range of one to two milliseconds. Typically, windowing methods start sampling data at an arbitrary point in time that has no relationship to relevant portions of the data. Because of the arbitrary and random nature of the windowing, there is no way to determine where events of interest occur. An event could be bisected in the middle, thus distorting it even further. Even with smoothing the data is still too random in its motion to be able to detect the sudden onset of a signal in the midst of the randomness of noise.
The very act of arbitrary segmentation also imposes a granularity on the data. For example, if a segment is 128 samples in duration at a 44,100 Hz sampling rate, then the smallest unit of measure possible is 5.8 milliseconds, or twice the sampling rate of 2.9 milliseconds per sample (based on the Nyquist rule of two times oversampling).
Therefore, prior art smoothing techniques blur the data in both amplitude and time. Even with smoothing, the raw data in the prior art is too random to distinguish any significant features against the background of noise.
What is needed is a way to accurately determine event onset time so that signal details surrounding the event can be properly analyzed.
SUMMARY OF THE INVENTION
Systems and methods consistent with the present invention detect voice onset by distinguishing random noise from a repetitive and constant signal. This is accomplished by receiving a signal having a series of data points representing a physical event, forming a smoothed signal by selectively modifying a current data point in the series of data points based on an average of data points previous to the current data point in the series, and analyzing the smoothed signal to determine a rate of signal change indicating onset of an event.
Additional advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objects and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. Both the foregoing general description and the following detailed description are exemplary and explanatory only, and not restrict of the invention, as claimed.


REFERENCES:
patent: 4630305 (1986-12-01), Borth et al.
patent: 4959865 (1990-09-01), Stettiner et al.
patent: 5602959 (1997-02-01), Bergstrom et al.
patent: 5649055 (1997-07-01), Gupta et al.
patent: 5710862 (1998-01-01), Urbanski
patent: 5787388 (1998-07-01), Hayata
patent: 5826230 (1998-10-01), Reaves
patent: 5884257 (1999-03-01), Maekawa et al.
patent: 6061651 (2000-05-01), Nguyen
Malah et al., “Tracking Speech-Presence Uncertainty to Improve Speech Enhancement in Non-Stationary Noise Environments,” 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 789-792, Mar. 1999.*
Scalart et al., “Speech Enhancement Based on A Priori Signal to Noise Estimation,” 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 629-632, May 1996.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Apparatus and methods for detecting onset of a signal does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Apparatus and methods for detecting onset of a signal, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus and methods for detecting onset of a signal will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2494726

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.