Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1997-01-28
1999-07-27
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704219, 704233, 704262, G10L 914, G10L 702
Patent
active
059307498
ABSTRACT:
A system for processing a signal representing acoustical information performs a linear predictive coding (LPC) analysis and segments the signal into music, speech and noise components (including channel noise and acoustic artifacts) in accordance with behavior, over time, of the poles describing the signal, resulting from the LPC analysis. Poles exhibiting behavior characteristic of speech, music and channel noise of interest may then be selected while other poles representing random noise or information which is not of interest are suppressed. A "cleaned" signal can then be synthesized, with or without additional pre-processing to further suppress unwanted components of the signal. Additionally or alternatively, tags can be applied to frames or groups of frames of the original signal to control application of decoding procedures or speech recognition algorithms. Alternatively, the synthesized "cleaned" signal may be used as an input to a vector quantizer for training of codebooks and channel assignments for optimal processing of the original signal.
REFERENCES:
patent: 5298674 (1994-03-01), Yun
patent: 5375188 (1994-12-01), Serikawa et al.
patent: 5457769 (1995-10-01), Valley
John D. Hoyt and Harry Wechsler, "Detection of Human Speech in Structured Noise," Proc. IEEE ICASSP 94, vol. II, pp. 237-240, Apr. 1994.
John D. Hoyt and Harry Wechsler, "RBF Models for Detection of Human Speech in Structured Noise", Proc. IEEE Conf. on Neural Networks, pp. 4493-4496, Jun. 1994.
John D. Hoyt and Harry Wechsler, "Detection of Human Speech using Hybrid Recognition Models," Proc. 12th International Conf. on Pattern Recognition, pp. 330-333, Oct. 1994.
Richard O. Duda and Peter E. Hart, Pattern Classification and Scene Analysis, Wiley-Interscience, p. 24, 1973.
John R. Deller, Jr., John G. Proakis, and John H. L. Hansen, Discrete-Time Processing of Speech Signals, Prentice-Hall, pp. 65 and 878, 1987.
Hudspeth David R.
International Business Machines - Corporation
Smits Talivaldis Ivars
Tassinari, Esq. Robert P.
LandOfFree
Monitoring, identification, and selection of audio signal poles does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Monitoring, identification, and selection of audio signal poles , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Monitoring, identification, and selection of audio signal poles will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-892491