Computer-implemented methods and systems for modeling and...

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S223000, C704S200100, C704S208000, C704S220000, C704S203000, C704S201000, C704S216000, C704S219000, C704S217000, C704S209000

Reexamination Certificate

active

07636659

ABSTRACT:
In accordance with the present invention, computer implemented methods and systems are provided for representing and modeling the temporal structure of audio signals. In response to receiving a signal, a time-to-frequency domain transformation on at least a portion of the received signal to generate a frequency domain representation is performed. The time-to-frequency domain transformation converts the signal from a time domain representation to the frequency domain representation. A frequency domain linear prediction (FDLP) is performed on the frequency domain representation to estimate a temporal envelope of the frequency domain representation. Based on the temporal envelope, one or more speech features are generated.

REFERENCES:
patent: 4856068 (1989-08-01), Quatieri, Jr. et al.
patent: 5491771 (1996-02-01), Gupta et al.
patent: 5651090 (1997-07-01), Moriya et al.
patent: 5745873 (1998-04-01), Braida et al.
patent: 5774559 (1998-06-01), Feng
patent: 5787387 (1998-07-01), Aguilar
patent: 5848384 (1998-12-01), Hollier et al.
patent: 6073093 (2000-06-01), Zinser, Jr.
patent: 6091773 (2000-07-01), Sydorenko
patent: 6115684 (2000-09-01), Kawahara et al.
patent: 6182030 (2001-01-01), Hagen et al.
patent: 6311153 (2001-10-01), Nakatoh et al.
patent: 6424939 (2002-07-01), Herre et al.
patent: 6502069 (2002-12-01), Grill et al.
patent: 6680972 (2004-01-01), Liljeryd et al.
patent: 6718301 (2004-04-01), Woods
patent: 6721698 (2004-04-01), Hariharan et al.
patent: 7127389 (2006-10-01), Chazan et al.
patent: 7177803 (2007-02-01), Boillot et al.
patent: 2003/0187635 (2003-10-01), Ramabadran et al.
patent: 2003/0187663 (2003-10-01), Truman et al.
patent: 2004/0054526 (2004-03-01), Chazan et al.
“Musical Sound Signal Analysis/Synthesis: Sinusoidal+Residual and Elementary Waveform Models” Rodet, TFTS, 1997.
Audio coding with warped predictive methods Harma, Helsinki University of Technology, 1998.
Athineos et al., “Frequency-domain linear prediction for temporal features,” Proc. IEEE ASRU Workshop, St. Thomas, US Virgin Islands. Dec. 2003.
Athineos et al., “Sound texture modeling with linear prediction in both time and frequency domains,” Proc. ICASSP, vol. 5, pp. 648-651. 2003.
Athineos et al., “PLP2: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns,” Submitted to SAPA-04, Jeju Island, Korea. Oct. 2004.
Brown, “The acoustic-modeling problem in automatic speech recognition,” Ph.D. dissertation, Computer Science Department, Carnegie Mellon University. 1987.
Cetin et al., “Cross-stream observation dependencies for multi-stream speech recognition,” Eurospeech, Geneva, Switzerland. 2003.
Cole et al., “Spoken letter recognition,” Advances in Neural Information Processing Systems 3, Morgan Kaufmann Publishers, Inc., pp. 385-390. 1990.
Dubnov et al., “Synthesizing sound textures through wavelet tree learning,” IEEE CGA, vol. 22, No. 4, pp. 38-48, Jul./Aug. 2002.
Goodwin, “Residual Modeling in Music Analysis-Synthesis,” Proc. ICASSP, vol. 2, pp. 1005-1008. 1996.
Hermansky et al., “Analysis and synthesis of speech based on spectral transform linear predictive method,” Proc. ICASSP, vol. 8, pp. 777-780. Apr. 1983.
Hermansky, “Should recognizers have ears?,” Speech Communication, vol. 25, pp. 3-27. 1998.
Hermansky et al., “TRAPS—Classifiers of temporal patterns,” Proc. ICSLP, Sydney, Australia. 1998.
Hermansky, “Exploring temporal domain for robustness in speech recognition,” Proc. of 15th International Congress on Acoustics, vol. II, Trondheim, Norway. Jun. 1995.
Hermansky et al., “Temporal patterns (TRAPs) in ASR of noisy speech,” Proc. ICASSP, vol. 1, pp. 289-292. Mar. 1999.
Hermansky et al., “RASTA processing of speech”. IEEE Trans. Speech and Audio Processing. vol. 2, No. 4, pp. 578-589. Oct. 1994.
Hermansky et al., “Tandem Connectionist feature extraction for conventional hmm systems,” Proc. ICASSP, Istanbul, Turkey. 2000.
Hermansky, “Perceptual linear predictive (PLP) analysis of speech,” J. Acoust. Soc. Am. vol. 87(4). Apr. 1990.
Herre et al., “Enhancing the performance of Perceptual Audio Coders by using Temporal Noise Shaping (TNS),” Proc. 101st AES Conv. Nov. 1996.
Jain et al., “Beyond a single critical-band in TRAP based ASR,” Proc. Eurospeech, Geneva, Switzerland. Nov. 2003.
Kay, “Modern Spectral Estimation: Theory & Application,” Prentice-Hall. 1988.
Klein et al., “Robust spectro-temporal reverse correlation for the auditory system: Optimizing stimulus design,” J. Comput. Neurosci, vol. 9. 2000.
Koenig et al., “The sound spectrograph,” J. Acoust. Soc. Am., vol. 18, No. 1, pp. 19-49. 1946.
Makino et al., “Recognition of consonant based on the perception model,” Proc. ICASSP, Boston, MA. 1983.
Markel et al., “Linear Prediction of Speech,” Springer-Verlag. 1976.
Rabiner et al. “Digital processing of speech signals,” Prentice Hall. 1978.
Saint-Arnaud et al., “Analysis and Synthesis of Sound Textures,” Computational Auditory Scene Analysis, D.F. Rosenthal and H.G. Okuno Eds., pp. 293-308, LEA. 1997.
Schwarz et al., “Recognition of phoneme strings using TRAP technique,” Proc. Eurospeech, Geneva, Switzerland. Sep. 2003.
Serra, “Musical sound Modeling with Sinusoids plus noise,” Musical Signal Processing, G. De Poli et al., Ed. Swets & Zeitlinger. 1997.
Shamma et al., “Ripple analysis in ferret primary auditory cortex: I. Response characteristics of single units to sinusoidally rippled spectra,” Aud. Neurosci, vol. 1. 1995.
Sharma et al., “Feature extraction using non-linear transformation for robust speech recognition on the AURORA data-base,” Proc. ICASSP, Istanbul, Turkey. 2000.
Somervuo et al., “Feature transformations and combinations for improving ASR performance,” Eurospeech, Geneva, Switzerland. 2003.
Terhardt, “On the perception of periodic sound fluctuation (roughness),” Acustica, vol. 30, pp. 201-213. 1974.
Thornburg et al., “A flexible Analysis-Synthesis Method for Transients,” Proc. ICMC. 2000.
Tribolet et al., “Frequency domain coding of speech,” IEEE Trans. ASSP, vol. 27, No. 5, pp. 512-530. Oct. 1979.
Verma et al., “Transient Modeling Synthesis: A flexible analysis/synthesis tool for transient signals,” Proc. ICMC, vol. 2, pp. 164-167. 1997.
Non Final Office Action for U.S. Appl. No. 11/000,874, mailed on Oct. 29, 2007.
Non Final Office Action for U.S. Appl. No. 11/000,874, mailed on Jun. 23, 2008.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Computer-implemented methods and systems for modeling and... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Computer-implemented methods and systems for modeling and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Computer-implemented methods and systems for modeling and... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4149274

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.