Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2006-06-20
Chawan, Vijay (Department: 2654)
C704S200100, C704S203000, C704S003000, C704S500000
active
07065485
ABSTRACT:
The method and preprocessor enhance the intelligibility of narrowband speech without appreciably lengthening the overall duration of the signal. Spectral enhancement and variable-rate time-scaling are both applied to improve the salience of initial consonants, particularly the perceptually important formant transitions. Emphasis is transferred from the dominant vowel to the preceding consonant by adapting the phoneme timing structure. In a further embodiment, the technique is applied as a preprocessor to a speech coder.
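The timing trade-off described in the abstract can be illustrated with a small sketch. It is not the patented procedure: the function names, the use of millisecond durations, and the single consonant-vowel pair are assumptions made for illustration only. The sketch shows one way to pick a vowel compression factor so that stretching the preceding consonant leaves the total duration unchanged.

```python
def compensating_vowel_factor(consonant_len, vowel_len, consonant_factor):
    """Return the vowel time-scale factor that keeps total duration unchanged.

    Hypothetical helper, not taken from the patent. Lengths may be in any
    consistent unit (milliseconds here). A consonant_factor above 1 stretches
    the consonant; the returned factor is then below 1 (vowel compression).
    """
    if consonant_len < 0 or vowel_len <= 0 or consonant_factor <= 0:
        raise ValueError("lengths and factor must be positive")
    stretched_consonant = consonant_len * consonant_factor
    remaining_for_vowel = consonant_len + vowel_len - stretched_consonant
    if remaining_for_vowel <= 0:
        raise ValueError("consonant stretch leaves no time for the vowel")
    return remaining_for_vowel / vowel_len


def plan_segment_lengths(consonant_len, vowel_len, consonant_factor):
    """Return (new_consonant_len, new_vowel_len) with the same total length."""
    vowel_factor = compensating_vowel_factor(
        consonant_len, vowel_len, consonant_factor
    )
    return consonant_len * consonant_factor, vowel_len * vowel_factor


if __name__ == "__main__":
    # Assumed example: a 40 ms consonant stretched by 1.5x before a 160 ms vowel.
    new_consonant, new_vowel = plan_segment_lengths(40, 160, 1.5)
    print(new_consonant, new_vowel, new_consonant + new_vowel)
```

In a real system the resulting factors would drive a time-scale modification method such as the overlap-add techniques listed in the references below; the sketch covers only the duration budgeting.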
REFERENCES:
patent: 4692941 (1987-09-01), Jacks et al.
patent: 4820059 (1989-04-01), Miller et al.
patent: 4979212 (1990-12-01), Yamada et al.
patent: 5327521 (1994-07-01), Savic et al.
patent: 5553151 (1996-09-01), Goldberg
patent: 5611018 (1997-03-01), Tanaka et al.
patent: 5625749 (1997-04-01), Goldenthal et al.
patent: 5729658 (1998-03-01), Hou et al.
patent: 5752222 (1998-05-01), Nishiguchi et al.
patent: 5774837 (1998-06-01), Yeldener et al.
patent: 5828995 (1998-10-01), Satyamurti et al.
patent: 5864812 (1999-01-01), Kamai et al.
patent: 5903655 (1999-05-01), Salmi et al.
patent: 6026361 (2000-02-01), Hura
patent: 6104822 (2000-08-01), Melanson et al.
patent: 6233550 (2001-05-01), Gersho et al.
patent: 6285979 (2001-09-01), Ginzburg et al.
patent: 6304843 (2001-10-01), Choi et al.
patent: 6413098 (2002-07-01), Tallal et al.
patent: 6563931 (2003-05-01), Soli et al.
patent: 6691082 (2004-02-01), Aguilar et al.
patent: 6745155 (2004-06-01), Andringa et al.
patent: 6850577 (2005-02-01), Li
patent: 2001/0015968 (2001-08-01), Sicher et al.
patent: 2002/0133332 (2002-09-01), Bu et al.
patent: 2003/0093282 (2003-05-01), Goodwin
patent: 2004/0120309 (2004-06-01), Kurittu et al.
Sanneck, H. et al., A new technique for audio packet loss concealment, Nov. 18-22, 1996, GLOBECOM '96, pp. 48-52.
Roelands, Marc et al., Waveform similarity based overlap-add (WSOLA) for time-scale modification of speech: structures and evaluation, EUROSPEECH '93, pp. 337-340.
Wayman, J.L. et al., High quality speech expansion, compression, and noise filtering using the SOLA method of time scale modification, Oct. 30-Nov. 1, 1989, Signals, Systems and Computers, Twenty-Third Asilomar Conference, vol. 2, pp. 714-717.
Wong, P.H.W. et al., On improving the intelligibility of synchronized overlap-and-add (SOLA) at low TSM factor, Dec. 2-4, 1997, TENCON '97. IEEE Region 10 Annual Conference. SITCT, vol. 2, pp. 487-490.
Covell, M., et al., MACH1: nonuniform time-scale modification of speech, Acoustics, Speech, and Signal Processing, May 12-15, 1998, ICASSP '98. Proceedings of the 1998 IEEE International Conference, vol. 1, pp. 349-352.
Erogul, O. et al., Time-scale modification of speech signals for language-learning impaired children, May 20-22, 1998, Biomedical Engineering Days, 1998. Proceedings of the 1998 2nd International Conference, pp. 33-35.
Ross, K.N. et al., A dynamical system model for generating fundamental frequency for speech synthesis, May 1999, Speech and Audio Processing, IEEE Transactions, vol. 7, Issue 3, pp. 295-309.
Yong, M., et al., Study of voice packet reconstruction methods applied to CELP speech coding, Mar. 23-26, 1992, ICASSP-92, IEEE International Conference, vol. 2, pp. 125-128.
Verhelst, Werner, “Overlap-add Methods for Time-Scaling of Speech,” Speech Communications, 30 (2000), pp. 207-221.
David Kapilow, et al., “Detection of Non-Stationarity in Speech Signals and Its Application to Time-Scaling,” 6th European Conference on Speech Communication and Technology, Sep. 5-9, 1999, Budapest, Hungary, vol. 5, pp. 2307-2310.
Hazan, Valerie, et al., “The Effect of Cue-Enhancement on the Intelligibility of Nonsense Word and Sentence Materials Present in Noise,” Speech Communication, 4 (1998), pp. 211-226.
Balakrishnan, Uma, et al., “Consonant Recognition for Spectrally Degraded Speech as a Function of Consonant-Vowel Intensity Ratio,” Journal of the Acoustical Society, 99(6), Jun. 1996, pp. 3758-3768.
Gordon-Salant, “Recognition of Natural and Time/Intensity Altered CVs by Young and Elderly Subjects with Normal Hearing,” Journal of the Acoustical Society, 80(6), Dec. 1986, pp. 1599-1607.
Furui, Sadaoki, “On the Role of Spectral Transition for Speech Perception,” Journal of the Acoustical Society of America, 80(4), Oct. 1986, pp. 1016-1025.
Dorman, M.F., et al., “Phonetic Identification by Elderly Normal and Hearing-Impaired Listeners,” Journal of the Acoustical Society of America, 77(2), Feb. 1985, pp. 664-670.
Steven, Kenneth N., Phonetic Linguistics, ISBN 0-12268990-9, Academic Press, Inc., 1985, pp. 243-255.
Huggins, A.W.F., “Just Noticeable Differences for Segment Duration in Natural Speech,” Journal of the Acoustical Society of America, 51(4), 1972, pp. 1270-1278.
Miller, George A., et al., “An Analysis of Perceptual Confusions Among Some English Consonants,” Journal of the Acoustical Society of America, 27(2), Mar. 1955, pp. 338-352.
Chong-White Nicola R.
Cox Richard Vandervoort
AT&T Corp
Chawan Vijay
Pierre Myriam