Apparatus and method for extracting syllabic nuclei

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S200000, C704S205000, C704S206000, C704S207000, C704S208000, C704S211000, C704S214000, C704S231000, C704S233000, C704S236000

Reexamination Certificate

active

07627468

ABSTRACT:
An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; cepstral analysis unit estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.

REFERENCES:
patent: 3649765 (1972-03-01), Rabiner et al.
patent: 4802223 (1989-01-01), Lin et al.
patent: 5479560 (1995-12-01), Mekata
patent: 5577160 (1996-11-01), Hosom et al.
patent: 5596680 (1997-01-01), Chow et al.
patent: 5630015 (1997-05-01), Kane et al.
patent: 5675705 (1997-10-01), Singhal
patent: 5710865 (1998-01-01), Abe
patent: 5732392 (1998-03-01), Mizuno et al.
patent: 5893058 (1999-04-01), Kosaka
patent: 5940794 (1999-08-01), Abe
patent: 6535851 (2003-03-01), Fanty et al.
patent: 7035798 (2006-04-01), Kobayashi
patent: 7043430 (2006-05-01), Chung et al.
patent: 7231346 (2007-06-01), Yamato et al.
patent: 2002/0051955 (2002-05-01), Okutani et al.
patent: 2003/0014245 (2003-01-01), Brandman
patent: 2004/0133424 (2004-07-01), Ealey et al.
patent: 2005/0165604 (2005-07-01), Hanazawa
patent: 2006/0053003 (2006-03-01), Suzuki et al.
patent: 1-244499 (1989-09-01), None
patent: 10-260697 (1998-09-01), None
patent: 2001-306087 (2001-11-01), None
Wayne Lea; Medress, M.; Skinner, T., “A prosodically guided speech understanding strategy,” Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , vol. 23, No. 1, pp. 30-38, Feb. 1975.
Mercier, G.; Callec, A.; Monne, J.; Querre, M.; Trevarain, O., “Automatic segmentation, recognition of phonetic units and training in the KEAL speech recognition system,” Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82. , vol. 7, no., pp. 2000-2003, May 1982.
Schmidbauer, O., “Syllable-based segment-hypotheses generation in fluently spoken speech using gross articulatory features,” Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87. , vol. 12, no., pp. 391-394, Apr. 1987.
Ronald W. Schafer and Lawrence R. Rabiner, System for Automatic Formant Analysis of Voiced Speech. J. Acoust. Soc. Am. 47, 634 (1970).
McCandless, S., “An algorithm for automatic formant extraction using linear prediction spectra,” Acoustics, Speech and Signal Processing, IEEE Transactions on , vol. 22, No. 2, pp. 135-141, Apr. 1974.
De Mori, R.; Laface, P.; Piccolo, E., “Automatic detection and description of syllabic features in continuous speech,” Acoustics, Speech and Signal Processing, IEEE Transactions on , vol. 24, No. 5, pp. 365-379, Oct. 1976.
Medress, M.; Diller, T.; Kloker, D.; Lutton, L.; Oredson, H.; Skinner, T., “An automatic word spotting system for conversational speech,” Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '78. , vol. 3, no., pp. 712-717, Apr. 1978.
. C. Bagshaw, Automatic Prosodic Analysis for Computer Aided Pronunciation Teaching, Ph.D., University of Edinburgh, pp. 1-266, 1994.
Buckow, Jan / Batliner, Anton / Huber, Richard / Nöth, Elmar / Warnke, Volker / Niemann, Heinrich (1998): “Dovetailing of acoustics and prosody in spontaneous speech recognition”, In ICSLP-1998, paper 0336, pp. 1-4.
Mokhtari, Parham., et al. “Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech.” Proc. 3rd Int. Conf. on Language Resources and Evaluation, 2002, Las Palmas, Canary Islands, Spain pp. 2015-2018.
Mokhtari, Parham., et al. “Perceptual validation of a voice quality parameter AQ automatically measured in acoustic islands of reliability.” Proc. Meeting of the Acoust. Soc. Of Japan, 2002, Kanagawa University, Japan, Paper 1-P-25, pp. 401-402.
M. Fujimoto, et al. “Support System for Speech Segment Extraction” A-14, General Conference of the institute of Electronics, Information and Communication Engineers, Spring 1989, pp. 14.
J. Sundberg “The Science of the Singing Voice” Northern Illinois University Press, Dekalb, Illinois, 1987, pp. 74-89.
P. Alku, et al. “Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering” Speech Communication, 18(2), pp. 131-138, 1996.
D. Hermes “Measurement of pitch by subharmonic summation”, J. Acoust. Soc. Am. 83 (1), pp. 267-264 , 1988.
P. Mermelstein “Automatic segmentation of speech into syllabic units”, J. Acoust. Soc. Am. 58(4), pp. 880-883, 1975.
W.A. Lea, at al. “Prosodic aids to speech recognition”, in Lea, W.A. (ed.), Trends in Speech Recognition, Prentice-Hall, New Jersey, pp. 166-206, 1972.
D. J. Broad, “Format estimation by linear transformation of the LPC cepstrum”, J. Accust. Soc. Am. 86(5), pp. 2013-2017, 1989.
P. Mokhtari, et al. “Some articulatory correlates of emotion variability in speech: a preliminary study on spoken Japanese vowels”, in Proc. Int. Conf. on Speech Process., Taejon, Korea, 2001, pp. 431-436.
G.E. Peterson, et al. “A physiological theory of phonetics”, J. Speech Hear. Res. 9, 1966, pp. 5-67.
A. Iida, at al. “Acoustic nature and perceptual testing of corpora of emotional speech”, in Proc. 5th Int. Conf. on Spoken Lang. Process., 1998, pp. 1559-1562.
P. Mokhtari, at al. “Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech”, in Proc. 3rd Int. Conf. on Language Resources and Evaluation, Las Palmas, Canary Islands, Spain, 2002, pp. 2015-2018.
A. Bayya, et al. “Towards feature-based speech metric”, in Proc. IEEE Int. Conf. on Acoust., Speech, and Sig. Process., 1990, pp. 781-784.
J. Högberg “Prediction of format frequencies from linear combinations of filterbank and cepstral coefficients”, KTH-STL-QPSR, Royal Inst. Of Tech. Stockholm, Sweden, 1997, pp. 41-49.
W. A. Lea, et al. “Algorithms for acoustic prosodic analysis”, in Pros, IEEE Int. Conf. on Acoust., Speech, and Sig. Process., 1984, 42.7.1-42.7.4.
P. Mokhtari, at al. “Perceptual validation of a voice quality parameter AQ automatically measured in acoustic islands of reliability”, in Proc. Meeting of the Acoust. Soc. Of Japan, Kanagawa Univ., Japan, Paper 1-P-25, 2002a, pp. 401-402.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Apparatus and method for extracting syllabic nuclei does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Apparatus and method for extracting syllabic nuclei, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus and method for extracting syllabic nuclei will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4073106

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.