Data processing: speech signal processing – linguistics – language – Speech signal processing – Application
Reexamination Certificate
2007-05-22
2007-05-22
Hudspeth, David (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Application
C455S413000, C379S088010
Reexamination Certificate
active
10194908
ABSTRACT:
A system and method are provided for detecting emotional states using statistics. First, a speech signal is received. At least one acoustic parameter is extracted from the speech signal. Then statistics or features from samples of the voice are calculated from extracted speech parameters. The features serve as inputs to a classifier, which can be a computer program, a device or both. The classifier assigns at least one emotional state from a finite number of possible emotional states to the speech signal. The classifier also estimates the confidence of its decision. Features that are calculated may include a maximum value of a fundamental frequency, a standard deviation of the fundamental frequency, a range of the fundamental frequency, a mean of the fundamental frequency, and a variety of other statistics.
REFERENCES:
patent: 3691652 (1972-09-01), Clynes
patent: 3855416 (1974-12-01), Fuller
patent: 3971034 (1976-07-01), Bell, Jr. et al.
patent: 4093821 (1978-06-01), Williamson
patent: 4142067 (1979-02-01), Williamson
patent: 4216594 (1980-08-01), Farley et al.
patent: 4472833 (1984-09-01), Turrell et al.
patent: 4490840 (1984-12-01), Jones
patent: 4592086 (1986-05-01), Watari et al.
patent: 4602129 (1986-07-01), Matthews et al.
patent: 4696038 (1987-09-01), Doddington et al.
patent: 4931934 (1990-06-01), Snyder
patent: 4996704 (1991-02-01), Brunson
patent: 5163083 (1992-11-01), Dowden et al.
patent: 5410739 (1995-04-01), Hart
patent: 5461697 (1995-10-01), Nishimura et al.
patent: 5495553 (1996-02-01), Jakatdar
patent: 5539861 (1996-07-01), DeSimone
patent: 5647834 (1997-07-01), Ron
patent: 5666400 (1997-09-01), McAllister et al.
patent: 5704007 (1997-12-01), Cecys
patent: 5734794 (1998-03-01), White
patent: 5774591 (1998-06-01), Black et al.
patent: 5774859 (1998-06-01), Houser et al.
patent: 5812977 (1998-09-01), Douglas
patent: 5860064 (1999-01-01), Henton
patent: 5884247 (1999-03-01), Christy
patent: 5893057 (1999-04-01), Fujimoto et al.
patent: 5897616 (1999-04-01), Kanevsky et al.
patent: 5903870 (1999-05-01), Kaufman
patent: 5909665 (1999-06-01), Kato
patent: 5913196 (1999-06-01), Talmor et al.
patent: 5936515 (1999-08-01), Right et al.
patent: 5987415 (1999-11-01), Breese et al.
patent: 6006188 (1999-12-01), Bogdashevsky et al.
patent: 6151571 (2000-11-01), Pertrushin
patent: 6173260 (2001-01-01), Slaney
patent: 6212550 (2001-04-01), Segur
patent: 6638217 (2003-10-01), Liberman
patent: 2004/0002838 (2004-01-01), Oliver et al.
patent: WO 87/02491 (1987-04-01), None
patent: WO 98/03941 (1998-01-01), None
patent: WO 98/10412 (1998-03-01), None
patent: WO 98/15924 (1998-04-01), None
patent: WO 98/23062 (1998-05-01), None
patent: WO 99/22364 (1999-05-01), None
patent: WO 99/31653 (1999-06-01), None
patent: WO 00/62279 (2000-10-01), None
Yamada, T., Hashimoto, H., and Tosa, N. “Pattern Recognition of Emotion with Neural Network”, Industrial Electronics, Control and Instrumentation 1995 vol. 1, pp. 183-187.
Banse, Rainer, et al., “Acoustic Profiles in Vocal Emotion Expression,”Journal of Personality and Social Psychology, 1996, vol. 70, No. 3, pp. 614-636.
Boersma, Paul, “Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound,” Institute of Phonetic Sciences, University of Amsterdam, Proceedings 17 (1993), pp. 97-110.
Breiman, Leo, “Bagging Predictors,”Machine Learning, 24, 123-140 (1996).
Cahn, Janet E., “The Generation of Affect in Synthesized Speech,” M.I.T. Media Technology Laboratory (1990).
Darby, M.D., John K.,Speech Evaluation in Psychiatry, Chap. 10, 1981, pp. 189-220.
Elliott, Clark, et al., “Autonomous Agents as Synthetic Characters,”American Association for Artificial Intelligence, 1998, pp. 13-30.
Hansen, Lars Kai, “Neural Network Ensembles,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, No. 10, Oct. 1990, pp. 993-1001.
Kononenko, Igor, “Estimating Attributes: Analysis and Extensions of RELIEF,” University of Ljubljana, Faculty of Electrical Engineering & Computer Science, 1994, pp. 171-182.
Murray, Iain R., et al., “Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion,”J. Acoust. Soc. Am.93 (2), Feb. 1993, pp. 1097-1108.
Parmanto, Bambang, et al., “Improving Committee Diagnosis with Resampling Techniques,” Department of Information Science, University of Pittsburgh, 1996, pp. 882-888.
Polzin, T., et al., “Detecting Emotions in Speech,” Proceedings of the CMC 1998, http://www.ri.cmu.edu/pubs/pub—2161.html.
Polzin, T., et al., “Pronunciation Variations in Emotional Speech,” ESCA-98, Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, May 1998, http://www.ri.cmu.edu/pubs/pub—2160.html.
Scherer, Klaus R., et al., “Vocal Cues in Emotion Encoding and Decoding,”Motivation and Emotion, vol. 15, No. 2, 1991, pp. 123-148.
Talbot, David, “Prosody,”Technology Review, Jul./Aug. 2002, p. 27.
Tosa, Naoko, “Life-like Communication Agent,” MIC & MUSE, 1996, pp. 1-15.
Gadallah, M.E.; Matar, M.A.; Algezawi, A.F.; “Speech Based Automatic Lie Detection”; Radio Science Conference , 1999. . NRSC '99. Proceedings of the Sixteenth National, Feb. 23-25, 1999; pp. C33/1-C33/8.
Klasmeyer, G.; Acoustics, “The Perceptual Importance of Selected Voice Quality Parameters”; Speech, and Signal Processing, 1997, ICASSP-97., 1997 IEEE International Conference on vol. 3, Apr. 21-24, 1997 pp. 1615-1618 vol. 3.
Hansen, John H.L., “Analysis and Compensation of Speech Under Stress and Noise for Environmental Robustness in Speech Recognition”; Speech Communication, 1996, pp. 151-173.
Bourjot, C., et al., “Phonetic Decoder Assessment,” Eurospeech89,European Conference on Speech Communication and Technology, vol. Two, Sep. 1989, pp. 457-460.
Campbell, Jr., Joseph P., et al., “Government Applications and Operations,”Biometric Consortium; http://www.biometrics.org/REPORTS/CTSTG96.
Chiu, C.C., et al., “The Analysis and Recognition of Human Vocal Emotions,”Proceedings of International Computer Symposium 1994, Dec. 1994, pp. 83-88.
Dallaert, Frank, et al., “Recognizing Emotion in Speech,”1996, ICSLP 96, Proceedings, Fourth International Conference on Spoken Language, vol. 3, pp. 1970-1973.
Hays, Ronald J., “INS Passenger Accelerated Service System (INSPASS),”Biometric Consortium; http://www.biometrics.org/REPORTS/INSASS.html.
Jimenez-Fernandez, Alfonso, et al., “Pattern Recognition in the Vocal Expression of Emotional Categories,”IEEE, Ninth Annual Conference of the Engineering in Medicine and Biology Society, 1987, pp. 2090-2091.
Mayer, David L., et al., “Development of a Speech Analysis Protocol for Accident Investigation,” abstract andProceedings of the 38thAnnual Meeting of the Human Factors and Ergonomic Society, vol. 1, Oct. 24-28, 1994, pp. 124-127.
Moriyama, Tsuyoshi, et al., “Emotion Recognition and Synthesis System on Speech,”IEEE, 1999, pp. 840-844.
Oliver, Gina M., “A Study of the Use of Biometrics as it Relates to Personal Privacy Concerns,” 1999, http:/
ile.ed.umuc.edu/˜jmeinke/inss690/oliver/Oliver-690.html.
Yamada, Toyotoshi, et al., “Pattern recognition of emotion with Neural Network,”IEEE, 1995, pp. 183-187.
Accenture LLP
Hudspeth David
Sked Matthew J.
LandOfFree
Detecting emotions using voice signal analysis does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Detecting emotions using voice signal analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Detecting emotions using voice signal analysis will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3726343