Reexamination Certificate
2004-02-18
2009-06-02
Edouard, Patrick N (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S001000, C704S009000, C704S010000
active
07542903
ABSTRACT:
Techniques are provided for determining predictive models of discourse functions based on prosodic features of natural language speech. Inter- and intra-sentential discourse functions in a training corpus of natural language speech utterances are determined. The discourse functions are clustered. The exemplary prosodic features associated with each type of discourse function are determined. Machine learning, observation, and the like are used to determine a subset of prosodic features associated with each type of discourse function that is useful in predicting the likelihood of each type of discourse function.
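The steps named in the abstract (group utterances by discourse function, find exemplary prosodic features per group, then keep the subset of features that best predicts the function) can be illustrated with a minimal Python sketch. It is not the patented method: the feature names, the labels "subordination" and "coordination", the toy values, the variance-ratio score, and the nearest-centroid predictor are all assumptions chosen for illustration.

```python
from collections import defaultdict
from statistics import mean, pstdev, pvariance

# Hypothetical prosodic features and toy values; none come from the patent.
FEATURES = ["initial_pitch_hz", "preceding_pause_s", "speech_rate_syl_s"]
CORPUS = [
    ("subordination", [180.0, 0.20, 4.1]),
    ("subordination", [175.0, 0.25, 3.9]),
    ("subordination", [185.0, 0.15, 4.0]),
    ("coordination", [220.0, 0.60, 4.0]),
    ("coordination", [230.0, 0.80, 4.2]),
    ("coordination", [225.0, 0.70, 3.8]),
]


def group_by_function(corpus):
    """Group utterance feature vectors by annotated discourse function."""
    groups = defaultdict(list)
    for label, feats in corpus:
        groups[label].append(feats)
    return dict(groups)


def centroids(groups):
    """Mean value of each feature per discourse function (the 'exemplary' profile)."""
    return {label: [mean(col) for col in zip(*rows)] for label, rows in groups.items()}


def separation_scores(groups):
    """Between-class variance divided by mean within-class variance, per feature."""
    cents = centroids(groups)
    scores = []
    for i in range(len(FEATURES)):
        between = pvariance([c[i] for c in cents.values()])
        within = mean([pvariance([r[i] for r in rows]) for rows in groups.values()])
        scores.append(between / within if within > 0 else float("inf"))
    return scores


def select_features(groups, k):
    """Indices of the k features that best separate the discourse functions."""
    scores = separation_scores(groups)
    ranked = sorted(range(len(FEATURES)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def feature_scales(corpus):
    """Per-feature standard deviation over the whole corpus, used to normalize distances."""
    columns = zip(*(feats for _, feats in corpus))
    return [pstdev(col) or 1.0 for col in columns]


def predict(feats, cents, scales, selected):
    """Nearest-centroid prediction using only the selected feature subset."""
    def dist(c):
        return sum(((feats[i] - c[i]) / scales[i]) ** 2 for i in selected)
    return min(cents, key=lambda label: dist(cents[label]))


if __name__ == "__main__":
    groups = group_by_function(CORPUS)
    selected = select_features(groups, k=2)
    print("selected features:", [FEATURES[i] for i in selected])
    print(predict([182.0, 0.22, 4.0], centroids(groups), feature_scales(CORPUS), selected))
```

In this toy data the speech-rate feature has nearly identical means in both groups, so it scores lowest and is dropped. A real system would replace the variance ratio with whatever learner or manual observation is used to pick the predictive subset.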
REFERENCES:
patent: 5749071 (1998-05-01), Silverman
patent: 5761637 (1998-06-01), Chino
patent: 5890117 (1999-03-01), Silverman
patent: 2005/0086592 (2005-04-01), Polanyi et al.
patent: 2005/0171926 (2005-08-01), Thione
patent: 2005/0182618 (2005-08-01), Azara et al.
patent: 2005/0182625 (2005-08-01), Azara et al.
patent: 2005/0187772 (2005-08-01), Azara et al.
Yang, Li-Chiung. “Visualizing Spoken Discourse: Prosodic Form and Discourse Functions of Interruption,” Sep. 2001, Procs. of the Second SIGdial Workshop on Discourse and Dialogue vol. 16.
Jurafsky, D. et al., “Automatic Detection of Discourse Structure for Speech Recognition and Understanding,” Dec. 1997, IEEE Procs. of 1997 Workshop on Automatic Speech Recognition and Understanding, pp. 88-95.
Shriberg, Elizabeth et al., Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech, Language and Speech 41 (3-4): 439-487. Special Issue on Prosody and Conversation, 1998.
Dahlbäck, N. and Jönsson, A. 1989. Empirical studies of discourse representations for natural language interfaces. In Proceedings of the Fourth Conference on European Chapter of the Association For Computational Linguistics (Manchester, England, Apr. 10-12, 1989). European Chapter Meeting of the ACL. Association for Computational Linguistics, Mo.
Black, A.; Taylor, P.: CHATR: a generic speech synthesis system, in Proceedings of COLING94, II pp. 983-986, Kyoto, 1994.
Haller, S. Fossum, T.: “The Association Between Subject Matter and Discourse Segmentation”, In The Proceedings of the Florida AI Research Symposium, Key West, FL, May 2001.
Long, S.; Kooper, R.; Abowd, G.; Atkeson, C., “Rapid Prototyping of Mobile Context-Aware Applications: the CyberGuide Case Study”, in the Proceedings of the 2nd ACM International Conference on Mobile Computing and Networking (MobiCom'96), pp. 97-107, Nov. 10-12, 1996.
Shriberg, E.; Stolcke, A.; Hakkani-Tur, Dilek; Tur, Gokhan, “Prosody-Based Segmentation of Speech Utterances into Sentences and Topics” in Speech Communication, 2000, 32, 1-2, Sept, pp. 127-154.
Stolcke, A.; Shriberg, E.; Bates, R.; Coccaro N.; Jurafsky, D.; Martin, R.; Meteer, M.; Ries, K.; Taylor, P.; Van Ess-Dykema, C., “Dialog Act Modeling for Conversational Speech” in Applying Machine Learning to Discourse Processing. Papers from the 1998 AAAI Spring Symposium, Technical Report SS-98-01 (J. Chu-Carroll et al, eds.) Stanford CA pp. 98-105, AAAI Press, Menlo Park CA. 1998.
Wrede, B.; Shriberg, E., “Spotting ‘HotSpots’ in Meetings: Human Judgments and Prosodic Cues” in Proc. Eurospeech, Geneva, 2003.
Levow, G., “Prosodic Cues to Discourse Segment Boundaries in Human-Computer Dialogue”, in 5th SIGdial Workshop on Discourse and Dialogue Boston, Apr. 30 and May 1, 2004.
Lascarides, A. and Oberlander, J., “Temporal Coherence and Defeasible Knowledge”, Theoretical Linguistics, 19.1, pp. 1-35, Walter de Gruyter, Berlin, New York, 1993.
“CHATR: A Generic Speech Synthesis System”, Dec. 25, 1997, downloaded from http://feast.atr.jp/chatr/manual/index.html Mar. 16, 2006.
“HCRC Project: ID4S Intonation in Dialogue for Speech Recognition”, downloaded from http://www.hcrc.ed.ac.uk/Site/IDS4.html Jun. 2, 2004.
Nuance Say Anything Grammars product description, downloaded from http://cafe.bevocal.com/docs/grammar/sayanything.html Jun. 2, 2004.
Ljolje, A., “The AT&T LVCSR-2001 System”, May 3, 2001, downloaded from ftp://jaguar.ncsl.nist.gov/evaluations/hub5/may01/pres/att—lvcsr.pdf Mar. 16, 2006.
DARPA Communicator Project: Robust Recognition and Dialog Tracking for Interactive Information Access, Mar. 2003, downloaded from http://ssli.ee.washington.edu/projects/communicator.html Mar. 15, 2006.
Ayers, G.M., 1992. “Discourse functions of pitch range in spontaneous and read speech.” Presented at the Linguistic Society of America Annual Meeting.
Brown, G. and Kenworthy, J. 1980 Questions of Intonation, Baltimore, University Park Press, p. 21-122.
Kamp, H. 1981. “A Theory of Truth and Semantic Representation.” in J.A.G. Groenendijk, T. Janssen, and M. Stokhof (eds.) Formal Methods in the Study of Language. Amsterdam: Mathematisch Centrum, 277-322.
Ladd, D.R. 1983, “Phonological Features of Intonation Peaks”, Language, 59:721-759.
Ladd, D.R. 1988. “Declination Reset and the Hierarchical Organization of Utterances” Journal of the Acoustical Society of America, 84(2):530-544.
Mariani, J.; Paroubek, P., 1999 “Language Technologies Evaluation in the European Framework”, Proceedings of the DARPA Broadcast News Workshop, Washington: Morgan Kaufmann Publishers, pp. 237-242.
Nakatani, C.; Hirschberg, J.; and Grosz, B., 1995. “Discourse Structure in Spoken Language: Studies on Speech Corpora.” In Working Notes of the AAAI-95 Spring Symposium in Palo Alto, CA in Empirical Methods in Discourse Interpretation. pp. 106-112.
Polanyi, L.; and Scha, R., 1984. “A syntactic approach to discourse semantics.” Proceedings of the 10th International Conference on Computational Linguistics, Stanford, CA 413-419.
Silverman, K.; Beckman, M.; Pierrehumbert, J.; Ostendorf, M.; Wightman, C.; Price, P.; and Hirschberg, J. 1992. “ToBI: A standard scheme for labeling prosody.” In Proceedings of ICSLP. Banff: International Conference on Spoken Language Processing.
Terken, J. 1984. “The distribution of pitch accents in instructions as a function of discourse structure.” Language and Speech, 27:269-289.
Azara Misty
Polanyi Livia
Thione Giovanni L.
Van Den Berg Martin H.
Edouard Patrick N
Fuji Xerox Co., Ltd.
Godbold Douglas C
Sughrue & Mion, PLLC