Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2008-06-25
2011-11-22
Vo, Huyen X. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S270100, C704S270000
Reexamination Certificate
active
08065142
ABSTRACT:
A method and system for synchronizing words in an input text of a speech with a continuous recording of the speech. A received input text includes previously recorded content of the speech to be reproduced. A synthetic speech corresponding to the received input text is generated. Ratio data including a ratio between the respective pronunciation times of words included in the received text in the generated synthetic speech is computed. The ratio data is used to determine an association between erroneously recognized words of the received text and a time to reproduce each erroneously recognized word. The association is outputted in a recording medium and/or displayed on a display device.
REFERENCES:
patent: 5535063 (1996-07-01), Lamming
patent: 5598507 (1997-01-01), Kimber et al.
patent: 5606643 (1997-02-01), Balasubramanian et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5655058 (1997-08-01), Balasubramanian et al.
patent: 5659662 (1997-08-01), Wilcox et al.
patent: 5717869 (1998-02-01), Moran et al.
patent: 5850629 (1998-12-01), Holm et al.
patent: 6076059 (2000-06-01), Glickman et al.
patent: 6263308 (2001-07-01), Heckerman et al.
patent: 6332122 (2001-12-01), Ortega et al.
patent: 6332147 (2001-12-01), Moran et al.
patent: 6434520 (2002-08-01), Kanevsky et al.
patent: 6490553 (2002-12-01), Van Thong et al.
patent: 6505153 (2003-01-01), Van Thong et al.
patent: 6714909 (2004-03-01), Gibbon et al.
patent: 7298930 (2007-11-01), Erol et al.
patent: 2002/0161804 (2002-10-01), Chiu et al.
patent: 2002/0193895 (2002-12-01), Qian et al.
patent: 2006/0100877 (2006-05-01), Zhang et al.
patent: 2006/0294453 (2006-12-01), Hirata
patent: 0495612 (1996-04-01), None
patent: 11-162152 (1999-06-01), None
Bett et al., “Multimodal Meeting Tracker. In: Proceedings of RIAO”, Paris, France (2000).
Jacobson et al., “Linguistic Documents Synchronizing Sound and Text”, Speech Communication 33 (1-2), pp. 79-96 (2001).
Kimber et al., “Speaker Segmentation for Browsing Recorded Audio”, (1995).
Kimber et al., “Acoustic segmentation for Audio Browsers,” in Proc. Interface Conf. Sydney, Australia (Jul. 1996).
Kubala et al., “Rough'n'Ready: A Meeting Recorder and Browser”, ACM Computing Surveys, vol. 31, No. 7, (Sep. 1999) Article No. 7.
Lu et al., “A Robust Audio Classification and Segmentation Method”, (2001).
Roy et al., “Audio Meeting History Tool: Interactive Graphical User-Support for Virtual Audio Meetings. In Proceedings of the ESCA workshop: Accessing information in spoken audio”, (Apr. 1999) Cambridge Unversity pp. 107-110. available from http://svrwww.eng.cam.ac.uk/img id="CUSTOM-CHARACTER-00001" he="3.13mm" wi="2.12mm" file="US08065142-20111122-P00001.TIF" alt="custom character" img-content="character" img-format="tif" ?ajr/esca99/.
Waibel et al., “Advances in Automatic Meeting Record Creation and Access: in Proceedings of ICASSP”, (May 2001).
Imoto Noriko
Uda Tetsuya
Watanabe Takatoshi
Nuance Communications Inc.
Vo Huyen X.
Wolf Greenfield & Sacks P.C.
LandOfFree
Synchronization of an input text of a speech with a... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Synchronization of an input text of a speech with a..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Synchronization of an input text of a speech with a... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4305129