Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-11-06
2007-11-06
Azad, Abul K. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S245000
Reexamination Certificate
active
10685565
ABSTRACT:
A system (230) performs speaker adaptation when performing speech recognition. The system (230) receives an audio segment and identifies the audio segment as a first audio segment or a subsequent audio segment associated with a speaker turn. The system (230) then decodes the audio segment to generate a transcription associated with the first audio segment when the audio segment is the first audio segment and estimates a transformation matrix based on the transcription associated with the first audio segment. The system (230) decodes the audio segment using the transformation matrix to generate a transcription associated with the subsequent audio segment when the audio segment is the subsequent audio segment.
REFERENCES:
patent: 4879648 (1989-11-01), Cochran et al.
patent: 4908866 (1990-03-01), Goldwasser et al.
patent: 5317732 (1994-05-01), Gerlach, Jr. et al.
patent: 5404295 (1995-04-01), Katz et al.
patent: 5418716 (1995-05-01), Suematsu
patent: 5544257 (1996-08-01), Bellegarda et al.
patent: 5559875 (1996-09-01), Bieselin et al.
patent: 5572728 (1996-11-01), Tada et al.
patent: 5684924 (1997-11-01), Stanley et al.
patent: 5715367 (1998-02-01), Gillick et al.
patent: 5752021 (1998-05-01), Nakatsuyama et al.
patent: 5757960 (1998-05-01), Murdock et al.
patent: 5768607 (1998-06-01), Drews et al.
patent: 5777614 (1998-07-01), Ando et al.
patent: 5787198 (1998-07-01), Agazzi et al.
patent: 5835667 (1998-11-01), Wactlar et al.
patent: 5862259 (1999-01-01), Bokser et al.
patent: 5875108 (1999-02-01), Hoffberg et al.
patent: 5960447 (1999-09-01), Holt et al.
patent: 5963940 (1999-10-01), Liddy et al.
patent: 5970473 (1999-10-01), Gerszberg et al.
patent: 6006221 (1999-12-01), Liddy et al.
patent: 6024571 (2000-02-01), Renegar
patent: 6029124 (2000-02-01), Gillick et al.
patent: 6029195 (2000-02-01), Herz
patent: 6052657 (2000-04-01), Yamron et al.
patent: 6064963 (2000-05-01), Gainsboro
patent: 6067514 (2000-05-01), Chen
patent: 6067517 (2000-05-01), Bahl et al.
patent: 6088669 (2000-07-01), Mayes
patent: 6112172 (2000-08-01), True et al.
patent: 6151598 (2000-11-01), Shaw et al.
patent: 6161087 (2000-12-01), Wightman et al.
patent: 6169789 (2001-01-01), Rao et al.
patent: 6185531 (2001-02-01), Schwartz et al.
patent: 6219640 (2001-04-01), Basu et al.
patent: 6317716 (2001-11-01), Braida et al.
patent: 6332139 (2001-12-01), Kaneko et al.
patent: 6332147 (2001-12-01), Moran et al.
patent: 6360237 (2002-03-01), Schulz et al.
patent: 6373985 (2002-04-01), Hu et al.
patent: 6381640 (2002-04-01), Powers et al.
patent: 6434520 (2002-08-01), Kanevsky et al.
patent: 6437818 (2002-08-01), Lauwers et al.
patent: 6480826 (2002-11-01), Petrushin
patent: 6602300 (2003-08-01), Ushioda et al.
patent: 6604110 (2003-08-01), Savage et al.
patent: 6647383 (2003-11-01), August et al.
patent: 6654735 (2003-11-01), Eichstaedt et al.
patent: 6708148 (2004-03-01), Gschwendtner et al.
patent: 6714911 (2004-03-01), Waryas et al.
patent: 6718303 (2004-04-01), Tang et al.
patent: 6778958 (2004-08-01), Nishimura et al.
patent: 6792409 (2004-09-01), Wutte
patent: 6847961 (2005-01-01), Lapstun et al.
patent: 6922691 (2005-07-01), Flank
patent: 6931376 (2005-08-01), Lipe et al.
patent: 6961954 (2005-11-01), Maybury et al.
patent: 6973428 (2005-12-01), Boguraev et al.
patent: 6978277 (2005-12-01), Reed et al.
patent: 6999918 (2006-02-01), Ma et al.
patent: 7131117 (2006-10-01), Mills et al.
patent: 7146317 (2006-12-01), Bartosik
patent: 2001/0026377 (2001-10-01), Ikegami
patent: 2001/0051984 (2001-12-01), Fukasawa
patent: 2002/0010575 (2002-01-01), Haase et al.
patent: 2002/0010916 (2002-01-01), Thong et al.
patent: 2002/0059204 (2002-05-01), Harris
patent: 2002/0184373 (2002-12-01), Maes
patent: 2003/0051214 (2003-03-01), Graham et al.
patent: 2003/0093580 (2003-05-01), McGee et al.
patent: 2003/0167163 (2003-09-01), Glover et al.
patent: 2004/0024739 (2004-02-01), Copperman et al.
patent: 2004/0073444 (2004-04-01), Peh et al.
patent: 2005/0060162 (2005-03-01), Mohit et al.
patent: 2006/0129541 (2006-06-01), Morgan et al.
patent: 0664636 (1995-07-01), None
patent: 0935378 (1999-08-01), None
patent: 0715298 (2000-06-01), None
patent: 1079313 (2001-02-01), None
patent: 1103952 (2001-05-01), None
patent: 1176493 (2002-01-01), None
patent: 1 422 692 (2004-05-01), None
patent: 361285570 (1986-12-01), None
patent: WO-99/17235 (1999-04-01), None
patent: WO-00/59223 (2000-10-01), None
patent: WO-02/29612 (2002-04-01), None
patent: WO-02/29614 (2002-04-01), None
Cutting, et al “A Practical Part-of-Speech Tagger,” Proceedings of the 3rd Conference on Applied Natural Language Processing, ACL 1992, pp. 133-140.
Beeferman et al, “Cyberpunc: A Lightweight Punctuation Annotation System for Speech,” Proceedings of the 1999 IEEE Conference on Acoustics, Speech and Signal Processing, ICASSP'98 May 12-15, 1999, 2:689-692.
Shriberg et al, “Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech,” Proceedings of the ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Oct. 2001, pp. 139-140.
Guavain et al, “Transcribing Broadcast News Shows,” IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP'97, 2:21-24, Apr. 1997, p. 715-718.
Waibel et al, “Meeting Browser: Tracking and Summarizing Meetings,” Proceedings of DARPA Broadcast News Workshop, 1998.
Sean Colbath et al.: “Spoken Documents: Creating Searchable Archives from Continuous Audio,”Proceedings of the 33rdHawaii International Conference on System Sciences-2000; pp. 1-9.
Francis Kubala et al.: “Situation Awareness Contexts for Smart Environments,”Inter-Agency Workshop on Research Issues for Smart Environments; Atlanta, GA; 2000; 3 pages.
Daben Liu et al.: “Fast Speaker Change Detection For Broadcast News Transcription And Indexing,”The Proceedings of Eurospeech 1999; Budapest, Hungary; 4 pages.
Daniel M. Bikel et al.: “An Algorithm that Learns What's in a Name,”Machine Learning, 1999; pp. 1-20.
Richard Schwartz et al.: “Accurate Near-Real-Time Recognition of Broadcast News using Multiple-Pass Search Techniques,”1999 Workshop on Automatic Speech Recognition and Understanding, Dec. 12-15, 1999; Keystone, Colorado; 6 pages.
Francis Kubala et al.: “Smart Information Spaces: Managing Personal and Collaborative Histories,”Proceedings of the 1998 DARPA/NIST Smart Spaces Workshop, Jul. 30-31, 1998; 6 pages.
Daben Liu et al.: “Improvements in Spontaneous Speech Recognition,”Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop; Feb. 8-11, 1998, in Lansdowne, Virginia; 5 pages.
Francis Kubala et al.: “The 1997 BBN Byblos System Applied To Broadcast News Transcription,” Cambridge, Massachusetts; 1997; 6 pages.
Hubert Jin et al.: “Automatic Speaker Clustering,”ICFEM, Chantilly, Virginia; Feb. 1997; 4 pages.
Sean Colbath et al.: “OnTAP: Mixed-media Multi-lingual Language Processing,”Human Language Technology Conference, San Diego, CA; 2002; 2 pages.
Andreas Stolcke et al.: “Automatic Linguistic Segmentation Of Conversational Speech,”Proceedings of the International Conference on Spoken Language Processing, vol. 2, Philadelphia 1996; pp. 1005-1008.
Scott S. Chen et al.: “Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion,” in DARPA Speech Recognition Workshop, 1998, 6 pages.
Marti A. Hearst: “Multi-Paragraph Segmentation of Expository Text,” in Proceedings of the 2ndAnnual Meeting of the Association for Computational Linguistics, New Mexico State University, Las Cruces, NM, 1994, pp. 1-8.
Beigi et al., “A Distance Measure Between Collections of Distributions and its Applications to Speaker Recognition” IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP'89, May 12-15, 1998, vol.
Azad Abul K.
BBNT Solutions LLC
LandOfFree
Systems and methods for providing online fast speaker... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Systems and methods for providing online fast speaker..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Systems and methods for providing online fast speaker... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3822260