Systems and methods for classifying audio into broad phoneme...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S206000, C704S208000, C704S210000, C704S246000, C704S249000, C704S256200

Reexamination Certificate

active

07424427

ABSTRACT:
An audio classification system classifies sounds in an audio stream as belonging to one of a relatively small number of classes. The audio classification system includes a signal analysis component [301]and a decoder [302]. The decoder [302]includes a number of models [310-316] for performing the audio classifications. In one implementation, the possible classifications include: vowels, fricatives, narrowband, wideband, coughing, gender, and silence. The classified audio may be used to enhance speech recognition of the audio stream.

REFERENCES:
patent: 5475792 (1995-12-01), Stanford et al.
patent: 5638487 (1997-06-01), Chigier
patent: 5897614 (1999-04-01), McKiel, Jr.
patent: 6208967 (2001-03-01), Pauws et al.
patent: 6243680 (2001-06-01), Gupta et al.
Leung et al, “A Comparitive Study of Signal Representations and Classification Techniques for Speech Recognition” IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP'93 27-40 Apr. 1993, vol. (2), pp. 680-683.
Amit Srivastava et al.: “Sentence Boundary Detection in Arabic Speech,”8thEuropean Conference on Speech Communication and Technology, Sep. 1-4, 2003 in Geneva, Switzerland; 4 pages.
Sreenivasa Sista et al.: “Unsupervised Topic Discovery Applied To Segmentation Of News Transcriptions,”8thEuropean Conference on Speech Communication and Technology, Sep. 1-4, 2003 in Geneva, Switzerland; 4 pages.
Daben Liu et al.: “Online Speaker Clustering,”ICASSP 2003, vol. 1, pp. 572-575, 2003 Hong Kong.
J. Billa et al.: “Audio Indexing Of Arabic Broadcast News,”ICASSP 2002; Orlando, FL; May 13-17, 2002; 4 pages.
Scott Shepard et al.: “Newsroom OnTAP—Real-time alerting from streaming audio,” Dec.-Jan. 2001 HLT Paper; 2 pages.
Heidi Christensen et al.: “Punctuation Annotation using Statistical Prosody Models,”The Proceedings of Eurospeech, Denmark, 2001; 6 pages.
Ji-Hwan Kim et al.: “The Use Of Prosody In A Combined System For Punctuation Generation And Speech Recognition,”The Proceedings of Eurospeech, Denmark, 2001; 4 pages.
Jing Huang et al.: “Maximum Entropy Model For Punctuation Annotation From Speech,”The Proceedings of Eurospeech, Denmark, 2001; pp. 917-920.
Yoshihiko Gotoh et al.: “Sentence Boundary Detection in Broadcast Speech Transcripts,”Proceedings of the International Speech Communication Association Workshop: Automatic Speech Recognition: Challenges for the New Millennium, Paris, Sep. 2000; 8 pages.
John Mekhoul et al.: “Speech and Language Technologies for Audio Indexing and Retrieval,”Proceedings of the IEEE, vol. 88, No. 8, Aug. 2000; pp. 1338-1353.
Francis Kubala et al.: “Integrated Technologies For Indexing Spoken Language,” Communications of the ACM, vol. 43, No. 2, Feb. 2000; pp. 48-56.
Sean Colbath et al.: “Spoken Documents: Creating Searchable Archives from Continuous Audio,”Proceedings of the 33rdHawaii International Conference on System Sciences-2000; pp. 1-9.
Francis Kubala et al.: “Situation Awareness Contexts for Smart Environments,”Inter-Agency Workshop on Research Issues for Smart Environments; Atlanta, GA; 2000; 3 pages.
Daben Liu et al.: “Fast Speaker Change Detection For Broadcast News Transcription And Indexing,”The Proceedings of Eurospeech 1999; Budapest, Hungary; 4 pages.
Daniel M. Bikel et al.: “An Algorithm that Learns What's in a Name,”Machine Learning, 1999; pp, 1-20.
Richard Schwartz et al.: “Accurate Near-Real-Time Recognition of Broadcast News using Multiple-Pass Search Techniques,”1999 Workshop on Automatic Speech Recognition and Understanding, Dec. 12-15, 1999; Keystone, Colorado; 6 pages.
Francis Kubala et al.: “Smart Information Spaces: Managing Personal and Collaborative Histories,”Proceedings of the 1998 DARPA/NIST Smart Spaces Workshop, Jul. 30-31, 1998; 6 pages.
Daben Liu et al.: “Improvements in Spontaneous Speech Recognition,”Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop; Feb. 8-11, 1998 in Lansdowne, Virginia; 5 pages.
Francis Kubala et al.: “The 1997 BBN Byblos System Applied To Broadcast News Transcription,” Cambridge, Massachusetts; 1997; 6 pages.
Hubert Jin et al.: “Automatic Speaker Clustering,”ICFEM, Chantilly, Virginia; Feb. 1997; 4 pages.
Sean Colbath et al.: “OnTAP: Mixed-media Multi-lingual Language Processing,”Human Language Technology Conference, San Diego, CA; 2002; 2 pages.
Andreas Stolcke et al.: “Automatic Linguistic Segmentation Of Conversational Speech,”Proceedings of the International Conference on Spoken Language Processing, vol. 2, Philadelphia 1996; pp. 1005-1008
Scott S. Chen et al.: “Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion,” in DARPA Speech Recognition Workshop, 1998, 6 pages.
Marti A. Hearst: “Multi-Paragraph Segmentation of Expository Text,” in Proceedings of the 2ndAnnual Meeting of the Association for Computational Linguistics, New Mexico State University, Las Cruces, NM, 1994, pp. 1-8.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Systems and methods for classifying audio into broad phoneme... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Systems and methods for classifying audio into broad phoneme..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Systems and methods for classifying audio into broad phoneme... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3992813

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.