Patent
1998-08-11
2000-11-21
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704/235, 704/239, G10L 13/00, G10L 15/26
active
061515760
ABSTRACT:
Methods and apparatus for processing, storing, and transmitting an original data stream of digitized speech samples. The method converts a stream of digitized speech samples to a stream of text and associated reliability measures. A mixed-media data stream is created with the stream of text as a text component and selected portions of the digitized speech stream as a speech component. The selected portions are those whose corresponding reliability measures fall below a threshold. The threshold can be adjusted to change the amount of storage or bandwidth used by the mixed-media data stream. The mixed-media data stream can be searched, and the results can be spoken as synthetic speech derived from the text component or as speech samples taken from the digitized speech component.
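The selection rule described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names RecognizedWord, MixedSegment, build_mixed_stream, speech_bytes, search and playback_source are hypothetical, the recognizer output is assumed to arrive as per-word text with a reliability score between 0 and 1, and the speech samples are assumed to be raw bytes.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class RecognizedWord:
    """One unit of recognizer output (hypothetical structure)."""
    text: str           # recognized text for this span
    reliability: float  # assumed confidence scale: 0.0 (poor) to 1.0 (certain)
    samples: bytes      # digitized speech samples covering this span


@dataclass
class MixedSegment:
    """One entry of the mixed-media stream: text always, speech only if kept."""
    text: str
    reliability: float
    speech: Optional[bytes] = None


def build_mixed_stream(words: Sequence[RecognizedWord],
                       threshold: float) -> List[MixedSegment]:
    """Keep all text; keep speech only where reliability falls below threshold."""
    return [
        MixedSegment(
            text=w.text,
            reliability=w.reliability,
            speech=w.samples if w.reliability < threshold else None,
        )
        for w in words
    ]


def speech_bytes(stream: Sequence[MixedSegment]) -> int:
    """Storage used by the speech component of the mixed stream."""
    return sum(len(s.speech) for s in stream if s.speech is not None)


def search(stream: Sequence[MixedSegment], query: str) -> List[MixedSegment]:
    """Case-insensitive substring search over the text component."""
    q = query.lower()
    return [s for s in stream if q in s.text.lower()]


def playback_source(segment: MixedSegment) -> str:
    """Report whether a result would be played from samples or synthesized."""
    return "recorded" if segment.speech is not None else "synthetic"


if __name__ == "__main__":
    words = [
        RecognizedWord("hello", 0.95, b"\x00" * 100),
        RecognizedWord("hudspeth", 0.40, b"\x01" * 200),
        RecognizedWord("world", 0.80, b"\x02" * 150),
    ]
    for threshold in (0.0, 0.5, 0.9):
        stream = build_mixed_stream(words, threshold)
        print(threshold, speech_bytes(stream),
              [playback_source(s) for s in stream])
```

In this sketch, raising the threshold retains more of the original audio and therefore uses more storage or bandwidth, while a threshold of zero yields a text-only stream. Search results whose speech was discarded would be rendered by a text-to-speech engine, which is outside the scope of the example.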
REFERENCES:
patent: 5031113 (1991-07-01), Hollerbauer
patent: 5625711 (1997-04-01), Nicholson et al.
patent: 5729637 (1998-03-01), Nicholson et al.
patent: 5799273 (1998-08-01), Mitchell et al.
patent: 6026360 (2000-02-01), Ono
Digital Dictate; Technical Manual and Installation Guide Release 2.4; Interface version for IBM Voice Type and Windows word processors supported by Digital Dictate, Mar. 1995.
Wilcox et al., "Wordspotting for Voice Editing and Audio Indexing," CHI '92 Conference Proceedings, May 3-7, 1992, 2 pgs.
Microsoft Corporation, Web Page Describing SAPI Speech SDK, 1 page.
Microsoft Corporation, Voice Dictation API Reference, 15 pages, copyright 1995-1998.
Virage, Inc., Video Cataloger 1.3, 6 pages, copyright 1998.
Raman T. V.
Warnock John E.
Adobe Systems Incorporated
Azad Abul K.
Hudspeth David R.
Mixing digitized speech and text using reliability indices
Profile ID: LFUS-PAI-O-1266508