Reexamination Certificate
2002-10-23
2004-11-09
Abebe, Daniel (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S243000
active
06816834
ABSTRACT:
BACKGROUND INFORMATION
Local telephone companies offer Call Forward on Busy (“CFB”), Call Forward on No Answer (“CFNA”), Call Forwarding (“CF”), Distinctive Ring and other services.
FIG. 1 shows a traditional phone system 1 which may offer the services described above. When a user of the traditional phone system 1 places a call, the system has an Automatic Number Identification (“ANI”) service 10 that identifies the number from which the call has been placed. Similarly, the traditional phone system 1 has a Dialed Number Identification Service (“DNIS”) service 20 which identifies the number that the caller dialed. This information is received by the local phone company 30 and the call is directed to the receiving phone which is termed a Plain Old Telephone Service (“POTS”) device 40.
SUMMARY OF THE INVENTION
A system, comprising an audio shredder receiving an audio segment, the audio segment being a portion of an audio stream, the audio shredder creating an audio shred from the audio segment, an audio mixer receiving the audio shred and randomizing the audio shred with other audio shreds from other audio streams and a plurality of transcribers, wherein one of the transcribers receives the audio shred and transcribes the audio shred into text.
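The shredder, mixer and transcriber arrangement described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not taken from the patent: the names `AudioShred`, `shred`, `mix` and `deal`, the fixed shred length, the representation of audio as a list of samples, and the round-robin hand-off to transcribers.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioShred:
    stream_id: str   # hypothetical label for the originating audio stream
    index: int       # position of the shred within its segment
    samples: tuple   # the audio samples carried by this shred


def shred(stream_id, segment, shred_len):
    """Cut one audio segment into consecutive shreds of at most shred_len samples."""
    return [
        AudioShred(stream_id, i, tuple(segment[start:start + shred_len]))
        for i, start in enumerate(range(0, len(segment), shred_len))
    ]


def mix(shreds_per_stream, rng):
    """Pool the shreds of several streams and shuffle them into a random order."""
    pool = [s for shreds in shreds_per_stream for s in shreds]
    rng.shuffle(pool)
    return pool


def deal(pool, transcribers):
    """Hand the shuffled shreds out round-robin (an assumed delivery policy)."""
    queues = {name: [] for name in transcribers}
    for i, s in enumerate(pool):
        queues[transcribers[i % len(transcribers)]].append(s)
    return queues
```

Because the pool is shuffled across streams before delivery, a single transcriber in this sketch receives short, unordered pieces drawn from different conversations.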
In addition, a method, comprising the steps of receiving an audio stream; filtering the audio stream to separate identifiable words in the audio stream from unidentifiable words; creating a word text file for the identifiable words and storing the word text file in a database, the word text file including word indexing information; creating audio segments from the audio stream, the audio segments including portions of the audio stream having unidentifiable words; creating audio shreds from the audio segments, the audio shreds including audio shred indexing information to identify each of the audio shreds, and storing the audio shred indexing information in the database; mixing the audio shreds with other audio shreds from other audio streams; delivering the audio shreds to a plurality of transcribers; transcribing each of the audio shreds into a corresponding audio shred text file, the audio shred text file including the audio shred indexing information corresponding to the audio shred from which the audio shred text file was created; and reassembling the audio shred text files and the word text files into a conversation text file corresponding to the audio stream.
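The final reassembly step of this method can be sketched as follows. The sketch assumes that the indexing information is a begin time within the audio stream and that the database is a plain dictionary. The function name `reassemble` and its parameters are hypothetical.

```python
def reassemble(word_texts, shred_texts, shred_index):
    """Merge recognized words and transcribed shreds into one conversation text.

    word_texts:  list of (begin_time, text) for words the filter identified.
    shred_texts: list of (shred_id, text) returned by transcribers, in any order.
    shred_index: dict mapping shred_id to begin_time, standing in for the
                 audio shred indexing information stored in the database.
    """
    timeline = list(word_texts)
    for shred_id, text in shred_texts:
        # Look up where this shred belongs in the original stream.
        timeline.append((shred_index[shred_id], text))
    # Order every piece by its position in the stream, then join the text.
    timeline.sort(key=lambda item: item[0])
    return " ".join(text for _, text in timeline)
```

Since each transcribed shred carries its own identifier, the order in which transcribers return their work does not matter in this sketch. Only the stored index decides the final position.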
Furthermore, a system, comprising a service platform for receiving, processing and directing streaming audio and a user device connected to the service platform and configured to receive streaming audio from the service platform and transmit streaming audio to the service platform, the user device further configured to signal the service platform to begin a transcription of the streaming audio transmitted and received by the user device. The service platform includes a filter receiving the streaming audio, identifying words within the streaming audio and creating a word text file corresponding to each of the identified words, the filter further creating audio segments from the streaming audio, the audio segments including portions of the audio stream having unidentifiable words; an audio shredder creating a plurality of audio shreds from each of the audio segments; an audio mixer randomizing the audio shreds with other audio shreds from other streaming audio, wherein the service platform delivers the randomized audio shreds to a plurality of transcribers which transcribe the audio shreds into audio shred text files corresponding to the audio shreds; and a reassembler creating a conversation text file corresponding to the streaming audio from the audio shred text files and the word text files.
A system, comprising an audio stream element including information corresponding to an audio stream, the information including a begin time of the audio stream, an end time of the audio stream, a conversation identification of the audio stream and the audio stream file, a word element including information corresponding to a word identified in the audio stream by a speech recognition filter, the information including an identification of the audio stream from which the word was identified, a begin time of the word, an end time of the word, an audio file of the word and text corresponding to the word, an audio segment element including information corresponding to an audio segment of the audio stream, the audio segment being a portion of the audio stream without identifiable words, the information including the identification of the audio stream from which the audio segment originates, the begin time of the audio segment, the end time of the audio segment and the audio file of the audio segment, an audio shred element including information corresponding to an audio shred of the audio segment, the information including an identification of the audio segment from which the audio shred originates, the begin time of the audio shred, the end time of the audio shred and the audio file of the audio shred and a text token element including information corresponding to a textual representation of the audio shred, the information including an identification of the audio shred from which the textual representation originates and the textual representation. The information included in each of the audio stream element, the word element, the audio segment element, the audio shred element and the text token element is processed to generate a text transcription of the audio stream.
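The five elements listed above map naturally onto record types. The following is a minimal sketch, not the patent's schema: the class and field names are illustrative, and `segment_id` and `shred_id` are hypothetical keys added so that one element can refer to another. Times are assumed to be seconds and audio files are assumed to be path strings.

```python
from dataclasses import dataclass


@dataclass
class AudioStream:
    conversation_id: str
    begin_time: float
    end_time: float
    audio_file: str


@dataclass
class Word:
    stream_id: str       # audio stream in which the word was identified
    begin_time: float
    end_time: float
    audio_file: str
    text: str


@dataclass
class AudioSegment:
    segment_id: str      # hypothetical key, referenced by shreds
    stream_id: str       # audio stream from which the segment originates
    begin_time: float
    end_time: float
    audio_file: str


@dataclass
class AudioShred:
    shred_id: str        # hypothetical key, referenced by text tokens
    segment_id: str      # audio segment from which the shred originates
    begin_time: float
    end_time: float
    audio_file: str


@dataclass
class TextToken:
    shred_id: str        # audio shred this text was transcribed from
    text: str


def stream_of(token, shreds, segments):
    """Follow token -> shred -> segment to find the originating stream id."""
    shred = shreds[token.shred_id]
    segment = segments[shred.segment_id]
    return segment.stream_id
```

The chain of identifiers is what lets a text token, produced by a transcriber who never saw the whole conversation, be traced back to its audio stream when the transcription is generated.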
REFERENCES:
patent: 3660616 (1972-05-01), Davidge et al.
patent: 5655058 (1997-08-01), Balasubramanian et al.
patent: 5659662 (1997-08-01), Wilcox et al.
patent: 6076059 (2000-06-01), Glickman et al.
patent: 6243676 (2001-06-01), Witteman
patent: 6636238 (2003-10-01), Amir et al.
Abebe Daniel
Fay Kaplun & Marcin LLP