Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-02-13
2007-02-13
Hudspeth, David (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S233000, C704S247000
Reexamination Certificate
active
10923157
ABSTRACT:
Method for improving speaker identification by determining usable speech. Degraded speech is preprocessed in a speaker identification (SID) process to produce SID usable and SID unusable segments. Features are extracted and analyzed so as to produce a matrix of optimum classifiers for the detection of SID usable and SID unusable speech segments. Optimum classifiers possess a minimum distance from a speaker model. A decision tree based upon fixed thresholds indicates the presence of a speech feature in a given speech segment. Following preprocessing, degraded speech is measured in one or more time, frequency, cepstral or SID usable/unusable domains. The results of the measurements are multiplied by a weighting factor whose value is proportional to the reliability of the corresponding time, frequency, or cepstral measurements performed. The measurements are fused as information, and usable speech segments are extracted for further processing. Such further processing of co-channel speech may include speaker identification where a segment-by-segment decision is made on each usable speech segment to determine whether they correspond to speaker #1or speaker #2. Further processing of co-channel speech may also include constructing the complete utterance of speaker #1or speaker #2. Speech features such as pitch and formants may be extended back into the unusable segments to form a complete utterance from each speaker.
REFERENCES:
patent: 5271088 (1993-12-01), Bahler
patent: 5355431 (1994-10-01), Kane et al.
patent: 5623539 (1997-04-01), Bassenyemukasa et al.
patent: 6522746 (2003-02-01), Marchok et al.
patent: 6539352 (2003-03-01), Sharma et al.
patent: 2003/0023436 (2003-01-01), Eide
Kizhanatham, “Detection of Cochannel Speech and Usable Speech,” Masters Thesis, Temple University, pp. 1-87, May 2002.
Lovekin et al, “Developing Usable Speech Criteria for Speaker Identification”, ICASSP 2001, pp. 421-424, May 2001.
Shao et al, “Co-channel Speaker Identification Using Usable Speech Extraction Based on Multipitch Tracking,” IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 205-208, 2003.
Benincasa Daniel S.
Smolenski Brett Y.
Wenndt Stanley J.
Yantorno Robert E.
Hudspeth David
Mancini Joseph A.
The United States of America as represented by the Secretary of
Wozniak James S.
LandOfFree
Method for improving speaker identification by determining... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for improving speaker identification by determining..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for improving speaker identification by determining... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3889464