Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1998-01-28
1999-12-21
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704245, G10L 506
Patent
active
060061847
ABSTRACT:
In a speaker recognition system, a tree-structured reference pattern storing unit has first through M-th node stages each of which has nodes that respectively store a reference pattern of inhibiting speakers. The reference pattern of each node of (N-1)-th node stage represents acoustic features in the reference patterns of predetermined ones of the nodes of the N-th node stage. An analysis unit analyzes input speech and converts the input speech into feature vectors. A similarities calculating unit calculates similarities between the feature vectors and the reference patterns of all of the inhibiting speakers. An inhibiting speaker selecting unit sorts the similarities and selects a predetermined number of inhibiting speakers. The similarities calculating unit calculates the similarity of the node of the first node stage and calculates the similarities of ones of the nodes of the N-th node stage which are connected to a predetermined number of nodes of the (N-1)-th node stage, selected in an order based on highest similarities.
REFERENCES:
Aaron E. Rosenberg and S. Parthasarathy, "Speaker Background Models for Connected Digit Password Speaker Verificationm," Proc. IEEE ICASSP 96, vol. 1, p. 81-84, May 1996.
J. M. Colombi, D. W. Ruck, S. K. Rogers, M. Oxley, and T. R. Anderson, "Cohort Selection and Word Grammar Effects for Speaker Recognition," Proc. ICASSP 96, vol. 1, p. 85-88, May 1996.
Kevin R. Farrell, Richard J. Mammone, and Khalel T. Assaleh, "Speaker Recognition Using Neural Networks and Conventional Classifiers," IEEE Trans. Speech and Audio Processing, vol. 2, No. 1, Part II, p. 194-205, Jan. 1994.
Higgins et al., "Speaker Verification Using Randomized Phrase Prompting", Digital Signal Processing, vol. 1:89-106, (1991).
Rosenberg et al., "The Use of Cohort Normalized Scores For Speaker Verification", ICSLP92, pp. 599-602.
Matsui et al., "Speaker Recognition Using Concatenated Phoneme Models", ICSLP92, pp. 603-606, (1992).
Kai-Fulee, "Large-Vocabulary Speaker-Independent Continuous Speech Recogition: The Shinx System", CMU-CS-88-148, pp. 103-108, (1988.4).
Furui, "Digital Speech Processing", pp. 44-47, (1985) (ET p. 64-67).
Kosaka et al., "Tree-Structured Speaker Clustering For Speaker Adaptation", Singakugihou, SP93-110, pp. 49-54, (1993-12).
Sakoe et al., "Recognition of Continuously Spoken Words Based on Time-Normalization", pp. 483-490 (1971).
Rabiner et al., "On the Application of Vector Quantization and Hidden Morkov Models to Speaker-Independent, Isolated Word Recognition", pp. 1075-1105 (1983).
Hattori Hiroaki
Yamada Eiko
Hudspeth David R.
NEC Corporation
Smits Talivaldis Ivars
LandOfFree
Tree structured cohort selection for speaker recognition system does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Tree structured cohort selection for speaker recognition system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Tree structured cohort selection for speaker recognition system will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-515601