Method and apparatus for distribution-based language model...
Reexamination Certificate
2006-05-09
Dorvil, Richemond (Department: 2654)
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
C704S231000, C704S240000, C704S250000, C704S257000, C704S266000
active
07043422
ABSTRACT:
A method and apparatus are provided for adapting a language model to a task-specific domain. Under the method and apparatus, the relative frequency of n-grams in a small training set (i.e. task-specific training data set) and the relative frequency of n-grams in a large training set (i.e. out-of-domain training data set) are used to weight a distribution count of n-grams in the large training set. The weighted distributions are then used to form a modified language model by identifying probabilities for n-grams from the weighted distributions.
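The weighting step described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the abstract does not give the weighting function, so the ratio of relative frequencies, the `floor` and `exponent` parameters, the maximum-likelihood probability estimate, and all function names below are hypothetical choices made for the example.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def relative_freq(counts):
    """Relative frequency of each n-gram within its own training set."""
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()}


def weighted_counts(small_counts, large_counts, floor=0.1, exponent=1.0):
    """Weight each large-set count using small-set vs. large-set relative frequency.

    ASSUMPTION: weight = max(floor, (rf_small / rf_large) ** exponent).
    The abstract only says both relative frequencies are used; this exact
    formula, `floor`, and `exponent` are illustrative.
    """
    rf_small = relative_freq(small_counts)
    rf_large = relative_freq(large_counts)
    weighted = {}
    for gram, count in large_counts.items():
        ratio = rf_small.get(gram, 0.0) / rf_large[gram]
        weighted[gram] = max(floor, ratio ** exponent) * count
    return weighted


def bigram_probs(counts):
    """Maximum-likelihood P(w2 | w1) from (possibly weighted) bigram counts."""
    history_totals = {}
    for (w1, _), c in counts.items():
        history_totals[w1] = history_totals.get(w1, 0.0) + c
    return {(w1, w2): c / history_totals[w1] for (w1, w2), c in counts.items()}


# Toy data: a tiny task-specific set and a (slightly) larger out-of-domain set.
small = "open the file save the file".split()
large = "open the window close the window open the file".split()

small_bi = ngram_counts(small, 2)
large_bi = ngram_counts(large, 2)

baseline = bigram_probs(large_bi)                            # unadapted model
adapted = bigram_probs(weighted_counts(small_bi, large_bi))  # adapted model

print(baseline[("the", "file")], adapted[("the", "file")])
```

In this toy setup the bigram "the file" is relatively more frequent in the small set than in the large one, so its weighted count grows and the adapted model assigns it a higher conditional probability than the unadapted model does. A practical system would also need smoothing or backoff for unseen n-grams, which this sketch omits.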
REFERENCES:
Tomoyoshi et al., “A Language Model Adaptation Method for Fixed Phrases by Emphasizing N-gram Subsets.”
Pietra et al., “Adaptive Language Modeling Using Minimum Discriminant Estimation,” IEEE (1992).
O'Boyle et al., “A Weighted Average N-gram Model of Natural Language,” Academic Press Limited (1994).
R. Iyer, M. Ostendorf, H. Gish, “Using Out-of-Domain Data to Improve In-Domain Language Models,” IEEE Signal Processing Letters, vol. 4, No. 8, pp. 221-223 (Aug. 1997).
P. Clarkson and A. Robinson, “Language Model Adaptation Using Mixtures and An Exponentially Decaying Cache,” In Proc. ICASSP-97, pp. 799-802 (1997).
K. Seymore, R. Rosenfeld, “Large-Scale Detection and Language Model Adaptation” (Jun. 1997).
J. Gao, K.F. Lee, “Distribution-Based Pruning of Backoff Language Models,” In Proceedings of the Annual Meeting of the ACL, Hong Kong, 7 pages (Oct. 3-6, 2000).
Gao Jianfeng
Li Mingjing
Dorvil Richemond
Magee Theodore M.
Microsoft Corporation
Spooner Lamont
Westman Champlin & Kelly P.A.