Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Reexamination Certificate
2007-08-07
2007-08-07
Hudspeth, David (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
C704S010000, C704S240000, C704S250000, C704S257000
Reexamination Certificate
active
11225543
ABSTRACT:
A method and apparatus are provided for adapting a language model to a task-specific domain. Under the method and apparatus, the relative frequency of n-grams in a small training set (i.e. task-specific training data set) and the relative frequency of n-grams in a large training set (i.e. out-of-domain training data set) are used to weight a distribution count of n-grams in the large training set. The weighted distributions are then used to form a modified language model by identifying probabilities for n-grams from the weighted distributions.
REFERENCES:
patent: 6188976 (2001-02-01), Ramaswamy et al.
Office Action (Sep. 2, 2005) and Amendment (Sep. 14, 2005) from Appl. No. 09/945,930, filed Sep. 4, 2001.
R. Iyer, M. Ostendorf, H. Gish, “Using Out-of-Domain Data to Improve In-Domain Language Models,” IEEE Signal Processing letters, vol. 4, No. 8, pp. 221-223 (Aug. 1997).
K. Seymore, R. Rosenfeld, “Large-Scale Detection and Language Model Adaptation” (Jun. 1997).
J. Gao, K.F. Lee, “Distribution-Based Pruning of Backoff Language Models,” In Proceedings of the Annual Meeting of the ACL, Hong Kong, 7 pages (Oct. 3-6, 2000).
“A Language Model Adaption for Fixed Phrases by Emphasizing N-Gram Subsets,” Tomoyoshi et al. (2003).
1992 IEEE, “Adaptive Language Modeling Using Minimum Discriminant Estimation,” Pietra et al.
1994 Academic Press Limited, “A Weighted Average N-Gram Model of Natural Language,” O'Boyle et al.
P. Clarkson and A. Robinson, “Language Model Adaption Using Mixtures and An Exponentially Decaying Cache,” In. Proc. ICASSP-97, pp. 799-802 (1997).
Gao Jianfeng
Li Mingjing
Hudspeth David
Magee Theodore M.
MIcrosoft Corporation
Serrou Abselali
LandOfFree
Method and apparatus for distribution-based language model... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for distribution-based language model..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for distribution-based language model... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3899307