Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Patent
1997-08-06
2000-06-06
Isen, Forester W.
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
704257, G06F 1728, G10L 506, G10L 900
Patent
active
060730919
ABSTRACT:
A method of forming a language model for a language having a selected vocabulary of word forms comprises: (a) mapping the word forms into integer vectors in accordance with frequencies of word form occurrence; (b) partitioning the integer vectors into subsets, the subsets respectively having ranges of frequencies of word form occurrence associated therewith, the subsets being arranged in a descending order of frequency ranges; (c) respectively assigning maps to the subsets; (d) filtering a textual corpora using the maps assigned to the subsets in order to generate indexed integers; (e) determining n-gram statistics for the indexed integers; and (f) estimating n-gram language model probabilities from the n-gram statistics to form the language model.
REFERENCES:
patent: 5467425 (1995-11-01), Lau et al.
patent: 5490061 (1996-02-01), Tolin et al.
patent: 5680511 (1997-10-01), Baker et al.
patent: 5828999 (1998-10-01), Bellegarda et al.
patent: 5835888 (1998-11-01), Kanevsky et al.
L. Bahl et al., F. Jelinek, R. Mercer, "A Maximum Likelihood Approach to Continuous Speech Recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, No. 2, Mar. 1983, pp. 179-190, IV, Language Modeling on p. 181.
Kanevsky Dimitri
Monkowski Michael Daniel
Sedivy Jan
Edouard Patrick N.
International Business Machines - Corporation
Isen Forester W.
LandOfFree
Apparatus and method for forming a filtered inflected language m does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Apparatus and method for forming a filtered inflected language m, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus and method for forming a filtered inflected language m will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2222921