Patent
1996-12-20
2000-02-08
Thomas, Joseph
Data processing: speech signal processing, linguistics, language
Linguistics
Multilingual or national language support
704/1, 707/536, G06F 17/28, G06F 17/21
active
060236701
ABSTRACT:
The language in which a computer document is written is identified. A plurality of words from the document are compared to words in a word list associated with a candidate language. The words in the word list are a selection of the most frequently used words in the candidate language. A count of matches between words in the document and words in the word list is accumulated for each word in the word list to produce a sample count. The sample count is correlated to a reference count for the candidate language to produce a correlation score for the candidate language. The language of the document is identified based on the correlation score. Generally, there are a plurality of candidate languages. Thus, the comparing, accumulating, correlating and identifying processes are practiced for each language. The language of the document is identified as the candidate language having a reference count which generates the highest correlation score.
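The steps in the abstract (compare, accumulate, correlate, identify) can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say which correlation function is used, so Pearson correlation is assumed here. The word lists and reference counts in `REFERENCE_COUNTS` are small, made-up values for illustration only, and the function names are hypothetical.

```python
import math
import re

# Hypothetical toy reference data: a handful of common words per candidate
# language with invented reference counts. Real word lists would be larger
# and derived from actual frequency studies.
REFERENCE_COUNTS = {
    "english": {"the": 69, "of": 36, "and": 28, "to": 26, "in": 21},
    "german": {"der": 35, "die": 34, "und": 29, "in": 18, "zu": 10},
    "french": {"de": 50, "la": 30, "le": 28, "et": 25, "les": 20},
}


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (assumed measure)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # No variation (e.g. no matches at all): treat as uncorrelated.
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def score_language(words, reference):
    """Accumulate per-word match counts, then correlate with the reference."""
    sample = dict.fromkeys(reference, 0)
    for word in words:
        if word in sample:
            sample[word] += 1  # one match against the candidate word list
    keys = list(reference)
    return pearson([sample[k] for k in keys], [reference[k] for k in keys])


def identify_language(text):
    """Return (best_language, scores) over all candidate languages."""
    # Lowercase and split into runs of letters; digits and punctuation drop out.
    words = re.findall(r"[^\W\d_]+", text.lower())
    scores = {
        lang: score_language(words, ref)
        for lang, ref in REFERENCE_COUNTS.items()
    }
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    best, scores = identify_language(
        "The cat sat in the garden of the house and went to the market."
    )
    print(best, scores)
```

Correlating the whole vector of per-word counts, rather than summing total matches, means a word shared between lists (such as "in" above) counts toward a language only when the overall frequency profile also fits.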
REFERENCES:
patent: 4610025 (1986-09-01), Blum et al.
patent: 4773009 (1988-09-01), Kucera et al.
patent: 4829580 (1989-05-01), Church
patent: 5062143 (1991-10-01), Schmitt
patent: 5182708 (1993-01-01), Ejiri
patent: 5251131 (1993-10-01), Masand et al.
patent: 5371673 (1994-12-01), Fan
patent: 5371807 (1994-12-01), Register et al.
patent: 5392419 (1995-02-01), Walton
patent: 5418951 (1995-05-01), Damashek
patent: 5548507 (1996-08-01), Martino et al.
patent: 5623609 (1997-04-01), Kaye et al.
Martino Michael John
Paulsen, Jr. Robert Charles
International Business Machines Corporation
LaBaw Jeffrey S.
Thomas Joseph
Natural language determination using correlation between common
Profile ID: LFUS-PAI-O-1688289