Natural language determination using correlation between common

Data processing: speech signal processing – linguistics – language – Linguistics – Multilingual or national language support

Patent


Details

704/1, 707/536, G06F 17/28, G06F 17/21

Patent

active

060236701

ABSTRACT:
The language in which a computer document is written is identified. A plurality of words from the document are compared to words in a word list associated with a candidate language. The words in the word list are a selection of the most frequently used words in the candidate language. A count of matches between words in the document and words in the word list is accumulated for each word in the word list to produce a sample count. The sample count is correlated to a reference count for the candidate language to produce a correlation score for the candidate language. The language of the document is identified based on the correlation score. Generally, there are a plurality of candidate languages. Thus, the comparing, accumulating, correlating and identifying processes are practiced for each candidate language. The language of the document is identified as the candidate language whose reference count generates the highest correlation score.
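
The steps in the abstract (compare, accumulate, correlate, identify) can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the abstract does not say which correlation measure is used, so a Pearson correlation coefficient is assumed here, and the word lists, the reference counts in REFERENCE_COUNTS, and the function names are all hypothetical toy values chosen for the example.

```python
import math
import re
from collections import Counter

# Hypothetical toy reference counts for the most frequent words of each
# candidate language. The numbers are illustrative, not taken from the patent.
REFERENCE_COUNTS = {
    "english": {"the": 70, "of": 36, "and": 29, "to": 26, "a": 23, "in": 21},
    "german": {"der": 35, "die": 34, "und": 29, "in": 19, "den": 13, "von": 12},
}


def tokenize(text):
    """Split text into lowercase runs of letters."""
    return re.findall(r"[^\W\d_]+", text.lower())


def sample_counts(words, word_list):
    """Count matches in the document for each word of the word list."""
    counts = Counter(w for w in words if w in word_list)
    return [counts[w] for w in word_list]


def pearson(xs, ys):
    """Pearson correlation (an assumed choice of correlation measure)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # no variation, so no usable correlation
    return cov / math.sqrt(var_x * var_y)


def identify_language(text, references=REFERENCE_COUNTS):
    """Score every candidate language and return the best one with all scores."""
    words = tokenize(text)
    scores = {}
    for language, reference in references.items():
        word_list = list(reference)
        sample = sample_counts(words, word_list)
        expected = [reference[w] for w in word_list]
        scores[language] = pearson(sample, expected)
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    doc = "The cat sat in the shade of the tree and the dog ran to a corner of the yard."
    language, scores = identify_language(doc)
    print(language, scores)
```

With realistic word lists, each candidate language would carry a much longer list of frequent words, and the reference counts would come from a large sample of text in that language; the toy tables above only show the shape of the computation.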

REFERENCES:
patent: 4610025 (1986-09-01), Blum et al.
patent: 4773009 (1988-09-01), Kucera et al.
patent: 4829580 (1989-05-01), Church
patent: 5062143 (1991-10-01), Schmitt
patent: 5182708 (1993-01-01), Ejiri
patent: 5251131 (1993-10-01), Masand et al.
patent: 5371673 (1994-12-01), Fan
patent: 5371807 (1994-12-01), Register et al.
patent: 5392419 (1995-02-01), Walton
patent: 5418951 (1995-05-01), Damashek
patent: 5548507 (1996-08-01), Martino et al.
patent: 5623609 (1997-04-01), Kaye et al.
