Language identification process using coded language words

Boots – shoes – and leggings

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

36441902, 36441908, G06F 1727

Patent

active

055485073

ABSTRACT:
Provides a process which identifies the language or genre of a stored or transmitted document. The process uses a plurality of Word Frequency Tables (WFTs) respectively associated with languages/genre of interest. Each WFT contains a relatively few of the most common words of one of the languages of interest. Each word code in a WFT has an associated normalized frequency of occurrence value (NFO); use of NFOs increases the language/genre detection ability of the process. A plurality of respective accumulators are associated with the plurality of WFTs. All accumulators are set to zero before identification processing starts. The language/genre identification process receives a sequence of words from an inputted document, and compares each received word to all of the words in all WFTs. Whenever a received word is found in any WFT, the process adds the word's associated NFO to a current total in the associated accumulator. In this manner, totals in all accumulators build up into language discriminating values after a number of words are read from the document. Processing stops when either the end of the document is reached or when a predetermined number of words are received; and then the language/genre associated with the accumulator containing the largest total is the identified language.

REFERENCES:
patent: 4058795 (1977-11-01), Balm
patent: 4610025 (1986-09-01), Blum et al.
patent: 4829580 (1989-05-01), Church
patent: 5062143 (1991-10-01), Schmitt
patent: 5182708 (1993-01-01), Ejiri
patent: 5371807 (1994-12-01), Register et al.
patent: 5392419 (1995-02-01), Walton
patent: 5418951 (1995-05-01), Damashek

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Language identification process using coded language words does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Language identification process using coded language words, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Language identification process using coded language words will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2335300

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.