Multi-language document search and retrieval system

Data processing: speech signal processing – linguistics – language – Linguistics – Natural language

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C715S252000

Reexamination Certificate

active

07369987

ABSTRACT:
A multi-lingual indexing and search system is presented that performs tokenization and stemming in a manner which is independent of whether index entries and search terms appear as words in a dictionary. The system includes a tokenizer that separates a string of text into individual word tokens, and eliminates predetermined types of tokens from further processing. The system also includes a stemmer that reduces words to grammatical stems by removing known word-endings associated with the various languages to be supported. The stemmer removes known word endings from the word tokens without any effort to guarantee that the remaining stem is contained in a dictionary. In an embodiment, the stemmer only removes those word endings which are associated with nouns. The system further includes an indexer that stores the stems in an index.

REFERENCES:
patent: 5276616 (1994-01-01), Kuga et al.
patent: 5301109 (1994-04-01), Landauer et al.
patent: 5369577 (1994-11-01), Kadashevich et al.
patent: 5475587 (1995-12-01), Anick et al.
patent: 5594641 (1997-01-01), Kaplan et al.
patent: 5983171 (1999-11-01), Yokoyama et al.
patent: 6006221 (1999-12-01), Liddy et al.
patent: 6038527 (2000-03-01), Renz
patent: 6101492 (2000-08-01), Jacquemin et al.
Wechsier, Martin et al., “Multi-Language Text Indexing for Internet Retrieval”, Swiss Federal Institute of Techology, 1997, 16 pages.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Multi-language document search and retrieval system does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Multi-language document search and retrieval system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multi-language document search and retrieval system will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2793492

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.