System and methods for document retrieval using natural...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

06687689

ABSTRACT:

The present application is related to the following commonly-owned U.S. patent application(s), the disclosures of which are hereby incorporated by reference in their entirety, including any incorporations-by-reference, appendices, or attachments thereof, for all purposes:
Ser. No. 09/614,465, filed on <the same day as the present application>, and entitled S
YSTEM AND
M
ETHODS FOR
D
ETERMINING
S
EMANTIC
S
IMILARITY OF
S
ENTENCES;
Ser. No. 60/212484, filed on <the same day as the present application>, and entitled S
YSTEM AND
M
ETHODS FOR
A
CCEPTING
U
SER
I
NPUT IN A
D
ISTRIBUTED
E
NVIRONMENT IN A
S
CALABLE
M
ANNER
; and
Ser. No. 09/614,050, filed on <the same day as the present application>, and entitled S
YSTEM AND
M
ETHODS FOR
F
ACILITATING
M
ANUAL
E
NTRY OF, AND
U
SE OF,
I
DEOGRAPHIC
T
EXT IN A
C
OMPUTER.
COPYRIGHT NOTICE
A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
BACKGROUND OF THE INVENTION
The present invention relates to information retrieval. More especially, the present invention relates to document retrieval using the Chinese language, or like languages. Even more especially, the present invention relates to document retrieval involving remote servers on a communication network, for example, World Wide Web sites on the Internet.
The World Wide Web (the Web) is a mine of information. Unfortunately, it is frequently not easy to find needed information from the Web. The problem is not that the Web does not have the needed information. Rather, the problem is that the Web has too much information that is not needed. Various online search engines attempt to help users find just the information that is most needed by the user, based on queries supplied by user. Most of these search engines require their users to learn and use particular query syntaxes, perhaps syntaxes that require keywords combined by boolean operators. Learning and mastering such syntaxes is inconvenient for the users. More recently, some search engines have begun to allow users to enter English-language queries in the form of natural-language sentences. Nevertheless, there is still much room for improvement.
In particular, although some search engines now allow users to enter queries in the form of natural-language sentences, there is still a need to improve such systems so that they process queries to produce only the most relevant documents. Further, there is a need for systems and methods that allow users to search for documents using natural language sentences that include words of the Chinese language, or similar languages. Still firther, such improved systems and methods should still be efficient and suitable for large-scale, real-time use on the Internet or on other communication networks. The present invention satisfies these and other needs.
SUMMARY OF THE INVENTION
A system and associated methods identify documents relevant to an inputted natural-language user query. According to one aspect of the invention, relevant documents are identified by: selecting a set of keywords from the user query; determining at least one word, not necessarily found in the user query, that is semantically similar to a keyword of the set of keywords; using the set of keywords and the at least one word to determining a subset of word sets from a database of pre-stored word sets, wherein the pre-stored word sets are each preassociated with at least one document; determining a plurality of word sets, from the subset of word sets, that is most semantically similar to the user query; and identifying documents that have been pre-associated with the plurality of word sets as being relevant to the natural-language user query.
According to another aspect of the invention, a system identifies relevant documents. The system includes means for selecting a set of keywords from the user query; means for determining at least one word, not necessarily found in the user query, that is semantically similar to a keyword of the set of keywords; means for using the set of keywords and the at least one word to determining a subset of word sets from a database of pre-stored word sets, wherein the pre-stored word sets are each pre-associated with at least one document; means for determining a plurality of word sets, from the subset of word sets, that is most semantically similar to the user query; and means for identifying documents that have been pre-associated with the plurality of word sets as being relevant to the natural-language user query.


REFERENCES:
patent: 5164900 (1992-11-01), Bernath
patent: 5297039 (1994-03-01), Kanaegami et al.
patent: 5680511 (1997-10-01), Baker et al.
patent: 5764851 (1998-06-01), Pengwu
patent: 5822729 (1998-10-01), Glass
patent: 5835924 (1998-11-01), Maruyama et al.
patent: 5897616 (1999-04-01), Kanevsky et al.
patent: 5907841 (1999-05-01), Sumita et al.
patent: 6006175 (1999-12-01), Holzrichter
patent: 6138116 (2000-10-01), Kitagawa et al.
patent: 6163768 (2000-12-01), Sherwood et al.
patent: 6175828 (2001-01-01), Kuromusha et al.
patent: 6178401 (2001-01-01), Franz et al.
patent: 6182038 (2001-01-01), Balakrishnan et al.
patent: 6219638 (2001-04-01), Padmanabhan et al.
patent: 6233547 (2001-05-01), Denber
patent: 6260008 (2001-07-01), Sanfilippo
patent: 6345271 (2002-02-01), Dempsey et al.
patent: 6397259 (2002-05-01), Lincke et al.
patent: 6446041 (2002-09-01), Reynar et al.
patent: 6446064 (2002-09-01), Livowsky
patent: 6453315 (2002-09-01), Weissman et al.
patent: 6456969 (2002-09-01), Beyerlein
patent: 6502073 (2002-12-01), Guan et al.
patent: 2002/0048350 (2002-04-01), Phillips et al.
patent: 2002/0082829 (2002-06-01), Jiang et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and methods for document retrieval using natural... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and methods for document retrieval using natural..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and methods for document retrieval using natural... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3302858

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.