Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
1999-09-03
2001-07-17
Breene, John (Department: 2777)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000
Reexamination Certificate
active
06263329
ABSTRACT:
FIELD OF THE INVENTION
The present invention relates to the field of computerized information search and retrieval systems. More particularly, this invention relates to a method and apparatus for retrieving documents in languages other than the language of the search query.
BACKGROUND OF THE INVENTION
Due to rapid advances made in electronic storage technology, documents are increasingly being stored on electronic computer devices. Not only are documents being generated in the first instance in computer readable form, but documents that have heretofore existed only on paper are now being scanned for the purpose of taking advantage of the many benefits that electronic storage units have to offer. One of the principal advantages associated with electronic storage is that previously printed materials that formerly occupied a tremendous amount of space can now be stored in much less space. Also, electronic databases can be searched from locations around the world. This means that information stored in databases from many different parts of the world is widely available.
As a result of this worldwide activity, vast computerized databases of documents have been developed. However, many documents that exist in these collections appear in languages that the user of the database is not familiar with. This makes the retrieval of many relevant documents cumbersome if not impossible using conventional computer search techniques. This is because conventional search techniques rely on the ability of the user to create a query that is useful in the database. Since users may not be familiar with the language of particular databases, those databases are not accessible to such users by conventional techniques. As a result, substantial efforts have been directed to developing procedures by which search queries crafted in one language could be used to retrieve relevant documents existing in another language.
Conventional techniques for retrieving foreign language documents simply use a translator or a machine translation system to translate the user's query. These systems attempt to generate a foreign language query that captures the semantic meaning of the query in the language of the user. Since many words or phrases do not translate directly into other languages, the translation system must choose the phrase or phrases as they are used in context in the language of the database that most closely match the semantic meaning of the query. Relying on the translation system to provide this semantic meaning is often a mistake which results in retrieving irrelevant documents. More importantly, this mistake results in not retrieving the most relevant documents. A further disadvantage of machine translation systems is that they are difficult to create and, even when they operate properly, they make mistakes. As a result, they are difficult to use. The problems associated with these retrieval methods highlight the need for the user to be able to retrieve relevant foreign documents without knowledge on the part of either the user or the retrieval system of the semantic meaning of the query in a foreign language.
OBJECTS OF THE INVENTION
It is an object of the present invention to provide a method of retrieving documents from a database.
It is another object of the present invention to provide a method for retrieving relevant documents from a database in which documents are stored in a foreign language.
It is still another object of the present invention to provide a method for retrieving relevant documents from a foreign language database without relying on a semantic translation of a query.
It is still a further object of the invention to provide a method for retrieving relevant documents that is adaptable to many different foreign languages without relying on a semantic translations of a query.
SUMMARY OF THE INVENTION
The present invention provides a method and apparatus for retrieving documents that are stored in a language other than the language that is used to formulate a search query. This invention decomposes the query into terms and then translates each of the terms into terms of the language of the database. Once the database language terms have been listed, a series of subqueries is formed by creating all the possible combinations of the listed terms. Each subquery is then scored on each of the documents in the target language database. Only those subqueries that return meaningful scores are relevant to the query. Thus, the semantic meaning of the query is determined against the database itself and those documents in the database language that are most relevant to that semantic meaning are returned.
REFERENCES:
patent: 5278980 (1994-01-01), Pedersen et al.
patent: 5301109 (1994-04-01), Landauer et al.
patent: 5497319 (1996-03-01), Chong et al.
patent: 5535120 (1996-07-01), Chong et al.
patent: 5675819 (1997-10-01), Schuetze
patent: 5694523 (1997-12-01), Wical
patent: 5694592 (1997-12-01), Driscoll
patent: 5696962 (1997-12-01), Kupiec
patent: 5724571 (1998-03-01), Woods
patent: 5761631 (1998-06-01), Nasukawa
Breene John
Claritech
Harper Blaney
Jones Day Reavis & Pogue
Lewis Cheryl
LandOfFree
Method and apparatus for cross-linguistic database retrieval does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for cross-linguistic database retrieval, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for cross-linguistic database retrieval will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2528787