Apparatus and method for retrieving data from a document...

Data processing: presentation processing of document – operator i – Presentation processing of document – Layout

Reexamination Certificate

Details

C707S793000, C704S009000, C704S010000

active

06602300

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to an information retrieving apparatus and a method thereof, and in particular to an apparatus and method suitable for the case in which the language of an input keyword differs from the language of the database from which data is retrieved.
2. Description of the Related Art
In a conventional information retrieving apparatus, when the language of a keyword that is input by a user (this language is hereinafter referred to as input side language) is different from the language of a database from which data corresponding to the input keyword is retrieved (hereinafter this language is referred to as database side language), data is retrieved through a machine-translating process.
Here, the word “keyword” denotes the user query, that is, the user input to the apparatus.
In other words, the input keyword is converted from the input side language into the database side language. With the converted keyword, data is retrieved from the database. The retrieved results in the database side language are converted into the input side language and then displayed on a monitor.
In an information retrieving apparatus using a conventional machine-translating process, the hit rate is increased by expanding an input keyword into synonyms. In addition, an apparatus that performs logical operations on the expanded keywords so as to retrieve data has been proposed.
Moreover, a ranking retrieving process has been used that ranks the retrieved results of an information retrieving apparatus according to the match rates between the retrieval keywords and the retrieved data. In the ranking retrieving process, the retrieved results are ranked with keywords converted into the database side language. The ranked results are converted into the input side language and presented to the user.
Now, assume that by inputting a keyword written in Japanese, data corresponding to the keyword is retrieved from a database described in English. In this case, the input keyword described in Japanese is converted into an equivalent keyword described in English. With the keyword described in English, data is retrieved from the database described in English. The retrieved results described in English are translated into Japanese. Thereafter, the retrieved results described in Japanese are presented to the user. In the ranking retrieving process, the retrieved results described in English are ranked with keywords converted into English. The ranked results are translated into Japanese and then provided to the user.
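The conventional flow described above can be sketched as follows. This is only an illustration under stated assumptions: the dictionaries `JA_TO_EN` and `EN_TO_JA_DOCS`, the three sample documents, and the function `conventional_search` are all hypothetical stand-ins for a real machine-translating process and database, and ranking by keyword count is a simplification of a match rate.

```python
# Hypothetical keyword dictionary (input side language -> database side language).
JA_TO_EN = {"銀行": "bank"}

# Hypothetical English database.
EN_DOCS = {
    "d1": "the river bank was flooded",
    "d2": "the bank approved the loan",
    "d3": "weather report for monday",
}

# Hypothetical stand-in for machine translation of retrieved documents.
EN_TO_JA_DOCS = {
    "d1": "川岸が浸水した",
    "d2": "銀行が融資を承認した",
}


def conventional_search(keyword_ja):
    # 1. Convert the Japanese keyword into an English keyword.
    keyword_en = JA_TO_EN[keyword_ja]
    # 2. Retrieve English documents containing the English keyword.
    hits = [d for d, text in EN_DOCS.items() if keyword_en in text.split()]
    # 3. Rank in English, using the converted keyword (stable sort on ties).
    ranked = sorted(
        hits,
        key=lambda d: EN_DOCS[d].split().count(keyword_en),
        reverse=True,
    )
    # 4. Translate the ranked results into Japanese for presentation.
    return [(d, EN_TO_JA_DOCS[d]) for d in ranked]


for doc_id, text_ja in conventional_search("銀行"):
    print(doc_id, text_ja)
```

In this toy data both hits contain "bank" once, so the ranking cannot tell the river bank apart from the financial institution, and the document that does not contain 銀行 after translation is listed first.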
However, in the information retrieving apparatus using the conventional machine-translating process, when an input keyword is expanded into synonyms and a keyword described in the input side language is translated into the database side language, some variation in meaning may take place. In other words, the nuance of a keyword described in the input side language may be different from the nuance of a keyword described in the database side language. Thus, data that does not directly correlate with a keyword described in the input side language may be retrieved. In such a situation, when the retrieved results described in the database side language are ranked using the keyword translated into the database side language, the nuance of the keyword described in the input side language is not reflected in the ranked results described in the database side language. Consequently, the ranked results may be contrary to the intention of the user.
For example, when data is retrieved from a database described in English with a keyword input in Japanese, the retrieved results are ranked by comparing the keyword converted into English with the retrieved results described in English. Thus, documents containing the keyword converted into English are highly ranked. Unless a keyword is correctly converted from Japanese into English, documents that do not reflect the meaning of the keyword described in Japanese are highly ranked.
SUMMARY OF THE INVENTION
An object of the present invention is to provide an information retrieving apparatus that can output retrieved results corresponding to an input keyword even if the language of the input keyword is different from the language of a database from which data is retrieved.
According to an aspect of the present invention, an information retrieving apparatus comprises an inputting unit for inputting a retrieval request described in a first data format, a generating unit for generating retrieval information described in a second data format based on the retrieval request described in the first data format, a retrieving unit for retrieving data described in the second data format based on the retrieval information described in the second data format, a converting unit for converting the retrieved results from the second data format into the first data format, and an evaluating unit for evaluating the retrieved results converted into the first data format based on the retrieval request described in the first data format.
Thus, even if the data format of the retrieved results is different from the data format of the retrieval request, the data format of the retrieved results can be matched with the data format of the retrieval request. Consequently, the retrieved results can be evaluated without need to convert the data format of the retrieval request. As a result, the retrieved results exactly corresponding to the retrieval request can be obtained free of any variation in meaning caused by a conversion process of the data format of the retrieval request.
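A minimal sketch of the generating, retrieving, converting, and evaluating units might look like the following. It assumes Japanese as the first data format and English as the second; the function names, the dictionaries, and the scoring by substring count are hypothetical choices made for illustration and are not taken from the specification.

```python
# Hypothetical dictionary and database; all entries are illustrative only.
JA_TO_EN = {"銀行": "bank"}
EN_DOCS = {
    "d1": "the river bank was flooded",
    "d2": "the bank approved the loan",
    "d3": "weather report for monday",
}
# Hypothetical stand-in for converting retrieved results into Japanese.
EN_TO_JA_DOCS = {
    "d1": "川岸が浸水した",
    "d2": "銀行が融資を承認した",
}


def generate(request_ja):
    """Generating unit: retrieval information in the second data format."""
    return JA_TO_EN[request_ja]


def retrieve(info_en):
    """Retrieving unit: search data described in the second data format."""
    return [d for d, text in EN_DOCS.items() if info_en in text.split()]


def convert(doc_ids):
    """Converting unit: retrieved results back into the first data format."""
    return {d: EN_TO_JA_DOCS[d] for d in doc_ids}


def evaluate(request_ja, converted):
    """Evaluating unit: rank converted results against the ORIGINAL request."""
    return sorted(
        converted,
        key=lambda d: converted[d].count(request_ja),
        reverse=True,
    )


def search(request_ja):
    return evaluate(request_ja, convert(retrieve(generate(request_ja))))


print(search("銀行"))
```

Because the evaluation compares the converted results with the request as the user typed it, the document that actually mentions 銀行 is ranked ahead of the one that matched only through the converted keyword.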
According to a further aspect of the present invention, the retrieval information described in the second data format is generated based on the key information (keyword) extracted from the retrieval request in the first data format.
Thus, since the key information is extracted in the first data format, the key information can be extracted free of a variation in meaning caused by a conversion process of data, in comparison with the case that the key information is extracted after the data format is converted into the second data format. Consequently, the key information can be extracted exactly corresponding to a retrieval request.
According to an aspect of the present invention, the retrieval information described in the second data format is generated based on the expanded results in the first data format.
Thus, since the retrieval request is expanded in the first data format, the retrieval request can be expanded free of a variation in meaning caused by the conversion process of data in comparison with the case that the retrieval request is expanded after the data format is converted into the second data format.
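As an illustration of expanding the request before any conversion takes place, the sketch below expands a Japanese term into Japanese synonyms and only then generates English retrieval information. The synonym table, the dictionary, and the function names are assumptions made for this example.

```python
# Hypothetical synonym table in the first data format (Japanese).
JA_SYNONYMS = {"車": ["車", "自動車", "乗用車"]}

# Hypothetical dictionary from the first to the second data format.
JA_TO_EN = {"車": "car", "自動車": "automobile", "乗用車": "passenger car"}


def expand(request_ja):
    """Expand the request into synonyms while still in the first data format."""
    return JA_SYNONYMS.get(request_ja, [request_ja])


def generate(expanded_ja):
    """Generate second-format retrieval information from the expanded results."""
    return [JA_TO_EN[term] for term in expanded_ja if term in JA_TO_EN]


expanded = expand("車")
print(expanded)
print(generate(expanded))
```

The expanded Japanese terms remain available afterwards, so they can also serve as the basis for evaluating the converted results.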
According to an aspect of the present invention, the retrieval information described in the second data format is generated based on the results of a logical operation in the first data format.
Thus, since the logical operation of the retrieval request is performed in the first data format, the logical operation can be performed free of a variation in meaning caused by the conversion process of data, in comparison with the case that the logical operation is performed after the data format is converted into the second data format. Consequently, the logical operation can be performed exactly corresponding to the retrieval request.
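One way to picture a logical operation performed in the first data format is to build the Boolean expression from Japanese terms and convert only its leaves when generating the retrieval information. The nested-tuple representation, the operator names, and both functions below are hypothetical and serve only as a sketch.

```python
# Hypothetical dictionary from the first to the second data format.
JA_TO_EN = {"銀行": "bank", "川": "river"}


def generate(query, dictionary):
    """Convert the leaf terms of a Boolean query, keeping its structure."""
    if isinstance(query, str):
        return dictionary[query]
    op, *args = query
    return (op, *[generate(arg, dictionary) for arg in args])


def matches(query, words):
    """Evaluate a Boolean query against the set of words in a document."""
    if isinstance(query, str):
        return query in words
    op, *args = query
    if op == "AND":
        return all(matches(arg, words) for arg in args)
    if op == "OR":
        return any(matches(arg, words) for arg in args)
    if op == "NOT":
        return not matches(args[0], words)
    raise ValueError(f"unknown operator: {op}")


# The logical operation is composed in Japanese: 銀行 AND NOT 川.
query_ja = ("AND", "銀行", ("NOT", "川"))
query_en = generate(query_ja, JA_TO_EN)
print(query_en)
print(matches(query_en, set("the bank approved the loan".split())))
print(matches(query_en, set("the river bank was flooded".split())))
```

Since the structure of the query is fixed before conversion, the conversion step can only affect individual terms and not how they are combined.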
According to an aspect of the present invention, the retrieved results described in the second data format are converted into the first data format. The retrieved results converted into the first data format are evaluated based on the key information, the expanded results, or the results of the logical operation.
Thus, even if data whose data format is different from the data format of the retrieval request is retrieved, the results retrieved over a wide range can be evaluated without the need to convert the data format of the retrieval request. Consequently, the retrieved results can be evaluated exactly corresponding to the retrieval request, free of any variation in meaning or nuance caused by a conversion process of the retrieval request.
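The evaluation of converted results against expanded results can be sketched as below. The scoring rule (the total number of occurrences of the first-format terms) is an assumption chosen for simplicity, and the sample texts and the function name `evaluate` are hypothetical.

```python
def evaluate(converted_results, terms):
    """Rank first-format texts by how often the first-format terms occur."""

    def score(text):
        return sum(text.count(term) for term in terms)

    return sorted(converted_results, key=score, reverse=True)


# Hypothetical expanded results in the first data format (Japanese).
terms_ja = ["銀行", "金融機関"]

# Hypothetical retrieved results after conversion into Japanese.
converted = [
    "川岸が浸水した",
    "銀行が融資を承認した",
    "金融機関である銀行が銀行業務を拡大した",
]

for text in evaluate(converted, terms_ja):
    print(text)
```

The same function would accept the key information alone by passing a single-element list of terms.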
According to an aspect of the present invention
