System and method of dynamically generating an electronic...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

active

06654754

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The invention relates to information retrieval systems. More particularly, the invention relates to generating index information for data objects.
2. Description of the Related Technology
Information retrieval (IR) systems index documents by searching for keywords that are contained within the documents. Typically, the searches are not performed on the documents themselves. Instead, words are extracted from the document and are then indexed in separate data structures optimized for searching.
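The extract-then-index step described above can be sketched as a small inverted index. This is only an illustration under assumed names (extract_words, build_inverted_index, and the sample documents are hypothetical); the source does not specify which data structure any particular IR system uses.

```python
import re
from collections import defaultdict


def extract_words(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(documents):
    """Map each extracted word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in extract_words(text):
            index[word].add(doc_id)
    return index


# Hypothetical sample documents, used only for illustration.
docs = {
    "doc1": "Secure documents present a special problem.",
    "doc2": "Index information for secure data objects.",
}
index = build_inverted_index(docs)
print(sorted(index["secure"]))  # ['doc1', 'doc2']
```

A keyword search then consults the index rather than the documents themselves, which is why the indexer normally needs full access to each document's contents.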
However, secure documents, such as documents that are protected by digital rights management (DRM) software, present a special problem for IR systems. Traditionally, IR systems rely upon having full access to the contents of a document to prepare the index information for that document. For example, IR systems that index HyperText Markup Language (HTML) documents on the Internet typically open each HTML document via its Uniform Resource Locator (URL), then download, parse, and index the entire document.
Secure software, however, does not permit this kind of unrestricted access. Access is restricted to those applications that are both authorized and trusted by the secure software. For security reasons, all other applications are prevented from accessing the protected document.
One way to solve this problem is to retrofit all pre-existing IR systems so that they are “rights enabled.” This solution permits IR systems to communicate directly with secure software to obtain the document source. However, this approach makes a number of unrealistic assumptions, including: (i) that it is possible to retrofit legacy IR systems such that they would comply with the secure software's security requirements; (ii) that all secure system providers would be willing or able to make the necessary changes in a timely manner; and (iii) that it is possible to establish the necessary trust relationships between every secure provider, copyright holder, and IR system provider. This approach has attendant flaws and there is a need for a better solution.
Another problem with preparing index information for IR systems is that each IR system uses different indexing algorithms for organizing and storing information. IR systems often analyze the header of the electronic document when selecting the index information for the electronic document. The header includes meta-information regarding the content of the document. However, not all IR systems retrieve the same keywords from the electronic document when selecting the index information. For example, some IR systems remove duplicate words from the metatag information, while others do not. Similarly, some IR systems recognize phrases, while others do not. Accordingly, it is difficult to prepare index information that is ideally suited for use with more than one IR system.
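The duplicate-handling difference mentioned above can be illustrated by modelling two hypothetical IR systems that read the same metatag keywords. The function name select_keywords and the sample keyword list are assumptions made for this sketch, not behavior taken from any real IR system.

```python
def select_keywords(meta_keywords, remove_duplicates):
    """Return the keywords one IR system would keep from the metatag list.

    When remove_duplicates is True, repeated words are dropped while the
    first occurrence and the original order are preserved.
    """
    if not remove_duplicates:
        return list(meta_keywords)
    seen = set()
    kept = []
    for keyword in meta_keywords:
        if keyword not in seen:
            seen.add(keyword)
            kept.append(keyword)
    return kept


# Hypothetical metatag keywords for one electronic document.
meta = ["index", "search", "index", "rights"]
print(select_keywords(meta, remove_duplicates=True))   # ['index', 'search', 'rights']
print(select_keywords(meta, remove_duplicates=False))  # ['index', 'search', 'index', 'rights']
```

The two systems end up with different keyword lists for the same document, which is the mismatch that makes a single set of index information hard to tune for both.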
Thus, there is a need for a system for providing index information to IR systems. The system should be able to provide information to the IR systems that is almost as usable as the original document. Preferably, the system should not require the modification of any legacy IR systems. Furthermore, it should be difficult to reconstruct the original document source (or any reasonable facsimile thereof) from the provided index information. Finally, the system should be able to automatically customize the index information regarding an electronic document on an IR system-by-IR system basis.
SUMMARY OF THE INVENTION
In one embodiment of the invention, a method of generating index information for a data object comprises: generating index information for the data object, wherein the index information includes a set of one or more keywords; selecting one or more of the keywords from the index information; identifying one or more words that are associated with the selected keywords; and adding the identified words to the set of keywords, the identified words providing additional keywords for the index information for the data object.
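A minimal sketch of this keyword-expansion embodiment follows. It assumes a simple lookup table of associated words; the table ASSOCIATED_WORDS, its contents, and the function name expand_keywords are hypothetical, since the summary does not say how associated words are identified.

```python
# Hypothetical association table; a real system might derive this from a thesaurus.
ASSOCIATED_WORDS = {
    "car": ["automobile", "vehicle"],
    "doctor": ["physician"],
}


def expand_keywords(keywords, selected, associations):
    """Add the words associated with each selected keyword to the keyword set."""
    expanded = list(keywords)
    for keyword in selected:
        for word in associations.get(keyword, []):
            if word not in expanded:  # avoid adding a word twice
                expanded.append(word)
    return expanded


print(expand_keywords(["car", "engine"], ["car"], ASSOCIATED_WORDS))
# ['car', 'engine', 'automobile', 'vehicle']
```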
In another embodiment of the invention, a method of generating index information for a data object comprises: generating index information for the data object; identifying one or more words that are common to a group of data objects that includes the data object; and adding the identified words to the index information.
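The group-based embodiment can be sketched as follows, under the assumption that "common to a group" means "present in every data object of the group". That reading, along with the names common_words and add_group_words and the sample data, is an assumption of this sketch; the summary does not define the term.

```python
def common_words(group):
    """Return the words that appear in every data object of the group."""
    word_sets = [set(words) for words in group.values()]
    if not word_sets:
        return set()
    return set.intersection(*word_sets)


def add_group_words(index_keywords, group):
    """Add the group's common words to one data object's index keywords."""
    return sorted(set(index_keywords) | common_words(group))


# Hypothetical group of data objects, each reduced to its list of words.
group = {
    "a": ["patent", "index", "search"],
    "b": ["patent", "index", "rights"],
    "c": ["patent", "index", "query"],
}
print(add_group_words(["search"], group))  # ['index', 'patent', 'search']
```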
In yet another embodiment of the invention, a method of generating index information for a data object comprises: generating index information for the data object, wherein the index information includes a set of one or more keywords; identifying the roots of selected keywords; and substituting the roots for the selected keywords.
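Root substitution can be sketched with a deliberately naive suffix stripper. The suffix list and the names root_of and substitute_roots are hypothetical, and this is not a full stemming algorithm; the summary does not state how roots are identified.

```python
# Hypothetical, naive suffix list; checked in order, longest first.
SUFFIXES = ("ing", "ed", "es", "s")


def root_of(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def substitute_roots(keywords, selected):
    """Replace each selected keyword with its root; leave the others as-is."""
    return [root_of(kw) if kw in selected else kw for kw in keywords]


print(substitute_roots(["indexing", "documents", "searched"], {"indexing", "searched"}))
# ['index', 'documents', 'search']
```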
In yet another embodiment of the invention, a method of generating index information for a data object comprises: generating index information for the data object, wherein the index information includes a set of one or more keywords; classifying one or more of the keywords into one or more classifications; selecting at least one of the classifications; and removing one or more of the keywords that are members of a selected classification of the keywords.
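The classify-then-remove embodiment might look like the sketch below. The three classifications used here ("stopword", "number", "term"), the STOPWORDS set, and the function names are assumptions chosen for illustration; the summary does not say which classifications are used.

```python
# Hypothetical stopword list used by the illustrative classifier.
STOPWORDS = {"the", "of", "and", "a"}


def classify(keyword):
    """Assign a keyword to one illustrative classification."""
    if keyword in STOPWORDS:
        return "stopword"
    if keyword.isdigit():
        return "number"
    return "term"


def remove_classes(keywords, selected_classes):
    """Drop every keyword whose classification is in the selected set."""
    return [kw for kw in keywords if classify(kw) not in selected_classes]


print(remove_classes(["the", "index", "2001", "of", "keywords"], {"stopword", "number"}))
# ['index', 'keywords']
```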
In yet another embodiment of the invention, a method of generating index information for a data object comprises: generating index information for the data object, wherein the index information comprises one or more keywords; selecting one or more of the keywords; identifying, with a semantic network, one or more keywords that are associated with the selected keywords; and adding the identified keywords to the index information.
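A semantic network can be modelled, for illustration, as a graph whose nodes are words and whose edges link related words. The sketch below walks that graph breadth-first up to a hop limit. The network contents, the max_hops parameter, and the function names are all hypothetical; the summary does not describe the structure of the semantic network.

```python
from collections import deque

# Hypothetical semantic network; each edge is stored in both directions.
SEMANTIC_NETWORK = {
    "dog": ["canine", "pet"],
    "canine": ["dog", "mammal"],
    "pet": ["dog"],
    "mammal": ["canine"],
}


def related_keywords(network, start, max_hops=1):
    """Breadth-first walk returning words within max_hops edges of start."""
    seen = {start}
    queue = deque([(start, 0)])
    related = []
    while queue:
        node, hops = queue.popleft()
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in network.get(node, []):
            if neighbor not in seen:
                seen.add(neighbor)
                related.append(neighbor)
                queue.append((neighbor, hops + 1))
    return related


def add_semantic_keywords(index_keywords, selected, network, max_hops=1):
    """Add keywords related to each selected keyword to the index keywords."""
    result = list(index_keywords)
    for keyword in selected:
        for word in related_keywords(network, keyword, max_hops):
            if word not in result:
                result.append(word)
    return result


print(add_semantic_keywords(["dog", "leash"], ["dog"], SEMANTIC_NETWORK))
# ['dog', 'leash', 'canine', 'pet']
```

Raising max_hops widens the expansion (two hops from "dog" also reaches "mammal"), at the cost of adding less closely related keywords.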

