Archival and retrieval of similar documents

Image analysis – Image transformation or preprocessing – Image storage or retrieval

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S306000

Reexamination Certificate

active

06263121

ABSTRACT:

TECHNICAL FIELD
This invention pertains to the archival and retrieval of documents. More specifically, the present invention is a system and method for archiving documents that enables retrieval based on attributes of the archived documents.
BACKGROUND ART
In many archival systems, documents are archived based on particular attributes of the documents. On computers, folders are often used to create a set of documents that fall in a particular category. For example, a document generated on a computer could be archived in a folder bearing the name of the client for whom it was generated with the further attribute of case number stored in the document name. Those attribute values can later used in order to retrieve the document from its storage location by executing a search for a file with a name including the case number or by searching for a folder with the client name.
A problem with archiving based on a particular set of attributes arises, however, when a document or document set that has a particular attribute is desired and the documents are not archived based on the desired attribute, making retrieval difficult and potentially expensive. This problem is exacerbated when a document is not desired for a particular attribute of the document but is instead desired based on the similarity between the attributes of the archived document and the attributes of another document. For example, if a particular form is used during a transaction, a user may wish to retrieve all forms having the same format. Since the documents may not have been archived based on the document format, the user would have to retrieve each document and individually compare it to the desired form.
The problem is even more difficult when both paper and electronic documents are involved. At the present time, electronic documents may be located based on a predefined set of attributes or properties stored by the computer. In Windows 95®, an operating system that runs on IBM-compatible personal computers, electronic documents may be searched based on values such as the date the document was modified, the size of the document, or simple text searches for words in the document. The limitations of this system, however, are that the attributes used in the search can include only those attributes that the computer has stored as part of the document properties. These limitations are evident when a search is performed in order to locate documents similar to another document.
A further problem with standard electronic archival systems is the inability to integrate the attributes of paper documents that have been converted into a digital format and the previously stored electronic documents. When a document is scanned into the computer, the computer can generate a list of associated properties only through either user input, such as an entry form that can be filled in by the user, or by creating artificial properties of the document, such as using the date on which the document was scanned as the date of creation.
What is needed, then, is a system and method for archiving both scanned paper documents and electronic documents based on attribute values located in the documents such that the documents can later be retrieved based on those attributes. What is further needed is a system and method for locating archived documents based on the similarities between the archived document and a paper or electronic document.
DISCLOSURE OF INVENTION
The present invention is a system and method for locating attribute values located in a document while enabling archival and retrieval based on these attribute values. A document (
130
) is either retrieved in, or translated into, a digital format by an input device (
120
). A master list (
150
) contains the attribute values to be searched for in the document (
130
). The master list (
150
) may include a default list of attributes or may categorize the document and provide a list of attributes accordingly. An attribute processor (
160
) locates the attributes and stores the attribute values in an attribute index (
170
). If a search for similar documents is performed, a pointer file (
180
) is created containing document pointers ordered according to the similarity between the attribute values of documents stored in a document set (
140
) and the retrieved or translated document (
130
). Pointers in the pointer file (
180
) provide a means for retrieval of the similar documents.


REFERENCES:
patent: 5926824 (1999-07-01), Hashimoto
patent: 5933548 (1999-08-01), Morisawa
patent: 5963954 (1999-10-01), Burrows
patent: 5987471 (1999-11-01), Bodine et al.
patent: 6041360 (2000-03-01), Himmel et al.
patent: 6049799 (2000-04-01), Mangat et al.
patent: 6061478 (2000-05-01), Kanoh et al.
patent: 6070157 (2000-05-01), Jacobson et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Archival and retrieval of similar documents does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Archival and retrieval of similar documents, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Archival and retrieval of similar documents will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2545732

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.