Methods and apparatus for print scraping

Image analysis – Image segmentation

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S175000, C382S181000, C382S187000, C382S276000, C382S277000, C707S793000, C707S793000, C707S793000

Reexamination Certificate

active

06546133

ABSTRACT:

BACKGROUND OF THE INVENTION
This invention relates generally to electronic exchange of information and, more particularly, to extracting information from a document provided in electronic form.
Automatically exchanging information with another party via electronic documents is difficult. Typically both parties agree on using a common set of file exchange formats, which requires both parties to implement the necessary software logic to work with the mutually agreed upon exchange formats. However, when one of the participants involves a legacy computer application, it may not be practical to actually modify the application. Information therefore is exchanged using unstructured documents available through existing mechanisms, e.g., standard reporting interfaces and messaging mechanisms. To facilitate such unstructured information exchanges, software packages are commercially available that allow users to interactively work with unstructured electronic documents, define scripts to extract pertinent data from these documents, and facilitate importing the extracted information into a software system. However, these processes tend to be manual and require human knowledge and intervention to handle the arbitrary arrival of unstructured document types.
BRIEF SUMMARY OF THE INVENTION
The present invention, in one aspect, includes systems and processes that automate receiving of unstructured information contained in electronic documents, detecting the document type, determining the corresponding document format, extracting structured information from the source document, and populating an information store with the extracted information. Generally, the electronic documents are pre-characterized and both extraction and mapping/translation details are developed as scripts on a per document type basis. These extraction and mapping/translation scripts are then automatically selected and used to automatically drive the subsequent information extraction processes.
Although print scraping is described herein in the context of financial lending, print scraping can be utilized in many other contexts. Print scraping can be used in connection with extracting information from a legacy report format. More specifically, print scraping is performed using processes that extract meaningful data from flat files from various systems in order to update a database. Since legacy systems vary in format and structure of reports, print scraping is used to parse out the required data for the database. As part of the process, the data is validated for errors and, in the context of financial lending, for example, the necessary business logic is applied for determining the credit availability for a client.


REFERENCES:
patent: 4980855 (1990-12-01), Kojima
patent: 5241674 (1993-08-01), Kuorsawa et al.
patent: 5321395 (1994-06-01), Van Santbrink
patent: 5359673 (1994-10-01), de La Beaujardiere
patent: 5832497 (1998-11-01), Taylor
patent: 5841900 (1998-11-01), Rahgozar et al.
patent: 5878398 (1999-03-01), Tokuda et al.
patent: 5956422 (1999-09-01), Alam
patent: 6009196 (1999-12-01), Mahoney
patent: 6038541 (2000-03-01), Tokuda et al.
patent: 6185604 (2001-02-01), Sekiguchi
patent: 6298357 (2001-10-01), Wexler et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Methods and apparatus for print scraping does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Methods and apparatus for print scraping, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Methods and apparatus for print scraping will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3019845

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.