Document analysis system and method

Image analysis – Image segmentation – Region labeling

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S299000, C382S282000, C358S453000

Reexamination Certificate

active

06674901

ABSTRACT:

TECHNICAL FIELD
The present invention is generally related to document analysis and, more particularly, is related to a document analysis system and method to flexibly control he analysis of a scanned document or other digital representation of a document.
BACKGROUND OF THE INVENTION
More and more documents are generated using word processors and the like and are stored on memory devices such as hard drives, floppy disks, compact disks and other mass storage media. Nonetheless, paper and other similar media will continue to be used far into the future. Consequently, there will continually be a need to scan the substance portrayed on such media so that such information may be manipulated on a computer or other like device.
However, the scanning of paper documents to make the content thereon available in a digital environment may be time consuming and costly. In particular, one problem is that the processing of various regions of scanned documents may take a long time requiring the user to wait for an analysis of a whole document. Oftentimes, a user may only want to access a portion of the text, artwork, or other region data types of the scanned document, rather than the entire document. For example, one may wish to obtain specific paragraphs of text from a document.
However, current users are often forced to wait while scan converter technology analyzes an entire document to determine the specific data types of the various regions which are ultimately applied to processing pipelines such as optical character recognition pipelines, etc.
SUMMARY OF THE INVENTION
The present invention provides a document analysis system and method. In one embodiment, the document analysis system includes a software implementation on a processor circuit, although dedicated logical circuits may be employed as well. The document analysis system includes an interim analyzer configured to perform an interim document analysis to identify a number of interim regions on a document at an initial setting of pixels-per-inch (PPI). The document system also includes a complete analyzer configured to perform a complete analysis on at least one of the interim regions at a second, higher PPI, thereby generating at least one complete region therefrom. The present invention provides significant flexibility to the user with a number of options relative to the analysis of the regions of information of interest in a document, and to limiting the analysis to such preferred regions.
The present invention can also be viewed as providing a method for controlling document region analysis. In this regard, the method can be broadly summarized by the following steps: performing an interim document analysis to identify a number of interim regions on a document at an initial pixels-per-inch (PPI); and, performing a complete analysis on at least one of the interim regions at a second, higher PPI, thereby generating at least one complete region therefrom.
The present invention has numerous advantages, a few of which are delineated hereafter as merely examples. Specifically, the present invention provides the user with a fast display of the various regions of information on a document and allows the user to control further analysis of these regions and identify the type of information contained therein before processing the regions in an appropriate processing pipeline which may use optical character recognition algorithms, etc. The present invention is also simple in design, user friendly, robust, reliable, and efficient in operation, and easily implemented for mass commercial production.


REFERENCES:
patent: 5222154 (1993-06-01), Graham et al.
patent: 5684610 (1997-11-01), Brandestini et al.
patent: 5778092 (1998-07-01), MacLeod et al.
patent: 5838836 (1998-11-01), Omvik
patent: 5862305 (1999-01-01), Girmay et al.
patent: 5987171 (1999-11-01), Wang
patent: 6002496 (1999-12-01), Weng
patent: 6011905 (2000-01-01), Huttenlocher et al.
patent: 6043823 (2000-03-01), Kodiara et al.
patent: 6134565 (2000-10-01), Hommersom et al.
patent: 6151426 (2000-11-01), Lee et al.
patent: 6239882 (2001-05-01), De Mangelaere et al.
patent: 6240205 (2001-05-01), Fan et al.
patent: 6289371 (2001-09-01), Kumpf et al.
patent: 6295388 (2001-09-01), Stokes et al.
patent: 6385351 (2002-05-01), Simske et al.
patent: 6417857 (2002-07-01), Finger et al.
patent: 6453069 (2002-09-01), Matsugu et al.
Pavlidis, et al., “Page Segmentation and Classification,” CVGIP: Graphical Models and Image Processing, vol. 54, No. 6, Nov. 1992, pp. 226-238.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Document analysis system and method does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Document analysis system and method, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Document analysis system and method will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3204320

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.