Document page analyzer and method

Image analysis – Image segmentation

Patent

Details

382/180, 707/517, G06K 9/34

Patent

active

058481842

ABSTRACT:
An apparatus and method are provided that determine the geometric and logical structure of a document page from its image. The document image is partitioned into regions (both text and non-text), which are then organized into related "article" groups in the correct reading order. The invention uses image-based features, text-based features, and assumptions based on knowledge of the expected layout to find the correct reading order of the text blocks on a document page. It can handle complex layouts that have multiple column configurations on a page and inset material (such as figures and inset text blocks). The apparatus comprises two main components: a geometric page segmentor and a logical page organizer. The geometric page segmentor partitions a binary image of a document page into fundamental units of text or non-text and produces a list of rectangular blocks, their locations on the page in points (1/72 inch), and the locations of horizontal rule lines on the page. The logical page organizer receives a list of text region locations, rule line locations, the associated ASCII text (as produced by OCR) for the text blocks, and a list of text attributes (such as face style and point size). The logical page organizer groups the components (both text and non-text) that make up a document page, sequences them in the correct reading order, and establishes the dominance pattern (e.g., finds the lead article on a newspaper page).
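
To make the data flow in the abstract more concrete, the following is a minimal Python sketch of the kind of block list a geometric segmentor might hand to a logical organizer, with locations in points (1/72 inch). It is not the patented method: the names `Block`, `pixels_to_points`, `group_into_columns`, and `naive_reading_order` are hypothetical, the top-left page origin is an assumption, and the ordering shown is a simple column heuristic that ignores rule lines, text attributes, and article grouping.

```python
from dataclasses import dataclass
from typing import List

POINTS_PER_INCH = 72  # the abstract gives block locations in points (1/72 inch)


@dataclass(frozen=True)
class Block:
    """Hypothetical rectangular page region, in points.

    Assumes the origin is the top-left corner of the page.
    """
    left: float
    top: float
    right: float
    bottom: float
    is_text: bool = True
    text: str = ""  # e.g. OCR output for a text block


def pixels_to_points(pixels: float, dpi: float) -> float:
    """Convert a pixel coordinate from a scanned image to points."""
    return pixels * POINTS_PER_INCH / dpi


def group_into_columns(blocks: List[Block], min_overlap: float = 0.5) -> List[List[Block]]:
    """Greedily group blocks whose horizontal extents overlap.

    A block joins the first column whose first block it overlaps by at
    least `min_overlap` of the narrower block's width; otherwise it
    starts a new column. Columns come out in left-to-right order.
    """
    columns: List[List[Block]] = []
    for block in sorted(blocks, key=lambda b: (b.left, b.top)):
        for column in columns:
            ref = column[0]
            overlap = min(block.right, ref.right) - max(block.left, ref.left)
            narrower = min(block.right - block.left, ref.right - ref.left)
            if narrower > 0 and overlap / narrower >= min_overlap:
                column.append(block)
                break
        else:
            columns.append([block])
    return columns


def naive_reading_order(blocks: List[Block]) -> List[Block]:
    """Read each column top to bottom, columns left to right."""
    ordered: List[Block] = []
    for column in group_into_columns(blocks):
        ordered.extend(sorted(column, key=lambda b: b.top))
    return ordered


# Two blocks stacked in a left column and one block in a right column.
page = [
    Block(324, 36, 576, 400, text="C"),
    Block(36, 220, 288, 400, text="B"),
    Block(36, 36, 288, 200, text="A"),
]
order = [b.text for b in naive_reading_order(page)]
```

A heuristic like this breaks down on exactly the cases the abstract highlights (multiple column configurations on one page, inset figures and text blocks), which is why the described organizer also draws on rule lines, text-based features, and layout knowledge.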

REFERENCES:
patent: 4730252 (1988-03-01), Bradshaw
patent: 4803651 (1989-02-01), Galkowski
patent: 4928252 (1990-05-01), Gabbe et al.
patent: 4937439 (1990-06-01), Wanninger et al.
patent: 4964030 (1990-10-01), Suzuki et al.
patent: 4965763 (1990-10-01), Zamora
patent: 4974260 (1990-11-01), Rudak
patent: 4980829 (1990-12-01), Okajima et al.
patent: 5001653 (1991-03-01), Buchanan et al.
patent: 5018083 (1991-05-01), Watanabe et al.
patent: 5038392 (1991-08-01), Morris et al.
patent: 5043891 (1991-08-01), Goldstein et al.
patent: 5073953 (1991-12-01), Westdijk
patent: 5164899 (1992-11-01), Sobotka et al.
patent: 5167016 (1992-11-01), Bagley et al.
patent: 5179650 (1993-01-01), Fukui et al.
patent: 5181162 (1993-01-01), Smith et al.
patent: 5185698 (1993-02-01), Hesse et al.
patent: 5185813 (1993-02-01), Tsujimoto
patent: 5191525 (1993-03-01), LeBrun et al.
patent: 5191613 (1993-03-01), Graziano et al.
patent: 5193147 (1993-03-01), Amari et al.
patent: 5222236 (1993-06-01), Potash et al.
patent: 5228121 (1993-07-01), Fontaine et al.
patent: 5335087 (1994-08-01), Cho
patent: 5335290 (1994-08-01), Cullen et al.
patent: 5369716 (1994-11-01), Sangu
patent: 5555362 (1996-09-01), Yamashita et al.
patent: 5613016 (1997-03-01), Saitoh
patent: 5680479 (1997-10-01), Wang et al.
patent: 5701500 (1997-12-01), Ikeo et al.
Data Sources Software Catalog, 1st Edition 1990, vol. 2, pp. J-515 to J-516 and J-583 to J-588.
S. Tsujimoto and H. Asada, "Understanding Multi-Articled Documents" in 10th International Conference on Pattern Recognition (IEEE, Atlantic City, NJ) 16-21 Jun. 1990, pp. 551-556.
J. A. Pastor and S. L. Taylor, "Recognizing Structured Forms Using Neural Networks" in International Joint Conference on Neural Networks, Seattle, WA 1991.
W. Lam and D. Niyogi, "Block Segmentation of Document Images Using the X-Y Tree Approach," Tech. Report TR88-14, Department of Computer Science, State University of New York at Buffalo, Buffalo, NY, Jun. 1988.
S. L. Taylor, M. Lipshutz and C. Weir, "Document Structure Interpretation by Integrating Multiple Knowledge Sources" in Proceedings: Symposium on Document Analysis and Information Retrieval (University of Nevada, Las Vegas, Las Vegas, NV, Mar. 16-18, 1992) pp. 58-76.
S. L. Taylor and R. Fritzson, "Registration and Region Extraction of Data from Forms" in Proceedings: 11th IAPR International Conference on Pattern Recognition, the Hague, the Netherlands, Aug. 30-Sep. 3, 1992 (IEEE Computer Society Press, Los Alamitos, CA, 1992) pp. 173-176.
P. H. Winston, "Image Understanding" in Artificial Intelligence, 2nd ed. (Addison-Wesley, Reading, MA, 1984) pp. 335-383.
S. C. Hinds, J. L. Fisher and D. P. D'Amato, "A Document Skew Detection Method Using Run-Length Encoding and the Hough Transform" in 10th International Conference on Pattern Recognition, Atlantic City, NJ, 16-21 Jun. 1990 (IEEE Computer Society Press, Los Alamitos, CA, 1990) pp. 464-468.
J. L. Fisher, S. C. Hinds and D. P. D'Amato, "A Rule-Based System for Document Image Segmentation" in 10th International Conference on Pattern Recognition, Atlantic City, NJ, 16-21 Jun. 1990 (IEEE Computer Society Press, Los Alamitos, CA, 1990) pp. 567-572.
K. Y. Wong, R. G. Casey and F. M. Wahl, "Document Analysis System" in IBM Journal of Research and Development, vol. 26, No. 6, Nov. 1982, pp. 647-656.
S. L. Taylor, R. Fritzson and J. A. Pastor, "Extraction of Data from Preprinted Forms" in Machine Vision and Applications, vol. 5, No. 3, Summer 1992, pp. 211-222.

