Method and apparatus for logically tagging of document elements

Image analysis – Image segmentation – Distinguishing text from other regions

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

382209, 382173, 358462, G06K 934

Patent

active

056994532

ABSTRACT:
A system for logically identifying document elements from a document includes an input port for inputting a signal representing the document image, a computer having a document structural model, a document white region extraction system that extracts major white regions separating and within document elements in the input document image, a major white region selecting device and a column string selection device that generate matching column string of document elements that match the extracted major white regions in a column, a column expression comparison device that selects the best matching column string and a logical tagging device that logically tags and then extracts the document elements in the document image using the best matching column string. The method for logically identifying document elements includes providing at least one structural model of a corresponding source document, each structural model including at least one column expression defining relationships between document elements of the source document. Identifying major white regions in the input document image segmenting and defining the document elements of the document image, and assembling a major white region pattern and generating at least one column string that matches the major white region pattern for each column of the input document. Then, determining the column string that most closely matches the column expression, and logically identifying each document element of the document image based on the closest matching column string.

REFERENCES:
patent: 4698779 (1987-10-01), Holden et al.
patent: 4876728 (1989-10-01), Roth
patent: 4887302 (1989-12-01), Urushibata
patent: 5046114 (1991-09-01), Zobel
patent: 5272764 (1993-12-01), Bloomberg et al.
patent: 5335290 (1994-08-01), Cullen et al.
patent: 5335298 (1994-08-01), Hevenor et al.
patent: 5444797 (1995-08-01), Spitz
patent: 5566255 (1996-10-01), Pavlidis
"Page Segmentation by White Streams", T. Pavlidis et al., First International Conference on Document Analysis and Recognition, Sep. 30-Oct. 2, 1991, St. Malo, France.
"Page Segmentation and Classification", T. Pavlidis et al., CVGIP: Graphical Models and Image Processing, vol. 54, No. 6, Nov., pp. 484-496, 1992.
"Image Segmentation by Shape-Directed Covers", Baird et al., 10th Intl. Conference on Pattern Recog., Jun. 16-21, 1990, pp. 820-825.
"A Prototype Document Image Analysis System for Technical Journals", Nagy et al., Computer, Jul., 1992, pp. 10-21.
"Approximate Matching of Regular Expressions", E. Myers et al., Bulletin of Mathematical Biology, vol. 51, No. 1, 1989, pp. 5-37.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for logically tagging of document elements does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for logically tagging of document elements , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for logically tagging of document elements will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-214875

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.