Vision-based document segmentation

Data processing: presentation processing of document – operator i – Presentation processing of document – Structured document

Reexamination Certificate

Details

C715S234000

Reexamination Certificate

active

07428700

ABSTRACT:
Vision-based document segmentation identifies one or more portions of semantic content of a document. The one or more portions are identified by identifying a plurality of visual blocks in the document, and detecting one or more separators between the visual blocks of the plurality of visual blocks. A content structure for the document is constructed based at least in part on the plurality of visual blocks and the one or more separators, and the content structure identifies the one or more portions of semantic content of the document. The content structure obtained using the vision-based document segmentation can optionally be used during document retrieval.
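The three steps named in the abstract (identify visual blocks, detect separators between them, build a content structure from both) can be illustrated with a deliberately simplified sketch. It is not the patented method: it assumes blocks are already given as vertical extents on a page, treats every gap between adjacent blocks as a horizontal separator, and uses gap height as the separator's weight. The names `Block`, `Separator`, `detect_separators`, `build_content_structure`, and the `min_weight` threshold are all hypothetical and chosen only for this illustration.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical visual block: a label plus its vertical extent in pixels."""
    label: str
    top: int
    bottom: int


@dataclass
class Separator:
    """Horizontal gap between two vertically adjacent blocks."""
    top: int
    bottom: int

    @property
    def weight(self) -> int:
        # Assumption for this sketch: a taller gap is a stronger boundary.
        return self.bottom - self.top


def detect_separators(blocks):
    """Return the gaps between consecutive blocks, ordered top to bottom."""
    ordered = sorted(blocks, key=lambda b: b.top)
    separators = []
    for upper, lower in zip(ordered, ordered[1:]):
        if lower.top > upper.bottom:
            separators.append(Separator(upper.bottom, lower.top))
    return separators


def build_content_structure(blocks, separators, min_weight):
    """Group block labels into portions split at separators of at least min_weight."""
    strong = [s for s in separators if s.weight >= min_weight]
    portions = {}
    for block in sorted(blocks, key=lambda b: b.top):
        # A block belongs to the portion below every strong separator above it.
        index = sum(1 for s in strong if s.bottom <= block.top)
        portions.setdefault(index, []).append(block.label)
    return [portions[k] for k in sorted(portions)]


blocks = [
    Block("headline", 0, 40),
    Block("body", 45, 300),
    Block("sidebar ad", 340, 400),
    Block("footer", 405, 430),
]
separators = detect_separators(blocks)
structure = build_content_structure(blocks, separators, min_weight=20)
print(structure)  # [['headline', 'body'], ['sidebar ad', 'footer']]
```

In this toy layout only the 40-pixel gap exceeds the threshold, so the page splits into two portions. A retrieval system could then index each portion separately instead of the whole page, which is the optional use the abstract mentions.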

REFERENCES:
patent: 6701015 (2004-03-01), Fujimoto et al.
patent: 6721451 (2004-04-01), Ishitani
patent: 6754885 (2004-06-01), Dardinski et al.
patent: 6798913 (2004-09-01), Toriyama
patent: 6880122 (2005-04-01), Lee et al.
patent: 7003159 (2006-02-01), Yamaai
patent: 7010745 (2006-03-01), Shimada et al.
patent: 7010746 (2006-03-01), Purvis
patent: 2002/0065842 (2002-05-01), Takagi et al.
patent: 2004/0205608 (2004-10-01), Huang
Gu, X., Chen, J., Ma, W.-Y., and Chen, G., “Visual Based Content Understanding towards Web Adaptation,” In Second International Conference on Adaptive Hypermedia and Adaptive Web-based Systems (AH2002), Spain, 2002, 10 pages.
Chen, J., Zhou, B., Shi, J., Zhang, H., and Wu, Q., “Function-Based Object Model Towards Website Adaptation,” in Proceedings of the 10th International World Wide Web Conference, 2001, 10 pages.
Yang, Y. and Zhang, H., “HTML Page Analysis Based on Visual Cues,” in 6th International Conference on Document Analysis and Recognition (ICDAR 2001), Seattle, Washington, USA, 2001, 6 pages.
Tang, Y. Y., Cheriet, M., Liu, J., Said, J. N., and Suen, C. Y., “Document Analysis and Recognition by Computers,” Handbook of Pattern Recognition and Computer Vision, edited by C. H. Chen, L. F. Pau, and P. S. P. Wang, World Scientific Publishing Company, 1999, 37 pages.
Robertson, S. E. and Walker, S., “Okapi/Keenbow at TREC-8,” in The Eighth Text REtrieval Conference (TREC 8), 1999, 11 pages.
Callan, J. P., “Passage-Level Evidence in Document Retrieval,” in Proceedings of the Seventeenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, 1994, pp. 302-310.
Cai et al., “Extracting Content Structure for Web Pages Based on Visual Representation,” Proceedings of the 5th Asia-Pacific Web Conference, Lecture Notes in Computer Science, vol. 2642, Apr. 23, 2003, pp. 406-417.
Gu et al., “Visual Based Content Understanding towards Web Adaptation,” Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, Lecture Notes in Computer Science, vol. 2347, May 29, 2002, pp. 164-173.
EPO Communication with Search Report dated May 10, 2006, from counterpart EP patent application, European Patent Application No. 04015636.6, copy attached, 2 pages.
