Reexamination Certificate
2003-07-28
2008-09-23
Hong, Stephen (Department: 2178)
Data processing: presentation processing of document, operator i
Presentation processing of document
Structured document
C715S234000
active
07428700
ABSTRACT:
Vision-based document segmentation identifies one or more portions of semantic content of a document. The one or more portions are identified by identifying a plurality of visual blocks in the document, and detecting one or more separators between the visual blocks of the plurality of visual blocks. A content structure for the document is constructed based at least in part on the plurality of visual blocks and the one or more separators, and the content structure identifies the one or more portions of semantic content of the document. The content structure obtained using the vision-based document segmentation can optionally be used during document retrieval.
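The three steps named in the abstract (identify visual blocks, detect separators between them, build a content structure from both) can be illustrated with a minimal sketch. It is not the patented method, whose details the abstract does not give. It assumes a simplified one-dimensional layout in which each block occupies a vertical range of rows, treats any sufficiently tall empty gap as a separator, and groups the blocks between separators into segments. The names `Block`, `find_separators`, `build_content_structure`, and the `min_gap` threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Block:
    """Hypothetical visual block: a label plus the rows it covers (bottom is exclusive)."""
    label: str
    top: int
    bottom: int


def find_separators(blocks: List[Block], min_gap: int) -> List[Tuple[int, int]]:
    """Return (start, end) row ranges of at least min_gap rows that no block covers."""
    separators = []
    covered_until = None  # lowest row reached by any block seen so far
    for block in sorted(blocks, key=lambda b: b.top):
        if covered_until is not None and block.top - covered_until >= min_gap:
            separators.append((covered_until, block.top))
        if covered_until is None:
            covered_until = block.bottom
        else:
            covered_until = max(covered_until, block.bottom)
    return separators


def build_content_structure(
    blocks: List[Block], separators: List[Tuple[int, int]]
) -> List[List[str]]:
    """Group block labels into segments; each separator starts a new segment."""
    segments: List[List[str]] = [[]]
    pending = sorted(separators)
    for block in sorted(blocks, key=lambda b: b.top):
        # Every separator that ends at or above this block closes the current segment.
        while pending and pending[0][1] <= block.top:
            pending.pop(0)
            if segments[-1]:
                segments.append([])
        segments[-1].append(block.label)
    return segments
```

Under these assumptions, a page with a header and navigation bar close together, an article overlapping a sidebar, and a distant footer would split into three segments. A real implementation would work in two dimensions and would likely weight separators by visual cues, which this sketch omits.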
REFERENCES:
patent: 6701015 (2004-03-01), Fujimoto et al.
patent: 6721451 (2004-04-01), Ishitani
patent: 6754885 (2004-06-01), Dardinski et al.
patent: 6798913 (2004-09-01), Toriyama
patent: 6880122 (2005-04-01), Lee et al.
patent: 7003159 (2006-02-01), Yamaai
patent: 7010745 (2006-03-01), Shimada et al.
patent: 7010746 (2006-03-01), Purvis
patent: 2002/0065842 (2002-05-01), Takagi et al.
patent: 2004/0205608 (2004-10-01), Huang
Gu, X., Chen, J., Ma, W.-Y., and Chen, G., “Visual Based Content Understanding towards Web Adaptation,” In Second International Conference on Adaptive Hypermedia and Adaptive Web-based Systems (AH2002), Spain, 2002, 10 pages.
Chen, J., Zhou, B., Shi, J., Zhang, H., and Wu, Q., “Function-Based Object Model Towards Website Adaptation,” in Proceedings of the 10th International World Wide Web Conference, 2001, 10 pages.
Yang, Y. and Zhang, H., “HTML Page Analysis Based on Visual Cues,” in 6th International Conference on Document Analysis and Recognition (ICDAR 2001), Seattle, Washington, USA, 2001, 6 pages.
Tang, Y. Y., Cheriet, M., Liu, J., Said, J. N., and Suen, C. Y., “Document Analysis and Recognition by Computers,” Handbook of Pattern Recognition and Computer Vision, edited by C. H. Chen, L. F. Pau, and P. S. P. Wang, World Scientific Publishing Company, 1999, 37 pages.
Robertson, S. E. and Walker, S., “Okapi/Keenbow at TREC-8,” in The Eighth Text REtrieval Conference (TREC 8), 1999, 11 pages.
Callan, J. P., “Passage-Level Evidence in Document Retrieval,” in Proceedings of the Seventeenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, 1994, pp. 302-310.
Cai et al., “Extracting Content Structure for Web Pages Based on Visual Representation,” Proceedings of the 5th Asia-Pacific Web Conference, Lecture Notes in Computer Science, vol. 2642, Apr. 23, 2003, pp. 406-417.
Gu et al., “Visual Based Content Understanding towards Web Adaptation,” Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, Lecture Notes in Computer Science, vol. 2347, May 29, 2002, pp. 164-173.
EPO Communication with Search Report dated May 10, 2006, from counterpart EP patent application, European Patent Application No. 04015636.6, copy attached, 2 pages.
Cai Deng
Ma Wei-Ying
Wen Ji-Rong
Yu Shipeng
Hong Stephen
Vaughn Gregory J