Generator for document with HTML tagged table having data elemen

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707509, 707518, 382176, G06F 1700

Patent

active

058931279

ABSTRACT:
Automatic generation of hypertext markup language (HTML) files based on bitmap image data, which faithfully preserves layout information of an original document from which the bitmap data was obtained. Generally, multi-column document layouts result in automatic generation of HTML files that use HTML "table tags" to display each of the different columns. More particularly, a bitmap image is obtained such as by scanning or retrieval of a pre-existing image, and the bitmap image is segmented into blocks. The location of each block is determined, each block is analyzed in preparation for insertion of appropriate data into an HTML file, and layout analysis is performed to identify layout relationships between the blocks based on the relative locations of the blocks in the bitmap image. Based on the layout relationships, a block type is determined for each block, column span and row span data for each block is determined, blocks are re-ordered if needed, and an HTML file is generated in which blocks are tagged as data elements in a row of an HTML "table tag" based on block type and based on column and row span information for the block.

REFERENCES:
patent: 4933984 (1990-06-01), Nakano et al.
patent: 5048107 (1991-09-01), Tachikawa
patent: 5065442 (1991-11-01), Kugai
patent: 5073953 (1991-12-01), Westdijk
patent: 5075895 (1991-12-01), Bessho
patent: 5086346 (1992-02-01), Fujisawa
patent: 5093868 (1992-03-01), Tanaka et al.
patent: 5101439 (1992-03-01), Kiang
patent: 5101448 (1992-03-01), Kawachiya et al.
patent: 5129012 (1992-07-01), Abe
patent: 5172422 (1992-12-01), Tan
patent: 5278918 (1994-01-01), Bernzott et al.
patent: 5278920 (1994-01-01), Bernzott et al.
patent: 5307422 (1994-04-01), Wang
patent: 5313526 (1994-05-01), Cheong
patent: 5335290 (1994-08-01), Cullen et al.
patent: 5351314 (1994-09-01), Vaezi
patent: 5359673 (1994-10-01), De La Beaujardiere
patent: 5430808 (1995-07-01), Baird et al.
patent: 5436983 (1995-07-01), Bernzott et al.
patent: 5465304 (1995-11-01), Cullen et al.
patent: 5486686 (1996-01-01), Zdybel, Jr. et al.
patent: 5530852 (1996-06-01), Meske, Jr. et al.
patent: 5555362 (1996-09-01), Yamashita et al.
patent: 5557722 (1996-09-01), DeRose et al.
patent: 5587902 (1996-12-01), Kugimiya
patent: 5588072 (1996-12-01), Wang
patent: 5594809 (1997-01-01), Kopec et al.
Powell, "A document scanning duo with real character", Window Magazine, v. 7, n. 7, p. 107(2), Jul. 1996.
Nadile, "Caere brings electronic publishing to the Internet", PC Week, v. 13, N. 26, p. 10, Jul. 1, 1996.
Itonori, "Table Structure Recognition based on Textblock Arrangement and Ruled Line Position", Proc. of Second Intl. Conf. on Document Analysis and Recognition, Oct. 20, 1993, pp. 765-768.
Lawrentini et al., "Identifying and Understanding Tabular Material in Compound Documents", 11.sup.th IAPR Intl. Conf. on Pattern Recognition, pp. 405-409, Aug. 30, 1993.
Saitoh, "Document Image Segmentation and Text Area Ordering", Second Intl. Conf. on Document Analysis and Recognition, Oct. 20, 1993, pp. 323-329.
Hirayama, "A Block Segmentation Method for Document Images with Complicated Column Structures", Second Intl. Conf. on Document Analysis and Recognition, Oct. 20, 1993, pp. 739-742.
Okamoto et al., "A Hybrid Page Segmentation Method", Second Intl. Conf. on Document Analysis and Recognition, Oct. 20, 1993, pp. 743-746.
Gann, Roger, "Caere improves its OCR interface", PC User, Aug. 21-Sep. 3, 1996, p. 44.
Gann, Roger, "Accurate OCR for complex pages", PC User, Oct. 2-15, 1996, p. 50.
Park, Young Seak, "A hierarchical Method for block Segmentation and Classification of General Document Images", Systems and Computer in Japan, vol. 24, No. 9, 1993, pp. 84-96.
Tang, Yuan Y., et al., "Automatic Document Processing: A Survey", Pattern Recognition, vol. 29, No. 12, Dec. 1996, pp. 1931-1952.
Isao Masuda, et al., "Approach to Smart Document Reader System", Proc. of IEEE Comp. Society Conf. on Computer Vision and Pattern Recognition, Jun. 1985, pp. 550-557.
Hiroshi Makino, "Representation And Segmentation Of Document Images", Proc. of IEEE Comp. Society Conf. on Computer Vision and Pattern Recognition, Jun. 1983, pp. 291-296.
W. Doster, "A Step Towards Intelligent Document Input To Computers", et al., Proc. of IEEE Comp. Society Conf. on Computer Vision and Pattern Recognition, Jun. 1983, pp. 515-516.
Qin Luo, et al., "A Structure Recognition Method For Japanese Newspapers", Symposium on Document Analysis and Information Retrieval, Mar., 1992 pp. 217-234.
L.A. Fletcher, et al., "A Robust Algorithm For Text String Separation From Mixed Text/Graphics Images", IEEE Transactions On Pattern Analysis and Machine Intelligence, vol. 10, No. 6, Nov., 1988, pp. 910-918.
Osamu Iwaki, et al., "A Segmentation Method Based On Office Document Hierarchical Structure", Proceedings of the 1987 IEEE International Conference on Systems, Man, and Cybernetics, vol. 2, pp. 759-763.
K.Y. Wong, et al., "Document Analysis System", IBM J. Res. Develop., vol. 26, No. 6, Nov., 1982, pp. 647-656.
D.J. Ittner, "Automatic Inference of Textline Orientation", Proccedings, Second Annual Symposium on Document Analysis & Information Retrieval, Apr. 1993, pp. 123-133.
Tsujimoto, et al., "Understanding Multi-articled Documents," 10th Int'l Conf. on Pattern Recognition, IEEE, vol. 1, Jun. 16-21, 1990, pp. 551-556.
J. Fisher, et al., "A Rule-Based System For Document Image Segmentation", IEEE Proceedings of 10th International Conference on Pattern Recognition, 1990, pp. 567-572.
T. Akiyama, et al., "Automated Entry System For Printed Documents", Pattern Recognition, vol. 23, No. 11, pp. 1141-1154, 1990.
Stuart C. Hinds, et al., "A Document Skew Detection Method Using Run-Length Encoding And The Hough Transform", Proc. of 10th Intl. Conf. on Pattern Recognition, vol. 1, pp. 464-468, Jun. 1990.
Philip J. Bones, et al., "Segmentation Of Document Images", SPIE vol. 1258 Image Communications and Workstations (1990), pp. 78-88.
Yamada, et al., "Document Image Processing Based on Enhanced Border Following Algorithm", IEEE Proceedings of the 10th International Conference on Pattern Recognition, vol. 2, Jun. 21, 1990, pp. 231-236.
Mizuno, et al., "Document Recognition System With Layout Structure Generator", NEC Research And development, vol. 32, No. 3, Jul. 1991, pp. 430-437.
Pizano, et al., "A Business Form Recognition System", COMPSAC91 Proceedings, The Fifteenth Annual International Computer Software & Applications Conference, Sep. 13, 1991, pp. 626-632.
"Line Segmentation Method For Documents In European Languages", IBM Technical Disclosure Bulletin, vol. 33, No. 1B, Jun. 1990, pp. 207-210.
Cordi, V. A., "Virtual Memory Hierarchy", IBM Technical Disclosure Bulletin, vol. 21, No. 10, Mar. 1979, pp. 4001-4004.
Inagaki, et al., "Macsym: A Hierarchical Parallel Image Processing System For Event-Driven Pattern Understanding Of Documents", Pattern Recognition, vol. 17, No. 1, 1984, pp. 85-108.
Y. Tang, et al., "Document Analysis And Understanding: A Brief Survey", ICDAR, First International Conference on Document Analysis and Recognition, France, Sep. 30 -Oct. 2, 1991, pp. 17-31.
G. Nagy, et al., "A Prototype Document Image Analysis System For Technical Journals", Computer, Jul. 1992, pp. 10-22.
"Converters to and from HTML ", HTML Converters, Aug. 1, 1996 (9 pages).
Drakos, Nikos, "From text to hypertext: A post-hoc rationalisation of LaTeX2HTML", Computer Networks and ISDN Systems 27, Jan. 1994, pp. 215-224.
Methvin, David, "App Adapts Help to HTML", Windows Magazine, Jul. 1996, p. 114.
"Web Tools: Authoring and Browser Plug-ins", PC Computing Best, Aug. 1996, pp. 134 and 136.
"TextBridge Pro -Getting Started", The Document Company Xerox, Mar. 1996.
"Caere OmniPage Pro for Windows 95", Caere Corporation, Mar. 1996.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Generator for document with HTML tagged table having data elemen does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Generator for document with HTML tagged table having data elemen, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Generator for document with HTML tagged table having data elemen will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1381758

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.