Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-07-25
1999-09-07
Amsbury, Wayne
Data processing: database and file management or data structures
Database design
Data structure types
707 4, 707 6, 707104, 455 4, G06F 1730
Patent
active
059501960
ABSTRACT:
Tables form an important kind of data element in text retrieval. Often, the gist of an entire news article or other exposition can be concisely captured in tabular form. Information other than the key words in a digital document can be exploited to provide the users with more flexible and powerful query capabilities. More specifically, the structural information in a document is exploited to identify tables and their component fields and let the users query based on these fields. Component fields can include table lines, caption lines, row headings, column headings, or other table components. Empirical results have demonstrated that heuristic method based table extraction and component tagging can be performed effectively and efficiently. Moreover, experiments in retrieval using the system of the present invention strongly indicate that such structural decomposition can facilitate better representation of user's information needs and hence more effective retrieval of tables.
REFERENCES:
patent: 5265056 (1993-11-01), Turtle
patent: 5418948 (1995-05-01), Turtle
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5519857 (1996-05-01), Kato et al.
patent: 5523942 (1996-06-01), Tyler et al.
patent: 5742816 (1998-04-01), Barr et al.
patent: 5754939 (1998-05-01), Herz et al.
Callan et al., "The INQUERY Retrieval System," In Proceedings of the Third International Conference on Database and Expert Systems Applications, Valencia, Spain, 1992, Springer-Verlag, pp. 78-83.
Callan et al., "Tipster text Phase 2 Activities: The University of Massachusetts at Amherst," Tipster Text Phase II 24-Month Workshop, University of Massachusetts, Amherst, Massachusetts, May 5-8, 1996, 17 pages.
Fujisawa et al., "Segmentation Methods for Character Recognition: From Segmentation to Document Structure Analysis," Proceedings of the Institute of Electrical and Electronics Engineers, Vol. 80, No. 7, Jul., 1992, pp. 1079-1092.
Nagy et al., "A Prototype Document Image Analysis System for Technical Journals," Computer, Jul., 1992, pp. 10-22.
Ponte et al., "Useg: A Retargetable Word Segmentation Procedure for Information Retrieval," University of Massachusetts Technical Report TR 96-2, presented at Symposium on Document Analysis and Information Retrieval '96 (SDAIR), Las Vegas, Nevada, Apr. 15, 1996, 10 pages.
Pyreddy et al., "TINTIN: A System for Retrieval in Text Tables," presented at Digital Libraries '97 Conference, Philadelphia, PA, Jul. 25, 1997, 12 pages.
Rus et al., "Does Navigation Require More Than One Compass?" Presented at American Association for Artificial Intelligence Fall 1995 Symposium, Boston, MA, Nov. 11, 1995, pp. 1-7.
Wang et al., "Classification of Newspaper Image Blocks Using Texture Analysis," Computer Vision, Graphics, and Image Processing 47, 1989, pp. 327-352.
Croft W. Bruce
Pyreddy Pallavi
Amsbury Wayne
Haven Thu-Thao
Sovereign Hill Software, Inc.
LandOfFree
Systems and methods for retrieving tabular data from textual sou does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Systems and methods for retrieving tabular data from textual sou, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Systems and methods for retrieving tabular data from textual sou will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1815461