Systems and methods for retrieving tabular data from textual sou

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Details

707/4, 707/6, 707/104, 455/4, G06F 17/30

Patent

active

059501960

ABSTRACT:
Tables form an important kind of data element in text retrieval. Often, the gist of an entire news article or other exposition can be concisely captured in tabular form. Information other than the keywords in a digital document can be exploited to give users more flexible and powerful query capabilities. More specifically, the structural information in a document is exploited to identify tables and their component fields and to let users query on those fields. Component fields can include table lines, caption lines, row headings, column headings, or other table components. Empirical results have demonstrated that heuristic-based table extraction and component tagging can be performed effectively and efficiently. Moreover, retrieval experiments using the system of the present invention strongly indicate that such structural decomposition facilitates better representation of users' information needs and hence more effective retrieval of tables.
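The abstract does not spell out the heuristics themselves, so the following Python sketch is only an illustration of the general idea: find table lines in plain text, tag a caption, column headings, and row headings, and then query on one of those fields. Every rule and name in it is an assumption made for the example (a column gap of two or more spaces, a minimum of three cells per table line, the nearest preceding non-blank line as caption, the hypothetical `Table`, `extract_tables`, and `search` names); it is not the patented method.

```python
import re
from dataclasses import dataclass, field

# Assumption: columns are separated by runs of two or more whitespace characters.
CELL_GAP = re.compile(r"\s{2,}")


def split_cells(line):
    """Split a line into cells on the assumed column gap."""
    return [c for c in CELL_GAP.split(line.strip()) if c]


def is_table_line(line):
    """Assumed heuristic: a table line has at least three cells."""
    return len(split_cells(line)) >= 3


@dataclass
class Table:
    caption: str = ""
    column_headings: list = field(default_factory=list)
    row_headings: list = field(default_factory=list)
    table_lines: list = field(default_factory=list)


def extract_tables(text):
    """Group consecutive table lines into tables and tag their components."""
    lines = text.splitlines()
    tables = []
    i = 0
    while i < len(lines):
        if not is_table_line(lines[i]):
            i += 1
            continue
        start = i
        while i < len(lines) and is_table_line(lines[i]):
            i += 1
        block = lines[start:i]
        if len(block) < 2:
            continue  # a single aligned line is not treated as a table
        # Assumed caption rule: nearest non-blank line above the block.
        j = start - 1
        while j >= 0 and not lines[j].strip():
            j -= 1
        caption = lines[j].strip() if j >= 0 else ""
        tables.append(Table(
            caption=caption,
            column_headings=split_cells(block[0]),
            row_headings=[split_cells(row)[0] for row in block[1:]],
            table_lines=block,
        ))
    return tables


def search(tables, field_name, term):
    """Return tables whose named field contains the term (case-insensitive)."""
    term = term.lower()
    hits = []
    for table in tables:
        value = getattr(table, field_name)
        texts = [value] if isinstance(value, str) else value
        if any(term in text.lower() for text in texts):
            hits.append(table)
    return hits


if __name__ == "__main__":
    sample = (
        "Quarterly results were mixed.\n\n"
        "Table 1. Revenue by region\n"
        "Region   1996   1997\n"
        "East     10     12\n"
        "West     8      9\n\n"
        "Analysts expect growth.\n"
    )
    for table in search(extract_tables(sample), "caption", "revenue"):
        print(table.caption, table.column_headings, table.row_headings)
```

Restricting a query to a single field, such as captions or row headings, is what lets a user separate a table about revenue from a table that merely mentions the word in one cell.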

REFERENCES:
patent: 5265056 (1993-11-01), Turtle
patent: 5418948 (1995-05-01), Turtle
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5519857 (1996-05-01), Kato et al.
patent: 5523942 (1996-06-01), Tyler et al.
patent: 5742816 (1998-04-01), Barr et al.
patent: 5754939 (1998-05-01), Herz et al.
Callan et al., "The INQUERY Retrieval System," In Proceedings of the Third International Conference on Database and Expert Systems Applications, Valencia, Spain, 1992, Springer-Verlag, pp. 78-83.
Callan et al., "Tipster text Phase 2 Activities: The University of Massachusetts at Amherst," Tipster Text Phase II 24-Month Workshop, University of Massachusetts, Amherst, Massachusetts, May 5-8, 1996, 17 pages.
Fujisawa et al., "Segmentation Methods for Character Recognition: From Segmentation to Document Structure Analysis," Proceedings of the Institute of Electrical and Electronics Engineers, Vol. 80, No. 7, Jul., 1992, pp. 1079-1092.
Nagy et al., "A Prototype Document Image Analysis System for Technical Journals," Computer, Jul., 1992, pp. 10-22.
Ponte et al., "Useg: A Retargetable Word Segmentation Procedure for Information Retrieval," University of Massachusetts Technical Report TR 96-2, presented at Symposium on Document Analysis and Information Retrieval '96 (SDAIR), Las Vegas, Nevada, Apr. 15, 1996, 10 pages.
Pyreddy et al., "TINTIN: A System for Retrieval in Text Tables," presented at Digital Libraries '97 Conference, Philadelphia, PA, Jul. 25, 1997, 12 pages.
Rus et al., "Does Navigation Require More Than One Compass?" Presented at American Association for Artificial Intelligence Fall 1995 Symposium, Boston, MA, Nov. 11, 1995, pp. 1-7.
Wang et al., "Classification of Newspaper Image Blocks Using Texture Analysis," Computer Vision, Graphics, and Image Processing 47, 1989, pp. 327-352.
