Automatically extracting data from semi-structured documents

Data processing: database and file management or data structures – Database and file access

Reexamination Certificate

Details

active

08041695

ABSTRACT:
This description provides tools and techniques for automatically extracting data from semi-structured documents. A computer-readable storage medium may contain computer-executable instructions that, when executed by a computer, cause the computer to receive a request for data representing an inferred structure of an input document. For the request, the computer may determine whether a repository containing mined information includes the requested data. If the repository contains the requested data, the computer may return the data representing the inferred structure of the input document in response to the request.
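The request flow described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the names `InferredStructure`, `MinedRepository`, and `handle_request` are hypothetical, the repository is assumed to be a simple in-memory mapping keyed by a document identifier, and returning `None` on a miss is an assumption, since the abstract only describes the case where the repository contains the requested data.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class InferredStructure:
    """Hypothetical container for the structure inferred from one document."""
    document_id: str
    fields: Dict[str, str] = field(default_factory=dict)


class MinedRepository:
    """Hypothetical in-memory repository of mined information."""

    def __init__(self) -> None:
        self._store: Dict[str, InferredStructure] = {}

    def add(self, structure: InferredStructure) -> None:
        # Store the mined structure under its document identifier.
        self._store[structure.document_id] = structure

    def contains(self, document_id: str) -> bool:
        return document_id in self._store

    def get(self, document_id: str) -> InferredStructure:
        return self._store[document_id]


def handle_request(
    repo: MinedRepository, document_id: str
) -> Optional[InferredStructure]:
    """Return the inferred structure if the repository already holds it.

    The abstract does not say what happens on a miss; returning None
    here is an assumption made for this sketch.
    """
    if repo.contains(document_id):
        return repo.get(document_id)
    return None
```

A caller would populate the repository with previously mined structures and then pass each incoming request to `handle_request`, treating a `None` result as "not yet mined".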

REFERENCES:
patent: 7197503 (2007-03-01), Palanisamy et al.
patent: 2003/0140311 (2003-07-01), Lemon et al.
patent: 2004/0205463 (2004-10-01), Darbie
patent: 2005/0108267 (2005-05-01), Gibson et al.
patent: 2009/0030671 (2009-01-01), Kwon et al.
patent: 200975625 (2009-04-01), None
patent: 100814079 (2008-03-01), None
patent: 0020985 (2000-04-01), None
patent: 0203210 (2002-01-01), None
patent: 03014966 (2003-02-01), None
patent: 2008040046 (2008-04-01), None
Pearson Education, Anatomy of an XSLT Stylesheet, Pearson Education 2001, pp. 2-23.
Gronim et al., Extracting Names from Websites Containing Lists of People, Carnegie Mellon University, Mar. 31, 2001, pp. 1-20.
Maly et al., Exploiting Dynamic Validation for Document Layout Classification During Metadata Extraction, IADIS International Conference 2007, pp. 261-268.
Reference material printed from website address: http://www.PDFlib.com on Jul. 3, 2008, 1 page.
L. Eikvil. Technical Report 945 entitled “Information Extraction from World Wide Web - A Survey,” Norwegian Computing Center, Oslo, Norway, dated Jul. 1999; 40 pages.
A.H.F. Laender, B.A. Ribeiro-Neto, A.S. da Silva, and J.S. Teixeira. Abstract entitled “A Brief Survey of Web Data Extraction Tools,” SIGMOD Record, 31(2):84-93, dated Jun. 2002; 10 pages.
Omar Alonso, Sandeepan Banerjee, Steve Buxton, Roger Ford, and Richard Pitts, Technical Paper entitled “An Oracle Technical White Paper,” dated Apr. 2005; 34 pages.
Tak-Lam Wong, and Wai Lam, Article entitled “Adapting Web Information Extraction Knowledge via Mining Site-Invariant and Site-Dependent Features,” vol. 7, Issue 1, Publication date Feb. 2007 in ACM Transactions on Internet Technology.
Search Report for UK Application No. GB0906700.0 dated Aug. 6, 2009.
