Data processing: database and file management or data structures – Database and file access
Reexamination Certificate
2008-04-18
2011-10-18
Ehichioya, Fred I (Department: 2156)
active
08041695
ABSTRACT:
This description provides tools and techniques for automatically extracting data from semi-structured documents. A computer-readable storage medium may contain computer-executable instructions that, when executed by a computer, cause the computer to receive a request for data representing an inferred structure of an input document. For the request, the computer may determine whether a repository containing mined information includes the requested data. If the repository contains the requested data, the computer may return the data representing the inferred structure of the input document in response to the request.
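The request flow in the abstract can be illustrated with a minimal sketch. Everything named here (`MinedRepository`, `handle_structure_request`, the document identifier, and the dictionary used as the "inferred structure") is a hypothetical stand-in and is not taken from the patent. The abstract does not say what happens when the repository lacks the requested data, so this sketch simply returns `None` in that case as an assumption.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class MinedRepository:
    """Hypothetical in-memory repository of mined information.

    Maps a document identifier to data representing that document's
    inferred structure. The storage layout is an assumption.
    """
    _entries: Dict[str, dict] = field(default_factory=dict)

    def store(self, document_id: str, structure: dict) -> None:
        self._entries[document_id] = structure

    def contains(self, document_id: str) -> bool:
        return document_id in self._entries

    def fetch(self, document_id: str) -> dict:
        return self._entries[document_id]


def handle_structure_request(repo: MinedRepository,
                             document_id: str) -> Optional[dict]:
    """Answer a request for the inferred structure of an input document.

    Mirrors the abstract: check whether the repository includes the
    requested data and, if so, return it. Returning None on a miss is
    an assumption made for this sketch only.
    """
    if repo.contains(document_id):
        return repo.fetch(document_id)
    return None


if __name__ == "__main__":
    repo = MinedRepository()
    repo.store("invoice-001", {"fields": ["date", "total"]})
    print(handle_structure_request(repo, "invoice-001"))
    print(handle_structure_request(repo, "unknown-doc"))
```

Keeping the repository check separate from the fetch follows the two steps the abstract describes: first determine whether the data is present, then return it in response to the request.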
REFERENCES:
patent: 7197503 (2007-03-01), Palanisamy et al.
patent: 2003/0140311 (2003-07-01), Lemon et al.
patent: 2004/0205463 (2004-10-01), Darbie
patent: 2005/0108267 (2005-05-01), Gibson et al.
patent: 2009/0030671 (2009-01-01), Kwon et al.
patent: 200975625 (2009-04-01), None
patent: 100814079 (2008-03-01), None
patent: 0020985 (2000-04-01), None
patent: 0203210 (2002-01-01), None
patent: 03014966 (2003-02-01), None
patent: 2008040046 (2008-04-01), None
Pearson Education, Anatomy of an XSLT Stylesheet, Pearson Education 2001, pp. 2-23.
Gronim et al., Extracting Names from Websites Containing Lists of People, Carnegie Mellon University, Mar. 31, 2001, pp. 1-20.
Maly et al, Exploiting Dynamic Validation for Document Layout Classification During Metadata Extraction, IADIS International Conference 2007; pp. 261-268.
Reference material printed from website address: http://www.PDFlib.com on Jul. 3, 2008, 1 page.
L. Eikvil. Technical Report 945 entitled “Information Extraction from World Wide Web—A Survey,” Norwegian Computing Center, Oslo, Norway, dated Jul. 1999; 40 pages.
A.H.F. Laender, B.A. Ribeiro-Neto, A.S. da Silva, and J.S. Teixeira. Abstract entitled “A Brief Survey of Web Data Extraction Tools,” SIGMOD Record, 31(2):84-93, dated Jun. 2002; 10 pages.
Omar Alonso, Sandeepan Banerjee, Steve Buxton, Roger Ford, and Richard Pitts, Technical Paper entitled “An Oracle Technical White Paper,” dated Apr. 2005; 34 pages.
Tak-Lam Wong, and Wai Lam, Article entitled “Adapting Web Information Extraction Knowledge via Mining Site-Invariant and Site-Dependent Features,” vol. 7, Issue 1, Publication date Feb. 2007 in ACM Transactions on Internet Technology.
Search Report for UK Application No. GB0906700.0 dated Aug. 6, 2009.
Hope Baldauff Hartman LLC
The Boeing Company