Method for content mining of semi-structured documents

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C709S202000

Reexamination Certificate

active

06912555

ABSTRACT:
Embodiments of the present invention are directed to a method for content mining of semi-structured documents. In one embodiment, a semi-structured document is first converted from a document-type specific format such as HTML or PDF, to a document-type independent format such as XML. The document formatting, which contains basic level information about the document's structure, is then analyzed by a series of modules to develop a higher level understanding of the document's structure. These modules append information to the document describing the features which collectively comprise the higher level document structure. The appended information facilitates finding specified information within the document when content mining is performed.

REFERENCES:
patent: 6470352 (2002-10-01), Yaginuma
patent: 6473741 (2002-10-01), Baker
patent: 6477538 (2002-11-01), Yaginuma et al.
patent: 6477565 (2002-11-01), Daswani et al.
patent: 6640244 (2003-10-01), Bowman-Amuah
patent: 6640249 (2003-10-01), Bowman-Amuah
patent: 6711585 (2004-03-01), Copperman et al.
patent: 6718338 (2004-04-01), Vishnubhotla
patent: 6721795 (2004-04-01), Eldreth
patent: 6745185 (2004-06-01), Lee et al.
patent: 6775671 (2004-08-01), de Lara et al.
patent: 6836773 (2004-12-01), Tamayo et al.
patent: 6865582 (2005-03-01), Obradovic et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for content mining of semi-structured documents does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for content mining of semi-structured documents, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for content mining of semi-structured documents will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3507636

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.