Document descriptor extraction method

Data processing: presentation processing of document – operator i – Presentation processing of document – Layout

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C715S252000

Reexamination Certificate

active

07080314

ABSTRACT:
The present invention discloses a document descriptor extraction method and system. The document descriptor extraction method and system creates a document descriptor by generalizing input sequences within a document; factoring the input sequences and generalized input sequences; and selecting a document descriptor from the input sequences, generalized sequences, and factored sequences, preferably using minimum descriptor length (MDL) principles. Novel algorithms are employed to perform the generalizing, factoring, and selecting.

REFERENCES:
patent: 4876720 (1989-10-01), Kaneko et al.
patent: 5299206 (1994-03-01), Beaverson et al.
patent: 5812999 (1998-09-01), Tateno
patent: 5926823 (1999-07-01), Okumura et al.
patent: 5930746 (1999-07-01), Ting
patent: 5977890 (1999-11-01), Rigoutsos et al.
patent: 6061697 (2000-05-01), Nakao
patent: 6078884 (2000-06-01), Downey
patent: 6092065 (2000-07-01), Floratos et al.
patent: 6108666 (2000-08-01), Floratos et al.
patent: 6134512 (2000-10-01), Barrett
patent: 6167523 (2000-12-01), Strong
patent: 6202072 (2001-03-01), Kuwahara
patent: 6282681 (2001-08-01), Sun et al.
patent: 6330574 (2001-12-01), Murashita
patent: 6373971 (2002-04-01), Floratos et al.
patent: 6438540 (2002-08-01), Nasr et al.
patent: 6487566 (2002-11-01), Sundaresan
patent: 6507856 (2003-01-01), Chen et al.
patent: 6515978 (2003-02-01), Buehrer et al.
patent: 6519617 (2003-02-01), Wanderski et al.
patent: 6532556 (2003-03-01), Wong et al.
patent: 6553072 (2003-04-01), Chiang et al.
patent: 6569207 (2003-05-01), Sundaresan
patent: 6604099 (2003-08-01), Chung et al.
patent: 6651059 (2003-11-01), Sundaresan et al.
patent: 6675219 (2004-01-01), Leppinen et al.
patent: 6718317 (2004-04-01), Wang et al.
patent: 6766330 (2004-07-01), Chen et al.
patent: 6779154 (2004-08-01), Nussbaum et al.
patent: 6810398 (2004-10-01), Moulton
patent: 6912538 (2005-06-01), Stapel et al.
patent: 2001/0011287 (2001-08-01), Goto et al.
patent: 2001/0027459 (2001-10-01), Royal
patent: 2002/0002566 (2002-01-01), Gajraj
patent: 2002/0085032 (2002-07-01), Fong et al.
patent: 2003/0056193 (2003-03-01), Perycz et al.
patent: 2003/0208473 (2003-11-01), Lennon
patent: 2004/0039993 (2004-02-01), Kougiouris et al.
patent: 2004/0133569 (2004-07-01), Munetsugu et al.
Papakonstantinou et al., DTD Inferrence for Views of XML Data, ACM May 2000, pp. 35-46.
Moh et al., Re-Engineering Structures from Web Documents, ACM Jun. 2, 2000, pp. 67-76.
Dodge, Using SGML to Streamline Print and CD-ROM Production, CD-ROM Professional, Mar. 1994, vol. 7, iss. 2, p. 77, 5 pg.
Ashish et al., Wrapper Generation for Semi-Structured Internet Sources, ACM Dec. 1997, pp. 8-15.
Wallace et al., Haskell and XML: Generic Combinators or Type-Based Translation?, ACM 1999, pp. 148-159.
Bergamaschi et al., An Approach for the Extraction of Information from Heterogeneous Sources of Textual Data, Google Augus 1997, pp. 1-7.
Poulin et al., The Oher Formalization of Law: SGML Modelling and Tagging, ACM 1997, pp. 82-88.
Adelberg, NoDoSE—a Tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents, ACM Jun. 1998, pp. 283-294.
www.alphaworks.ibm.com/aw.nsf/techmain/DDbE: Data Descriptors by Example; Application Development, Java, XML; Leonard Berman and Angel Diaz; Posted Jun. 11, 1999; Updated Mar. 8, 2000; Printed May 12, 2000.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Document descriptor extraction method does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Document descriptor extraction method, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Document descriptor extraction method will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3547687

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.