Taxonomy generation for electronic documents

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

active

10233019

ABSTRACT:
Systems and techniques to generate a term taxonomy for a collection of documents and to fill the taxonomy with documents from the collection. In general, in one implementation, the technique includes: extracting terms from a plurality of documents; generating term pairs from the terms; ranking the terms in each term pair based on their relative specificity; aggregating the ranks of the terms in each term pair; selecting term pairs based on the aggregate rankings; and generating a term hierarchy from the selected term pairs.
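The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how terms are extracted, which specificity measures are used, or how ranks are aggregated, so everything below is an assumption made for illustration. Terms are taken to be lowercase whitespace-separated tokens, term pairs are terms that co-occur in a document, the two specificity measures are document frequency and the number of distinct co-occurring terms, ranks are aggregated as a sum of votes, and the names `extract_terms`, `build_hierarchy`, and `min_agreement` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def extract_terms(doc):
    """Assumption: a 'term' is a lowercase whitespace-separated token."""
    return set(doc.lower().split())


def sign(x):
    """Return 1, 0, or -1 according to the sign of x."""
    return (x > 0) - (x < 0)


def build_hierarchy(docs, min_agreement=2):
    doc_terms = [extract_terms(d) for d in docs]

    postings = defaultdict(set)    # term -> indices of documents containing it
    neighbours = defaultdict(set)  # term -> distinct terms it co-occurs with
    pairs = set()                  # term pairs that share at least one document
    for i, terms in enumerate(doc_terms):
        for t in terms:
            postings[t].add(i)
            neighbours[t] |= terms - {t}
        pairs.update(combinations(sorted(terms), 2))

    # Two illustrative specificity measures (assumptions, not from the source):
    # a higher value is read as "more general".
    measures = [
        lambda t: len(postings[t]),    # document frequency
        lambda t: len(neighbours[t]),  # breadth of co-occurrence
    ]

    # Rank the terms in each pair under every measure, then sum the votes.
    candidates = defaultdict(set)  # specific term -> possible general terms
    for a, b in pairs:
        score = sum(sign(m(a) - m(b)) for m in measures)
        if abs(score) >= min_agreement:  # keep pairs the measures agree on
            general, specific = (a, b) if score > 0 else (b, a)
            candidates[specific].add(general)

    # Attach each term to its most specific candidate parent to get a tree.
    hierarchy = defaultdict(list)
    for child, parents in candidates.items():
        parent = min(parents, key=lambda t: (len(postings[t]), t))
        hierarchy[parent].append(child)
    return {p: sorted(c) for p, c in hierarchy.items()}


docs = [
    "database index",
    "database query",
    "database index btree",
    "database index hash",
]
tree = build_hierarchy(docs)
print(tree)
```

On this toy collection the sketch places "index" and "query" under "database", and "btree" and "hash" under "index". Requiring both measures to agree (`min_agreement=2`) stands in for selecting pairs by aggregate rank; a real system would likely use phrase extraction and more robust statistics.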

REFERENCES:
patent: 6253202 (2001-06-01), Gilmour
patent: 6360227 (2002-03-01), Aggarwal et al.
patent: 6438543 (2002-08-01), Kazi et al.
patent: 6442545 (2002-08-01), Feldman et al.
patent: 6446061 (2002-09-01), Doerre et al.
patent: 6523026 (2003-02-01), Gillis
patent: 6594658 (2003-07-01), Woods
patent: 6640224 (2003-10-01), Chakrabarti
patent: 6665681 (2003-12-01), Vogel
patent: 6701311 (2004-03-01), Biebesheimer et al.
patent: 6704729 (2004-03-01), Klein et al.
patent: 6732090 (2004-05-01), Shanahan et al.
patent: 6772170 (2004-08-01), Pennock et al.
patent: 6901398 (2005-05-01), Horvitz et al.
patent: 6910037 (2005-06-01), Gutta et al.
patent: 6941297 (2005-09-01), Carmel et al.
patent: 7003442 (2006-02-01), Tsuda
patent: 7003516 (2006-02-01), Dehlinger et al.
patent: 2001/0013029 (2001-08-01), Gilmour
patent: 2003/0014405 (2003-01-01), Shapiro et al.
patent: 2003/0033300 (2003-02-01), Bergman et al.
patent: 2003/0078913 (2003-04-01), McGreevy
patent: 2003/0093423 (2003-05-01), Larason et al.
patent: 2004/0148155 (2004-07-01), Vogel
patent: 2004/0230577 (2004-11-01), Kawatani
patent: 2005/0097628 (2005-05-01), Lussier et al.
Feldman et al, Text Mining at the Term Level, pp. 65-73 Year of Publication: 1998 ISBN: 3-540-650687.
Fabrizio Sebastiani. Machine learning in automated text categorization, ACM Press, vol. 34, Issue 1, Mar. 2002, pp. 1-47.
Chuang et al. Taxonomy generation for text segments: A practical web-based approach, ACM Press, vol. 23, Issue 4, Oct. 2005, pp. 363-396.
Chan et al. Efficient filtering of XML documents with Xpath expressions, vol. 11, Issue 4, 2002, pp. 354-379.
Caraballo, Sharon A. Automatic Construction of a Hypernym-Labeled Noun Hierarchy from Text, Ph.D. Thesis, Brown University, May, 2001, Providence, Rhode Island (USA). Retrieved from the internet: URL:http://www.cs.georgeotwn.edu/{caraball/CaraballoThesis.ps, retrieved on Feb. 18, 2004.
Sanderson, et al. “Deriving concept hierarchies from text”, Proceedings of SIGIR '99, 22nd International Conference on Research and Development in Information Retrieval, Berkeley, California, (USA), Aug. 1999, Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, New York, NY, Aug. 1999, pp. 206-213.
Dunning, Mark. “Accurate Methods for the Statistics of Surprise and Coincidence”, Computational Linguistics, 19.1 (Mar. 1993), pp. 61-74.
Rigau, German, et al., “Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation”, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, Spain, Jul. 7-12, 1997, pp. 48-55.
Roark, Brian, et al., “Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction”, Proceedings of the Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Aug. 1998, vol. 1, pp. 1110-1116.
Hearst, Marti A., “Automatic Acquisition of Hyponyms from Large Text Corpora”, Proceedings of the Fourteenth International Conference on Computational Linguistics, Nantes, France, Jul. 1992, pp. 539-545, Retrieved from the internet URL:http://citeseer.nj.nec.com/hearts92automatic.html, retrieved on Feb. 5, 2004.
Miller, et al., “Introduction to WordNet: An On-Line Lexical Database”, International Journal of Lexicography, vol. 3, No. 4, pp. 235-244, 1990.
Ted Dunning, Accurate Methods for the Statistics of Surprise and Coincidence, 19 Association for Computational Linguistics, 61-74 (1993).
Brian Roark, et al., Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction, Proceedings of the Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (Oct. 8, 1998), 2, 1110-1116.
Sharon A. Caraballo, Automatic Acquisition of a Hypernym-Labeled Noun Hierarchy from Text, PhD Thesis (May 2001), Brown University, Providence, USA.
Jinxi Xu and W. Bruce Croft. Improving the Effectiveness of Information Retrieval with Local Context Analysis, Computer Science Department, University of Massachusetts.
O.R. Zaiane et al., “On-Line Resource Discovery Using Natural Language” in Proceedings of RIAO'97, Montreal, Canada, Jun. 25-27, 1997.

Profile ID: LFUS-PAI-O-3784615
