Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2007-09-04
2007-09-04
Rimell, Sam (Department: 2161)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000, C707S793000
Reexamination Certificate
active
10881893
ABSTRACT:
In a hierarchical taxonomy of document, the categories of information may be structured as a binary tree with the nodes of the binary tree containing information relevant to the search. The binary tree may be ‘trained’ or formed by examining a training set of documents and separating those documents into two child nodes. Each of those sets of documents may then be further split into two nodes to create the binary tree data structure. The nodes may be generated to maximize the likelihood that all of the training documents are in either or both of the two child nodes. In one example, each node of the binary tree may be associated with a list of terms and each term in each list of terms is associated with a probability of that term appearing in a document given that node. New documents may be categorized by the nodes of the tree. For example, the new documents may be assigned to a particular node based upon the statistical similarity between that document and the associated node.
REFERENCES:
patent: 5325298 (1994-06-01), Gallant
patent: 5619709 (1997-04-01), Caid et al.
patent: 6298340 (2001-10-01), Calvignac et al.
patent: 6360227 (2002-03-01), Aggarwal et al.
patent: 6438590 (2002-08-01), Gartner et al.
patent: 6446061 (2002-09-01), Doerre et al.
patent: 6484149 (2002-11-01), Jammes et al.
patent: 6526440 (2003-02-01), Bharat
patent: 6529903 (2003-03-01), Smith et al.
patent: 6615209 (2003-09-01), Gomes et al.
patent: 6647004 (2003-11-01), Allen, Jr. et al.
patent: 6658423 (2003-12-01), Pugh et al.
patent: 6675163 (2004-01-01), Bass et al.
patent: 6678681 (2004-01-01), Brin
patent: 6704729 (2004-03-01), Klein et al.
patent: 7103609 (2006-09-01), Elder et al.
patent: 2002/0078091 (2002-06-01), Vu et al.
patent: 2002/0123988 (2002-09-01), Dean et al.
patent: 2002/0133481 (2002-09-01), Smith et al.
patent: 2002/0147906 (2002-10-01), Lotspeich et al.
patent: 2003/0217335 (2003-11-01), Chung et al.
patent: 2004/0111438 (2004-06-01), Chitrapura et al.
patent: 2005/0114161 (2005-05-01), Garg et al.
Beitzel et al., Using Titles and Category Names for Editor-Driven Taxonomies for Automatic Evaluation, CIKM 2003, Nov. 3, 2003, New Orleans, LA, USA, pp. 17-23.
Brin et al, The Anatomy of a Large-Scale Hypertextual Web Search Engine, http://www7.scu.au/programme/fullpapers/1921/com1921.htm.
Hofmann, Probablistic Latent Semantic Indexing, 22nd Int'l SIGIR Conference on R&D in Information Retrieval, Aug. 15, 1999, pp. 50-57.
McCallum et al., A comparison of Event Modelsfor Naive Bayes Text Classification, http://www-2.cs.cmu.edu/˜knigam/papers/multionomial-aaaiws98.pdf.
Sebastiani, Machine Learning in Automated Text Categorization, ACM Computing Surveys, vol. 34, No. 1, Mar. 2002, pp. 1-47.
Waldvogel et al., Scalable High Speed Prefix Matching, ACM Transactions on Computer Systems, vol. 19, No. 4, Nov. 2001, pp. 440-482.
Zhai et al, A Study of Smoothing Methods for Language Models Applied to Information Retrieval, ACM Transaction on Information systems, vol. 22, No. 2, Apr. 2004, pp. 179-214.
Bibbee Jared M
Microsoft Corporation
Rimell Sam
LandOfFree
Automated taxonomy generation does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Automated taxonomy generation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automated taxonomy generation will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3745892