Method and apparatus for machine learning a document...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

10424170

ABSTRACT:
Provided is a method and computer program product for determining a document relevance function for estimating a relevance score of a document in a database with respect to a query. For each of a plurality of test queries, a respective set of result documents is collected. For each test query, a subset of the documents in the respective result set is selected, and a set of training relevance scores is assigned to documents in the subset. In one embodiment, at least some of the training relevance scores are assigned by human subjects who determine individual relevance scores for submitted documents with respect to the corresponding queries. Finally, a relevance function is determined based on the plurality of test queries, the subsets of documents, and the sets of training relevance scores.

REFERENCES:
patent: 5696962 (1997-12-01), Kupiec
patent: 5909510 (1999-06-01), Nakayama
patent: 6026388 (2000-02-01), Liddy et al.
patent: 6119114 (2000-09-01), Smadja
patent: 6651057 (2003-11-01), Jin et al.
patent: 7062485 (2006-06-01), Jin et al.
patent: 2002/0114394 (2002-08-01), Ma
patent: 2003/0040930 (2003-02-01), Zhai
patent: 2003/0061214 (2003-03-01), Alpha
patent: 2003/0074353 (2003-04-01), Berkan et al.
patent: 2004/0059736 (2004-03-01), Willse et al.
patent: 2005/0004943 (2005-01-01), Chang
patent: 2005/0033745 (2005-02-01), Wiener et al.
k.L Kwok & L. Grunfeld, “TREC-4 Ad-Hoc, Routing Retrieval and Filtering Experiments using PIRCS”, Computer Science Dept., Queens College, CUNY, Flusing, NY, 8 pages.
Friedman, J.H. “Greedy Function Approximation: A Gradient Boosting Machine,” The Annals of Statistics 29(5), Oct. 2001., PA. 2145-45145.
Gey, F. C. “Inferring the Probability of Relevance Using the Method of Logistic Regression”, SIGIR 1994: 222-231.
Kleinberg J., “Authoritative sources in a hyperlinked environment,” in Proceedings of the Nineth Annual ACM-SIAM Symposium on Discrete Algorithms, 1998., pp. 668-677.
Page L., Brin S., Motwani R., and Winograd T., “The PageRank citation ranking: Bringing order to the Web,” http://citeseer.nj.nec.com/page98pagerank.html, website last accessed Apr. 10, 2003, pp. 1-17.
Schapire, R.E. “The Boosting Approach to Machine Learning: An Overview”, in MSRI Workshop on Nonlinear Estimation and Classification, 2002, pp. 1-23.
Indexing by Latent Semantic Analysis, Deerwester, et al., 1990, pp. 1-33.
The INQUERY Retrieval System, Callan, et al., 1992, pp. 1-9.
Query Expansion Using Local and Global Document Analysis, Xu, et al., 1996, pp. 1-8.
Publication Query Results, http://ciir.cs.unmass.edu/cgi-bin/irdemo/pubdb—scripts/oldsearch—pub.pl. Dec. 11, 2003, pp. 1-3.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for machine learning a document... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for machine learning a document..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for machine learning a document... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3724750

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.