Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2007-03-27
2007-03-27
Wong, Don (Department: 2163)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000
Reexamination Certificate
active
10424170
ABSTRACT:
Provided is a method and computer program product for determining a document relevance function for estimating a relevance score of a document in a database with respect to a query. For each of a plurality of test queries, a respective set of result documents is collected. For each test query, a subset of the documents in the respective result set is selected, and a set of training relevance scores is assigned to documents in the subset. In one embodiment, at least some of the training relevance scores are assigned by human subjects who determine individual relevance scores for submitted documents with respect to the corresponding queries. Finally, a relevance function is determined based on the plurality of test queries, the subsets of documents, and the sets of training relevance scores.
REFERENCES:
patent: 5696962 (1997-12-01), Kupiec
patent: 5909510 (1999-06-01), Nakayama
patent: 6026388 (2000-02-01), Liddy et al.
patent: 6119114 (2000-09-01), Smadja
patent: 6651057 (2003-11-01), Jin et al.
patent: 7062485 (2006-06-01), Jin et al.
patent: 2002/0114394 (2002-08-01), Ma
patent: 2003/0040930 (2003-02-01), Zhai
patent: 2003/0061214 (2003-03-01), Alpha
patent: 2003/0074353 (2003-04-01), Berkan et al.
patent: 2004/0059736 (2004-03-01), Willse et al.
patent: 2005/0004943 (2005-01-01), Chang
patent: 2005/0033745 (2005-02-01), Wiener et al.
k.L Kwok & L. Grunfeld, “TREC-4 Ad-Hoc, Routing Retrieval and Filtering Experiments using PIRCS”, Computer Science Dept., Queens College, CUNY, Flusing, NY, 8 pages.
Friedman, J.H. “Greedy Function Approximation: A Gradient Boosting Machine,” The Annals of Statistics 29(5), Oct. 2001., PA. 2145-45145.
Gey, F. C. “Inferring the Probability of Relevance Using the Method of Logistic Regression”, SIGIR 1994: 222-231.
Kleinberg J., “Authoritative sources in a hyperlinked environment,” in Proceedings of the Nineth Annual ACM-SIAM Symposium on Discrete Algorithms, 1998., pp. 668-677.
Page L., Brin S., Motwani R., and Winograd T., “The PageRank citation ranking: Bringing order to the Web,” http://citeseer.nj.nec.com/page98pagerank.html, website last accessed Apr. 10, 2003, pp. 1-17.
Schapire, R.E. “The Boosting Approach to Machine Learning: An Overview”, in MSRI Workshop on Nonlinear Estimation and Classification, 2002, pp. 1-23.
Indexing by Latent Semantic Analysis, Deerwester, et al., 1990, pp. 1-33.
The INQUERY Retrieval System, Callan, et al., 1992, pp. 1-9.
Query Expansion Using Local and Global Document Analysis, Xu, et al., 1996, pp. 1-8.
Publication Query Results, http://ciir.cs.unmass.edu/cgi-bin/irdemo/pubdb—scripts/oldsearch—pub.pl. Dec. 11, 2003, pp. 1-3.
Black Linh
Dreier LLP
Overture Services Inc.
Wong Don
LandOfFree
Method and apparatus for machine learning a document... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for machine learning a document..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for machine learning a document... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3724750