Data processing: database and file management or data structures – Database and file access – Search engines
Reexamination Certificate
2011-01-18
2011-01-18
Trujillo, James (Department: 2159)
Data processing: database and file management or data structures
Database and file access
Search engines
Reexamination Certificate
active
07873624
ABSTRACT:
Structured content and associated metadata from the Web are leveraged to provide specific answer string responses to user questions. The structured content can also be indexed at crawl-time to facilitate searching of the content at search-time. Ranking techniques can also be employed to facilitate in providing an optimum answer string and/or a top K list of answer strings for a query. Ranking can be based on trainable algorithms that utilize feature vectors for candidate answer strings. In one instance, at crawl-time, structured content is indexed and automatically associated with metadata relating to the structured content and the source web page. At search-time, candidate indexed structured content is then utilized to extract an appropriate answer string in response to a user query.
REFERENCES:
patent: 5418948 (1995-05-01), Turtle
patent: 6006218 (1999-12-01), Breese et al.
patent: 6269368 (2001-07-01), Diamond
patent: 6665666 (2003-12-01), Brown et al.
patent: 7085755 (2006-08-01), Bluhm et al.
patent: 2002/0129005 (2002-09-01), Ojanen
patent: 2004/0068486 (2004-04-01), Chidlovskii
patent: 2004/0249808 (2004-12-01), Azzam et al.
patent: 2005/0027687 (2005-02-01), Nowitz et al.
patent: 2006/0173834 (2006-08-01), Brill et al.
patent: 2006/0235689 (2006-10-01), Sugihara et al.
Lide Wu, Xuanjing Huang, Lan You, Zhushuo Zhang, Xin Li, and Yaqian Zhou. 2004. FDUQA on TREC2004 QA Track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004).
E. Brill, J. Lin, M. Banko, S. Dumais and A. Ng. 2001. Data-Intensive Question Answering. In Proceedings of the Tenth Text Retrieval Conference (TREC 2001), Gaithersburg, MD, pp. 183-189.
Embley et al.,“Automatically Extracting Ontologically Specified Data from HTML Tables of Unknown Structure”, 2002.
Pinto et al., “QuASM: A System for Question Answering Using Semi-Structured Data”, 2002, ACM, pp. 46-55.
Pinto et al.,“Table Extraction Using Conditional Random Fields”, 2003, ACM, pp. 235-242.
Neumann et al.,“Mining Natural Language Answers from the Web”, 2004, ACM, pp. 123-135.
Cai et al.,“Extracting Content Structure for Web Pages Based on Visual Representation”, 2003, Springer-Verlag Berlin Heidelberg, pp. 406-417.
E. Agichtein, et al., Snowball: Extracting Relations From Large Plain-Text Collections. In Proceedings of the 5th ACM International Conference on Digital Libraries, Jun. 2000.
E. Brill, et al., An Analysis of the AskMSR Question-Answering System. In EMNLP, 2002.
S. Buchholz, Using Grammatical Relations, Answer Frequencies and the World Wide Web for Trec Question Answering. In TREC, 2001.
C. Burges, et al., Learning to Rank Using Gradient Descent. In Proceedings of the Twenty Second International Conference on Machine Learning, Bonn, Germany, 2005.
R. Caruana, et al., Using the Future to “Sort Out” the Present: Rankprop and Multitask Learning for Medical Risk Evaluation. In D. S. Touretzky, M. C. ozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems, vol. 8, pp. 959-965. The MIT Press, 1996.
J. Caverlee, et al., Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets From the Deep Web. In Proceedings of ICDE, 2004.
K. C.-C. Chang, et al., Toward Large Scale Integration: Building a MetaQuerier Over Databases on the Web. In Proceedings of the Second Conference on Innovative Data Systems Research (CIDR 2005), 2005.
K. Crammer, et al., On the Algorithmic Implementation of Multiclass Kernel-Based Vector Machines. Journal of Machine Learning Research, 2:265-292, 2001.
A. Doan, et al., Semantic Integration Research in the Database Community: A Brief Survey. AI Magazine, Special Issue on Semantic Integration, 2005.
O. Etzioni, et al., Web-scale information extraction in knowitall. In WWW, 2004.
E. Harrington, Online Ranking/Collaborative Filtering Using the Perceptron Algorithm. In Proceedings of the Twentieth International Conference on Machine Learning, 2003.
W. Hildebrandt, et al., Answering Definition Questions With Multiple Knowledge Sources. In HLT/NAACL 2004, 2004.
V. Hristidis, et al., Efficient IR-Style Keyword Search Over Relational Databases. In VLDB, 2003.
C. C. T. Kwok, et al., Scaling Question Answering to the Web. In Proceedings of the 10thWorldWide Web Conference (WWW-10), 2001.
G. Ramakrishnan, et al., Is Question Answering an Acquired Skill? in WWW, 2004.
E. M. Voorhees, Overview of the TREC 2003 Question Answering Track. In Text REtrieval Conference, 2004.
Agichtein, et al. Learning to Find Answers to Questions on the Web. In: ACM Transactions on Internet Technology, vol. 4, No. 2, May 2004, pp. 129-162. http://delivery.acm.org/10.1145/1000000/990303/p129-agichtein.pdf?key1=990303&key2=8208815221&coll=GUIDE&dl=GUIDE&CFID=8355246&CFTOKEN=26669024. Last accessed Oct. 28, 2008, 34 pages.
Mitchell, T. M., Machine Learning, McGraw-Hill, New York, 1997, pp. 154-198.
Agichtein Yevgeny E.
Brill Eric D.
Burges Christopher J.
Conyers Dawaune
Lee & Hayes PLLC
Microsoft Corporation
Trujillo James
LandOfFree
Question answering over structured content on the web does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Question answering over structured content on the web, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Question answering over structured content on the web will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2739509