Topic distillation via subsite retrieval

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

active

07580931

ABSTRACT:
A method and system are provided for generating a search result for a query of hierarchically organized documents based on retrieval of subtrees that are key resources for topic distillation. The retrieval system may identify documents relevant to a query using conventional searching techniques. The retrieval system then calculates a subtree feature for each subtree that has an identified document as its root. The retrieval system may then generate a subtree relevance score for each subtree based on its subtree feature, and order the identified documents by their corresponding subtree relevance scores.
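
The pipeline the abstract describes can be sketched in Python as follows. This is an illustrative sketch only, not the patented method: the abstract does not say what the subtree feature contains or how the relevance score is derived from it. The names `Doc`, `subtree_feature`, `subtree_relevance`, and `rank_by_subtree`, the choice of feature (root score plus mean score of relevant descendants), and the weighted-sum scoring are all assumptions made for illustration. The `page_scores` mapping stands in for the output of a conventional search step.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    """A document in a hierarchy; children are the documents directly beneath it."""
    doc_id: str
    children: list = field(default_factory=list)


def subtree_docs(root):
    """Yield every document in the subtree rooted at `root`, root included."""
    stack = [root]
    while stack:
        node = stack.pop()
        yield node
        stack.extend(node.children)


def subtree_feature(root, page_scores):
    """Hypothetical feature: (root score, mean score of relevant descendants)."""
    below = [page_scores[d.doc_id] for d in subtree_docs(root)
             if d is not root and d.doc_id in page_scores]
    mean_below = sum(below) / len(below) if below else 0.0
    return page_scores[root.doc_id], mean_below


def subtree_relevance(feature, weight=0.5):
    """Hypothetical scoring: weighted sum of root score and descendant mean."""
    root_score, mean_below = feature
    return (1 - weight) * root_score + weight * mean_below


def rank_by_subtree(docs_by_id, page_scores, weight=0.5):
    """Order the identified documents by the relevance of the subtree each one roots."""
    scored = {
        doc_id: subtree_relevance(
            subtree_feature(docs_by_id[doc_id], page_scores), weight)
        for doc_id in page_scores  # only documents the base search identified
    }
    return sorted(scored, key=scored.get, reverse=True)
```

Under these assumptions, a section page with a modest score of its own but several highly relevant pages beneath it can outrank an isolated page with a higher individual score, which is the behavior a key-resource ranking would want.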

REFERENCES:
patent: 5940821 (1999-08-01), Wical
patent: 6363378 (2002-03-01), Conklin et al.
patent: 6738678 (2004-05-01), Bharat et al.
patent: 6871202 (2005-03-01), Broder
patent: 7028029 (2006-04-01), Kamvar et al.
patent: 2003/0037074 (2003-02-01), Dwork et al.
patent: 2003/0204502 (2003-10-01), Tomlin et al.
patent: 2004/0267722 (2004-12-01), Larimore et al.
patent: 2005/0060297 (2005-03-01), Najork
patent: 2005/0071465 (2005-03-01), Zeng et al.
patent: 2006/0004809 (2006-01-01), Zhang et al.
patent: 1653380 (2006-05-01), None
U.S. Appl. No. 11/293,044, Bragdon.
U.S. Appl. No. 11/459,869, Liu et al.
Albert, Reka and Albert-Laszlo Barabasi, “Statistical mechanics of complex networks,” Reviews of Modern Physics, vol. 74, Jan. 2002, © 2002 The American Physical Society, pp. 47-97.
Amitay, Einat, et al., “Topic Distillation with Knowledge Agents,” 11th TREC, 2002, 10 pages.
Arasu, Arvind et al., “PageRank Computation and the Structure of the Web: Experiments and Algorithms,” Technical Report, IBM Almaden Research Center, Nov. 2001, 5 pages.
Baeza-Yates, R. and B. Ribeiro-Neto, “Chapter 2 Modeling and Chapter 3 Retrieval Evaluation,” Modern Information Retrieval, © 1999 by the ACM Press, pp. 19-97.
Bharat, Krishna and George A. Mihaila, “When Experts Agree: Using Non-Affiliated Experts to Rank Popular Topics,” WWW10, Hong Kong, pp. 597-602.
Bharat, Krishna and Monika R. Henzinger, “Improved Algorithms for Topic Distillation in a Hyperlinked Environment,” SIGIR'98, Melbourne, Australia, ACM 1998, 9 pages.
Bharat, Krishna et al., “Who Links to Whom: Mining Linkage between Web Sites,” In Proceedings of the IEEE International Conference on Data Mining (ICDM'01), San Jose, California, Nov. 2001, 8 pages.
Broder, Andrei, “A taxonomy of web search,” SIGIR Forum 36(2), 2002, 8 pages.
Chakrabarti, Soumen, “Integrating the Document Object Model with Hyperlinks for Enhanced Topic Distillation and Information Extraction,” WWW10, May 2001, Hong Kong, pp. 211-220.
Chakrabarti, Soumen, Mukul Joshi and Vivek Tawde, “Enhanced Topic Distillation using Text, Markup Tags, and Hyperlinks,” SIGIR'01, New Orleans, Louisiana, ACM 2001, 9 pages.
Cho, Grace E. and Carl D. Meyer, “Aggregation/Disaggregation Methods of Nearly Uncoupled Markov Chains,” Nov. 24, 1999, Department of Mathematics, North Carolina State University, 12 pages.
Craswell, Nick and David Hawking, “Overview of the TREC 2003 Web Track,” 12th TREC 2003, Mar. 22, 2004, pp. 1-15.
Davulcu, Hasan et al., “OntoMiner: Bootstrapping Ontologies From Overlapping Domain Specific Web sites,” WWW2004, May 2004, New York, ACM, 2 pages.
Despeyroux, Thierry, “Practical Semantic Analysis of Web Sites and Documents,” WWW2004, May 2004, New York, ACM, pp. 685-693.
Dill, Stephen et al., “Self-Similarity In the Web,” ACM Transactions on Internet Technology, vol. 2, No. 3, Aug. 2002, © 2002 ACM, pp. 205-223.
Dwork, Cynthia et al., “Rank Aggregation Methods for the Web,” WWW10, May 2001, Hong Kong, ACM, pp. 613-622.
Eiron, Nadav et al., “Ranking the Web Frontier,” WWW2004, May 2004, New York, ACM, pp. 309-318.
Girvan, Michelle and M. E. J. Newman, “Community structure in social and biological networks,” Dec. 7, 2001, Proc. Natl. Acad. Sci. USA, 2002, pp. 7821-7826.
Google, http://www.google.com, 1 page, [last accessed Jan. 26, 2007].
Hawking, David, “Overview of the TREC-9 Web Track,” 9th TREC, 2000, Sep. 4, 2001, pp. 1-16.
Henzinger, Monika R. et al., “Challenges in Web Search Engines,” Sep. 3, 2002, In Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003, 12 pages.
Kamvar, Sepandar, D. et al., “Exploiting the Block Structure of the Web for Computing PageRank,” Stanford University Technical Report, Copyright 2003, 13 pages.
Kleinberg, Jon M., “Authoritative Sources in a Hyperlinked Environment,” Journal of the ACM, vol. 46, No. 5, 1999, 34 pages.
Langville, Amy N. and Carl D. Meyer, “Deeper Inside PageRank,” Jul. 6, 2004, Internet Mathematics, vol. 1, No. 3, © A K Peters, Ltd., pp. 335-380.
Lei, Yuangui et al., “Modelling Data-Intensive Web Sites with OntoWeaver,” In Proceedings of International Workshop on Web Information Systems Modeling, Riga, Latvia, Jun. 2004, 16 pages.
Lerman, Kristina et al., “Using the Structure of Web Sites for Automatic Segmentation of Tables,” SIGMOD 2004, Paris, France, © 2004 ACM, 12 pages.
Meghabghab, George, “Google's Web Page Ranking Applied to Different Topological Web Graph Structures,” Jan. 2, 2001, Journal of the American Society for Information Science and Technology, 52(9), Jul. 2001, © 2001 by John Wiley & Sons, Inc., pp. 736-747.
Meyer, C. D., “Stochastic Complementation, Uncoupling Markov Chains, and the Theory of Nearly Reducible Systems,” Feb. 2, 1989, SIAM Review, 31 (1989), 34 pages.
NetCraft, http://www.netcraft.com, 6 pages, [last accessed Jan. 26, 2007].
Page, L., S. Brin, R. Motwani and T. Winograd, “The PageRank Citation Ranking: Bringing Order to the Web,” Jan. 29, 1998, Stanford University Technical Report, 17 pages.
Qin, Tao et al., “Subsite Retrieval: A Novel Concept for Topic Distillation,” G.G. Lee et al. (Eds.), AIRS 2005, LNCS 3689, 2005, © Springer-Verlag Berlin Heidelberg 2005, pp. 388-400.
ResearchBuzz!, “Google Celebrates 7, Where Did the 8 Go?,” Sep. 27, 2005, http://www.researchbuzz.org/2005/09/google_celebrates_7_where_did.shtml.
Robertson, S.E. and K. Sparck Jones, “Relevance Weighting of Search Terms,” Journal of the American Society for Information Science, vol. 27, No. 3, May-Jun. 1976, pp. 129-146.
Robertson, S.E., “Overview of the Okapi Projects,” Journal of Documentation, vol. 53, No. 1, Jan. 1997, pp. 3-7.
Shakery, Azadeh and ChengXiang Zhai, “Relevance Propagation for Topic Distillation UIUC TREC-2003 Web Track Experiments,” 12th TREC, 2003, pp. 1-5.
TREC-2004 Web Track Guidelines, Updated Jul. 16, 2004 (7 pages).
Wu, Jie and Karl Aberer, “Using SiteRank for P2P Web Retrieval,” Mar. 24, 2004, EPFL Technical Report ID: IC/2004/31, 20 pages.
Yu, Shipeng et al., “Improving Pseudo-Relevance Feedback in Web Information Retrieval Using Web Page Segmentation,” WWW2003, May 2003, Budapest, Hungary, ACM, 11 pages.
