Document comparison using multiple similarity measures

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

07472121

ABSTRACT:
Disclosed herein is a method for comparing documents. The method includes the steps of: determining a plurality of similarity measures; and determining an overall similarity measure for the plurality of documents, based on the plurality of similarity measures. In one embodiment, the similarity measures are chosen from the group of similarity measures consisting of semantic and reference similarity measures. When comparing documents from the chemical, biochemical or pharmaceutical domains, the determination of the similarity utilizes a determination of structural similarity of the chemical formulas described in the plurality of documents.

REFERENCES:
patent: 5774833 (1998-06-01), Newman
patent: 6556992 (2003-04-01), Barney et al.
patent: 2003/0004936 (2003-01-01), Grune et al.
patent: 2003/0033295 (2003-02-01), Adler et al.
J. Dean and M. Henzinger. Finding Related Pages in the World-Wide Web. InProceedings of the Eight International World-Wide Web Conference, Toronto, Canada, May 1999.
P. Ganesan, H. Garcia-Molina, and J. Widom. Exploiting Hierarchical Domain Structure to Compute Similarity.ACM Transactions on Information Systems, 21(1):64(93, Jan. 2003.
V. Gillet, D. Wild, P. Willett, and J. Bradshaw. Similarity and Dissimilarity Measures for Processing Chemical Structure Databases.The Computer Journal, 41(8), 1998.
M. Halkidi, B. Nguyen, I. Varlamis, and M. Vazirgiannis. Thesus: Organizing Web Document Collections based on Link Semantics.VLDB Journal, 12(4), Nov. 2003.
N. Kando and M. Leong. Workshop on Patent Retrieval: SIGIR 2000 Workshop Report.ACM SIGIR Forum, 34(1):28{Apr. 30, 2000.
L. Larkey. A Patent Search and Classification System. Inthe Proceedings of the ACM Digital Library Conference, Berkeley, CA, 1999.
D. Lin. An Information-Theoretic Definition of Similarity. Inthe Proceedings of the International Conference on Machine Learning, pp. 296{304, San Francisco, Ca, 1998.
M. Marinescu, M. Markellou, G. Mayritsakis, K. Perdikuri, S. Sirmakessis, and A. Tsakalidis. Knowledge Discovery in Patent Databases. Inthe Proceedings of the ACM Conference on Information and Knowledge Management, McLean, Virginia, 2002.
S. Mukherjea and B. Bamba. BioPatentMiner: An Information Retrieval System for BioMedical Patents, inthe Proceedings of the Very Large Databases(VLDB)Conference, Toronto, Canada, 2004.
M. Osborn, T. Strzalknowski, and M. Marinescu. Evaluating Document Retrieval in Patent Database: a Preliminary Report. Inthe Proceedings of the ACM Conference on Information and Knowledge Management, Las Vegas, Nevada, 1997.
J. Pitkow and P. Pirolli. Life, Death and Lawfulness on the Electronic Frontier. InProceedings of the ACM SIGCHI '97 Conference on HumanFactor s in Computing Systems, pp. 383{390, Atlanta, Ga, Mar. 1997.
P. Resnik. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. Inthe Proceedings of the International Joint Conference on Artificial Intelligence(IJCAI), pp. 448{453, 1995.
G. Salton, A. Wong and C.S.Yand. A Vector Space Model for Automatic Indexing in theCommunications of the ACM, 18(11), Nov. 1975.
L. Subramaniam, S. Mukherjea, P. Kankar, B. Srivastava, V. Batra, P. Kamesam, and R. Kothari. Information Extraction from Biomedical Literature: Methodology, Evaluation and an Application. Inthe Proceedings of the ACM Conference on Information and Knowledge Management, New Orleans, Lousiana, 2003.
J. Chen et al. A Protein Patent Query System Powered by Klesli.,Proceedings of the ACM SIGMOD Conference, Seattle, WA, 1998.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Document comparison using multiple similarity measures does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Document comparison using multiple similarity measures, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Document comparison using multiple similarity measures will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4037612

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.