Similarity-based searching

Data processing: database and file management or data structures – Database and file access – Search engines

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

08032507

ABSTRACT:
Pairs of similar objects in a population of objects can be found using a process that includes identifying a comparison vector x in a set of vectors having non-zero features, determining an estimated similarity contribution of a subset of features of the comparison vector x to a similarity between the comparison vector x and each vector in the set of vectors, generating an index that includes features based on a comparison of the similarity contribution with a similarity threshold, and identifying another vector in the set that is similar to the vector x using the index.

REFERENCES:
patent: 5933806 (1999-08-01), Beyerlein et al.
patent: 6055540 (2000-04-01), Snow et al.
patent: 2004/0194612 (2004-10-01), Parees
patent: 2005/0234953 (2005-10-01), Zhang et al.
patent: 2008/0005223 (2008-01-01), Flake et al.
patent: 2008/0021860 (2008-01-01), Wiegering et al.
patent: 2008/0031203 (2008-02-01), Bill
Buckley et al., “Optimization of Inverted Vector Searches” Proc. of the Eight Annual Int'l Conf. on Research & Dev. In Information Retrieval, pp. 97-110, 1985, 14 pages.
Chaudhuri et al., “A Primitive Operator for Similarity Joins in Data Cleaning” Proc. of the 22nd Int'l Conf. on Data Engineering, (to appear), 2006, 12 pages.
Sarawagi et al., “Efficient set joins on similarity predicates” Proc. of the ACM SIGMOD, pp. 743-754, 2004, 12 pages.
Turtle et al., “Query Evaluation: Strategies and Optimizations” Inform. Process. & Management 31(6):831-850, 1995, 20 pages.
Arasu, A., et al., “Efficient Exact Set-Similarity Joins,” VLBD '06 Proceedings of the 32nd International Conference on Very Large Data Bases, Sep. 12-15, 2006, Seoul, Korea, pp. 918-929.
Broder, A.Z., et al., “Syntactic Clustering of the Web,”Proc. Of the 6thInt'l World Wide Web Conference, 1997, pp. 391-303.
Charikar, M.S., “Similarity Estimation Techniques from Rounding Algorithms,”Proc. Of the 34thAnnual Symposium on Theory of Computing, 2002, pp. 380-388.
Chien, S., and Immorlica, N., “Semantic Similarity Between Search Engine Queries Using Temporal Correlation,”Proc. Of the 14thInt'l World Wide Web Conference, 2005, pp. 2-11.
Fagin, R., et al., “Efficient Similarity Search and Classification via Rank Aggregation,”Proc. Of the 2003 ACM-SIGMOID Int'l Conference on Management of Data, pp. 301-312.
Gionis, A., “Similarity Search in High Dimensions via Hashing,”Proc. Of the 25thInt'l Conference on Very Large Data Bases, 1999, pp. 518-529.
Indyk, P., and Motwani, R., “Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality,”Proc. Of the 30thSymposium on the Theory of Computing, 1998, pp. 604-613.
Bayardo, R.J., et al., ‘Scaling Up All Pairs Similarity Search,’ Proc. of the 16th Int'l Conf. on World Wide Web, WWW 2007, May 8-12, 2007, Banff, Alberta, Canada, pp. 131-140, revised May 21, 2007.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Similarity-based searching does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Similarity-based searching, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Similarity-based searching will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4294226

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.