Efficient weighted consistent sampling

Data processing: artificial intelligence – Having particular user interface

Reexamination Certificate

Details

C708S270000, C702S028000

active

07925598

ABSTRACT:
A method and a processing device may be provided for performing efficient weighted consistent sampling. A group of sets having multiple elements with associated weights may be provided. A single hash function may be applied to each of the elements of the group of sets to produce consistent uniformly distributed non-negative random numbers. Transformed values corresponding to each of the elements may be produced by determining a w-th root of a value based on applying the hash function to a respective element, where w may be based on a weight associated with the respective element. A minimum transformed value or a maximum transformed value may be determined for each of the sets. Sets having matching ones of the minimum transformed value or the maximum transformed value may be determined. The determined sets may be considered to be similar.
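
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `uniform_hash`, `transformed`, `sample`, and `similar`, the use of SHA-256 as the single hash function, and the choice of the maximum (rather than the minimum) transformed value are all assumptions made for illustration. Each element is hashed to a value u in (0, 1], transformed to u^(1/w) for its weight w, and the element with the largest transformed value is taken as the set's sample.

```python
import hashlib


def uniform_hash(element: str, seed: int = 0) -> float:
    """Deterministically map an element to a value in (0, 1].

    SHA-256 stands in for the abstract's single hash function (an assumption).
    """
    digest = hashlib.sha256(f"{seed}:{element}".encode("utf-8")).digest()
    n = int.from_bytes(digest[:8], "big")  # 64-bit non-negative integer
    return (n + 1) / 2**64


def transformed(element: str, weight: float, seed: int = 0) -> float:
    """Return the w-th root of the hashed value, where w is the weight."""
    if weight <= 0:
        raise ValueError("weight must be positive")
    return uniform_hash(element, seed) ** (1.0 / weight)


def sample(weighted_set: dict, seed: int = 0) -> tuple:
    """Return (element, value) for the maximum transformed value in the set."""
    pairs = ((e, transformed(e, w, seed)) for e, w in weighted_set.items())
    return max(pairs, key=lambda pair: pair[1])


def similar(a: dict, b: dict, seed: int = 0) -> bool:
    """Treat two sets as similar when their maximum transformed values match."""
    return sample(a, seed) == sample(b, seed)
```

Because u is at most 1, raising it to the power 1/w pushes heavier elements toward 1, so under an idealized uniform hash an element is selected with probability proportional to its weight. In practice one would likely repeat the comparison over several values of the hypothetical `seed` parameter and count the matches, since a single sample gives only a yes-or-no answer.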

REFERENCES:
patent: 6697800 (2004-02-01), Jannink et al.
patent: 2005/0060643 (2005-03-01), Glass et al.
patent: 2006/0242217 (2006-10-01), Bartels
patent: 2007/0005556 (2007-01-01), Ganti et al.
patent: 2007/0027672 (2007-02-01), Decary et al.
patent: 2007/0038659 (2007-02-01), Datar et al.
patent: 2007/0118498 (2007-05-01), Song et al.
patent: 2007/0124698 (2007-05-01), Majumder
patent: 2007/0226188 (2007-09-01), Johnson et al.
Broder, et al., “Syntactic Clustering of the Web”, Jul. 25, 1997, Digital Equipment Corporation, pp. 1-13.
Weis, et al., “Space and Time Scalability of Duplicate Detection in Graph Data”, Jun. 2007, pp. 1-28.
Henzinger, “Tutorial: Web Information Retrieval”, 2007, IEEE, pp. 1-154.
Manasse, et al., “Consistent Weighted Sampling”, 2007, pp. 7.
Yang, et al., “Near-Duplicate Detection for eRulemaking”, vol. 89, May 2005, p. 9.
Broder, et al., “Min-Wise Independent Permutations”, 1998, pp. 1-36.
Gollapudi, et al., “Exploiting Asymmetry in Hierarchical Topic Extraction”, 2006, pp. 8.
