Data processing: artificial intelligence – Having particular user interface
Reexamination Certificate
2011-04-12
Sparks, Donald (Department: 2129)
C708S270000, C702S028000
active
07925598
ABSTRACT:
A method and a processing device may be provided for performing efficient weighted consistent sampling. A group of sets having multiple elements with associated weights may be provided. A single hash function may be applied to each of the elements of the group of sets to produce consistent uniformly distributed non-negative random numbers. Transformed values corresponding to each of the elements may be produced by determining a w-th root of a value based on applying the hash function to a respective element, where w may be based on a weight associated with the respective element. A minimum transformed value or a maximum transformed value may be determined for each of the sets. Sets having matching ones of the minimum transformed value or the maximum transformed value may be determined. The determined sets may be considered to be similar.
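The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the "value based on applying the hash function" is a uniform number in (0, 1), that w equals the element's weight, and that the maximum transformed value is the one kept. The names uniform_hash, weighted_sample and estimate_similarity, the use of SHA-256, and the seed parameter for drawing several independent samples are all hypothetical choices made for illustration.

```python
import hashlib
from typing import Dict, Optional, Tuple


def uniform_hash(element: str, seed: int = 0) -> float:
    """Map an element to a consistent pseudo-random number strictly inside (0, 1).

    SHA-256 and the seed prefix are illustrative assumptions, not from the patent.
    """
    digest = hashlib.sha256(f"{seed}:{element}".encode("utf-8")).digest()
    # Keep the top 53 bits so the integer fits a float mantissa.
    n = int.from_bytes(digest[:8], "big") >> 11
    # Shift by one and divide by 2**53 + 1 to exclude both 0 and 1.
    return (n + 1) / (2**53 + 1)


def weighted_sample(
    weighted_set: Dict[str, float], seed: int = 0
) -> Optional[Tuple[str, float]]:
    """Return the (element, transformed value) pair with the largest w-th root.

    For u uniform in (0, 1), u ** (1 / w) grows toward 1 as w grows, so
    heavier elements are more likely to hold the maximum.
    """
    best: Optional[Tuple[str, float]] = None
    for element, weight in weighted_set.items():
        if weight <= 0:
            continue  # elements without positive weight cannot be sampled
        value = uniform_hash(element, seed) ** (1.0 / weight)
        if best is None or value > best[1]:
            best = (element, value)
    return best


def estimate_similarity(
    a: Dict[str, float], b: Dict[str, float], num_samples: int = 64
) -> float:
    """Fraction of seeds for which both sets yield the same sample."""
    matches = sum(
        weighted_sample(a, seed) == weighted_sample(b, seed)
        for seed in range(num_samples)
    )
    return matches / num_samples


if __name__ == "__main__":
    doc_a = {"apple": 3.0, "banana": 1.0, "cherry": 2.0}
    doc_b = {"apple": 3.0, "banana": 1.0, "durian": 2.0}
    print(weighted_sample(doc_a))
    print(estimate_similarity(doc_a, doc_b))
```

In this sketch two sets produce a matching sample only when the same element, with the same weight, holds the maximum in both, so a higher match rate over many seeds suggests greater overlap between the sets.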
REFERENCES:
patent: 6697800 (2004-02-01), Jannink et al.
patent: 2005/0060643 (2005-03-01), Glass et al.
patent: 2006/0242217 (2006-10-01), Bartels
patent: 2007/0005556 (2007-01-01), Ganti et al.
patent: 2007/0027672 (2007-02-01), Decary et al.
patent: 2007/0038659 (2007-02-01), Datar et al.
patent: 2007/0118498 (2007-05-01), Song et al.
patent: 2007/0124698 (2007-05-01), Majumder
patent: 2007/0226188 (2007-09-01), Johnson et al.
Broder, et al., “Syntactic Clustering of the Web”, Jul. 25, 1997, Digital Equipment Corporation, pp. 1-13.
Weis, et al., “Space and Time Scalability of Duplicate Detection in Graph Data”, Jun. 2007, pp. 1-28.
Henzinger, “Tutorial: Web Information Retrieval”, 2007, IEEE, pp. 1-154.
Manasse, et al., “Consistent Weighted Sampling”, 2007, pp. 7.
Yang, et al., “Near-Duplicate Detection for eRulemaking”, vol. 89, May 2005, p. 9.
Broder, et al., “Min-Wise Independent Permutations”, 1998, pp. 1-36.
Gollapudi, et al., “Exploiting Asymmetry in Hierarchical Topic Extraction”, 2006, pp. 8.
Charles Denis Xavier
Chellapilla Kumar Hemachandra
Bharadwaj Kalpana
Capitol City TechLaw
Irving Richard C.
Microsoft Corporation
Sparks Donald