Data processing: database and file management or data structures – Database design – Data structure types
Patent
1998-08-10
2000-10-24
Ho, Ruay Lian
Data processing: database and file management or data structures
Database design
Data structure types
707100, 34082544, G06F 1730
Patent
active
061381136
ABSTRACT:
A method is described for identifying pages that are near duplicates in a linked database. In the linked database, pages can have incoming links and outgoing links. Two pages are selected, a first page and a second page. For each selected page, the number of outgoing links is determined. The two pages are marked as near duplicates based on the number of common outgoing links for the two pages.
REFERENCES:
patent: 5241305 (1993-08-01), Fascenda et al.
patent: 5309433 (1994-05-01), Cidon et al.
patent: 5335325 (1994-08-01), Frank et al.
patent: 5345227 (1994-09-01), Fascenda et al.
patent: 5425021 (1995-06-01), Derby et al.
patent: 5483522 (1996-01-01), Derby et al.
patent: 5917424 (1999-06-01), Goldman et al.
patent: 5991809 (1999-11-01), Kriegsman
Dean Jeffrey
Henzinger Monika R.
AltaVista Company
Ho Ruay Lian
LandOfFree
Method for identifying near duplicate pages in a hyperlinked dat does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for identifying near duplicate pages in a hyperlinked dat, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for identifying near duplicate pages in a hyperlinked dat will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1974977