Method for identifying near duplicate pages in a hyperlinked dat

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707100, 34082544, G06F 1730

Patent

active

061381136

ABSTRACT:
A method is described for identifying pages that are near duplicates in a linked database. In the linked database, pages can have incoming links and outgoing links. Two pages are selected, a first page and a second page. For each selected page, the number of outgoing links is determined. The two pages are marked as near duplicates based on the number of common outgoing links for the two pages.

REFERENCES:
patent: 5241305 (1993-08-01), Fascenda et al.
patent: 5309433 (1994-05-01), Cidon et al.
patent: 5335325 (1994-08-01), Frank et al.
patent: 5345227 (1994-09-01), Fascenda et al.
patent: 5425021 (1995-06-01), Derby et al.
patent: 5483522 (1996-01-01), Derby et al.
patent: 5917424 (1999-06-01), Goldman et al.
patent: 5991809 (1999-11-01), Kriegsman

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for identifying near duplicate pages in a hyperlinked dat does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for identifying near duplicate pages in a hyperlinked dat, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for identifying near duplicate pages in a hyperlinked dat will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1974977

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.