Method and apparatus for clustering a collection of linked docum

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707102, G06F 1721

Patent

active

06038574&

ABSTRACT:
The method and apparatus of the present invention generates clusters of documents in a collection of linked documents based on co-citation analysis. The frequency linkage is determined for each document in the collection. In other words, the number of times each document is linked to by another document in the collection is determined. Further, a minimum frequency linkage (link frequency threshold) is specified based on a predetermined minimum frequency of document linkage. Additionally, a list of pairs of documents that are linked to by the same document is created so that each of the pairs of documents has a count of the number of times (co-citation frequency) that they are both linked to by another document. Pairs of linked documents are clustered using a suitable co-citation technique.

REFERENCES:
patent: 5568640 (1996-10-01), Nishiyama et al.
patent: 5594897 (1997-01-01), Goffman
patent: 5675819 (1997-10-01), Schuetze
patent: 5717922 (1998-02-01), Hohensee et al.
patent: 5819258 (1998-10-01), Vaithyanathan et al.
patent: 5870552 (1999-02-01), Dozier et al.
patent: 5895470 (1999-04-01), Pirolli et al.
patent: 5920859 (1999-07-01), Li
Botafogo et al., "Structural Analysis of Hypertexts: Identifying Hierarchies and Useful Metrics", ACM Transactions on Information Systems, Vol. 10, No. 2, Apr. 1992, pps. 142-180.
Griffith et al., "The Structure of Scientific Literatures II: Toward a macro-and Miicrostructure for Science", Science Studies, 4 (1974), pps. 339-365.
Larson, R.R., "Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace", Proceedings of the 1996 American Society for Information Science Annual Meeting, pp. 1-13.
Small, H., "Co-citation in the Scientific Literature: A New Measure of the Relationship Between Two Documents", Journal of the American Society for Information Science, Jul.-Aug. 1973, pp. 265-269.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for clustering a collection of linked docum does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for clustering a collection of linked docum, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for clustering a collection of linked docum will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-179135

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.