Data processing: database and file management or data structures – Database design – Data structure types
Patent
1998-03-18
2000-03-14
Homere, Jean R.
Data processing: database and file management or data structures
Database design
Data structure types
707102, G06F 1721
Patent
active
06038574&
ABSTRACT:
The method and apparatus of the present invention generates clusters of documents in a collection of linked documents based on co-citation analysis. The frequency linkage is determined for each document in the collection. In other words, the number of times each document is linked to by another document in the collection is determined. Further, a minimum frequency linkage (link frequency threshold) is specified based on a predetermined minimum frequency of document linkage. Additionally, a list of pairs of documents that are linked to by the same document is created so that each of the pairs of documents has a count of the number of times (co-citation frequency) that they are both linked to by another document. Pairs of linked documents are clustered using a suitable co-citation technique.
REFERENCES:
patent: 5568640 (1996-10-01), Nishiyama et al.
patent: 5594897 (1997-01-01), Goffman
patent: 5675819 (1997-10-01), Schuetze
patent: 5717922 (1998-02-01), Hohensee et al.
patent: 5819258 (1998-10-01), Vaithyanathan et al.
patent: 5870552 (1999-02-01), Dozier et al.
patent: 5895470 (1999-04-01), Pirolli et al.
patent: 5920859 (1999-07-01), Li
Botafogo et al., "Structural Analysis of Hypertexts: Identifying Hierarchies and Useful Metrics", ACM Transactions on Information Systems, Vol. 10, No. 2, Apr. 1992, pps. 142-180.
Griffith et al., "The Structure of Scientific Literatures II: Toward a macro-and Miicrostructure for Science", Science Studies, 4 (1974), pps. 339-365.
Larson, R.R., "Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace", Proceedings of the 1996 American Society for Information Science Annual Meeting, pp. 1-13.
Small, H., "Co-citation in the Scientific Literature: A New Measure of the Relationship Between Two Documents", Journal of the American Society for Information Science, Jul.-Aug. 1973, pp. 265-269.
Card Stuart K.
Mackinlay Jock D.
Pirolli Peter L.
Pitkow James E.
Dominko Richard B.
Homere Jean R.
Xerox Corporation
LandOfFree
Method and apparatus for clustering a collection of linked docum does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for clustering a collection of linked docum, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for clustering a collection of linked docum will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-179135