Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
1998-08-06
2001-10-30
Hong, Stephen S. (Department: 2176)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000
Reexamination Certificate
active
06311198
ABSTRACT:
DESCRIPTION
1. Field of the Invention
The present invention relates to a method for threading documents, and in particular to a method for efficiently threading documents, such as newspaper articles, that appear in chronological order and for representing the documents in a directed graph, and to a technology for efficiently representing and visualizing the characteristics of a set of documents by employing a directed graph.
2. Related Art
Recently, various documents have come to be distributed electronically. Ordinary users can read multiple documents, such as articles presented in newspapers and on TV news programs, related to topics in the chronological order in which they appear. Conventionally, a document clustering method is employed as the method for threading a plurality of documents. According to this method, basically, a similarity between two of the multiple documents is calculated to create a cluster. For example, for ranking type clustering, on the order of O(n
2
) calculations are required. When n is a greater number, a large calculation cost is required. If, for example, a company, such as a newspaper company, that has a database in which multiple articles are stored desires to collect information relating to a specific article using the ordinary clustering method, an unrealistic period of time of over one year is required.
SUMMARY OF THE INVENTION
It is one object of the present invention to provide a method and a system for efficiently threading a large quantity of article data (documents).
It is another object of the present invention to provide a method and a system for threading a large quantity of article data at high speed.
It is an additional object of the present invention to provide a method and a system for threading in O(n) order a large quantity of article data.
It is a further object of the present invention to provide a system for threading a large quantity of article data at high speed and for displaying a relationship among the articles so that users can easily understand it.
It is a further object of the present invention to provide a method and a system by which a user can easily access a large quantity of threaded data and understand the contents.
To achieve the above objects, according to the present invention, for threading n chronologically ordered documents, a method is provided whereby a similarity among the n documents is calculated and the similarity is employed to create a similarity matrix using time constraints, and is converted into an adjacency matrix for identifying a relationship among the n documents.
By applying the threading method that employs the time constraints, a large quantity of article data can be efficiently threaded in the O(n) order. Users can easily access the large quantity of data and can understand the contents.
REFERENCES:
patent: 5745893 (1998-04-01), Hill et al.
patent: 5895474 (1999-04-01), Maarek et al.
patent: 5930784 (1999-07-01), Hendrickson
patent: 5931907 (1999-08-01), Davies et al.
patent: 6006227 (1999-12-01), Freeman et al.
patent: 6026388 (2000-02-01), Liddy et al.
Tucker, Applied Combinatorics, 1995, John Wiley & Sons, Inc., pp. 110-111.*
Salton et al., Introduction to Modern Information Retrieval, MacGraw-Hill Book Company, pp. 215-250.*
Wilkinson, Stabillity of Reduction of a Matrix to Almost Triangular and Triangular Forms by Elementary Similarity Transformations, ACM, Jul. 1959, pp. 336-359.*
Sato et al., Fuzzy Clustering Model for Asymmetry and Self-Similarity, Fuzzy Systems, Proceedings of the Sixth IEEE Internationale Conference, Jul. 1997, pp. 963-968, vol. 2.
Takeda Koichi
Uramoto Naohiko
Drumheller Ronald L.
Hong Stephen S.
Huynh Cong-Lac
International Business Machines - Corporation
LandOfFree
Method and system for threading documents does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and system for threading documents, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for threading documents will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2563243