Data processing: database and file management or data structures – Database design – Data structure types
Patent
1996-12-30
1998-07-07
Kulik, Paul V.
Data processing: database and file management or data structures
Database design
Data structure types
707 1, 707 3, G06F 1730
Patent
active
057783632
ABSTRACT:
A method is provided for specifying the representation of a document and determining the relevance of the document according to an externally defined topic profile. The topic profile includes one or more compound terms having a positive correlation with the topic of interest. Each compound term has a specified form such as capitalization, punctuation, number, or adjacency relation, that is either ignored by conventional indexing processes or requires substantial data overhead to track. The compound terms of the topic profile are tagged to indicate how corresponding terms are treated when identified in a document being analyzed. Application of the topic profile to a document generates a document representation in which compound terms present in the document are retained in their specified form. A similarity function between the document representation and the topic profile is calculated, and the result is compared to a relevance threshold associated with the topic profile. A document is deemed relevant to the topic when the similarity function meets or exceeds the threshold.
REFERENCES:
patent: 5418951 (1995-05-01), Damashek
patent: 5442778 (1995-08-01), Pedersen et al.
K.L. Kwok, "Experiments with a Component Theory of Probalistic Information Retrieval Based on Single Terms as Document Components" ACM Transactions on Information Systems, vol. 8 No. 4, Oct. 1990, pp. 363-386.
Udi Manber & Sun Wu, "Glimpse: A Tool to Search Through Entire File Systems," Oct., 1993, pp. 1-10.
Salton, "Automatic Text Processing," Ch. 8-10, 1989, Addison-Wesley, pp. 228-371.
Salton/Mc Gill, "Introduction To Modern Information Retrieval," Ch. 3-6, 1983 McGraw-Hill, pp. 53-256.
Intel Corporation
Kulik Paul V.
Novakoski Leo V.
LandOfFree
Method for measuring thresholded relevance of a document to a sp does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for measuring thresholded relevance of a document to a sp, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for measuring thresholded relevance of a document to a sp will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1217896