Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-08-18
2000-08-15
Amsbury, Wayne
707/3, 707/104, G06F 17/30
active
061050237
ABSTRACT:
A system for filtering documents includes a document parser, a profile parser, and a comparator. The document parser accepts incoming documents as input and provides as output inverted lists of the terms contained in the documents. The profile parser accepts user queries as input and provides as output query nets representing the user queries. The comparator compares the inverted lists representing the documents against the query nets representing the user queries to determine whether an incoming document matches a user query. A related method for filtering incoming documents includes the steps of receiving an incoming document and parsing it to produce an inverted list of the terms contained in the incoming document. The inverted list is then used to retrieve user queries. Any user query matching fewer than a predetermined number of terms is immediately discarded. The remaining user queries are scored, and those with a score below a predetermined threshold are discarded. The user queries that remain are the queries that the incoming document matches.
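The filtering method described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names (`parse_document`, `build_profile_index`, `filter_document`), the parameters `min_terms` and `threshold`, and the scoring rule (fraction of a profile's terms found in the document) are all assumptions introduced here. In particular, the plain term-overlap score stands in for the query-net evaluation that the abstract mentions but does not detail.

```python
from collections import Counter, defaultdict


def parse_document(text):
    """Hypothetical document parser: map each term to its frequency.

    This term -> count table stands in for the inverted list of terms
    that the abstract says the document parser produces.
    """
    return Counter(text.lower().split())


def build_profile_index(profiles):
    """Map each term to the ids of the user queries (profiles) containing it.

    `profiles` is assumed to be a dict of profile id -> set of terms.
    """
    index = defaultdict(set)
    for pid, terms in profiles.items():
        for term in terms:
            index[term].add(pid)
    return index


def filter_document(text, profiles, index, min_terms=2, threshold=0.5):
    """Return {profile id: score} for the profiles the document matches."""
    doc_terms = parse_document(text)

    # Step 1: use the document's terms to retrieve candidate profiles,
    # counting how many distinct document terms each profile shares.
    matched = Counter()
    for term in doc_terms:
        for pid in index.get(term, ()):
            matched[pid] += 1

    # Step 2: immediately discard profiles matching fewer than min_terms terms.
    candidates = [pid for pid, n in matched.items() if n >= min_terms]

    # Step 3: score the survivors (assumed rule: share of profile terms
    # present in the document) and discard scores below the threshold.
    results = {}
    for pid in candidates:
        score = matched[pid] / len(profiles[pid])
        if score >= threshold:
            results[pid] = score
    return results


profiles = {
    "p1": {"inverted", "list", "filter"},
    "p2": {"patent", "law"},
    "p3": {"query", "net", "document", "stream", "parser", "profile"},
}
index = build_profile_index(profiles)
print(filter_document("The filter builds an inverted list for each document",
                      profiles, index))
# → {'p1': 1.0}
```

Here `p2` shares no terms with the document and is never retrieved, while `p3` shares only one term and is dropped by the cheap term-count check before any scoring happens. That early pruning is the point of the two-stage design: the more expensive scoring step runs only on profiles that have already cleared the minimum-match test.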
REFERENCES:
patent: 5265065 (1993-11-01), Turtle
patent: 5418948 (1995-05-01), Turtle
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5671403 (1997-09-01), Shekita et al.
patent: 5717914 (1998-02-01), Husick et al.
patent: 5737734 (1998-04-01), Schultz
patent: 5742816 (1998-04-01), Barr et al.
Allan et al., "Inquery at TREC-5," presented at The Fifth Text Retrieval (TREC-5) Conference, National Institute of Standards and Technology, Gaithersburg, Maryland, Nov. 20, 1996, 14 pages.
Callan, Jamie, "Document Filtering with Inference Networks," presented at The Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland, Aug. 18, 1996, 8 pages.
Callan et al., "The INQUERY Retrieval System," In Proceedings of the Third International Conference on Database and Expert Systems Applications, Valencia, Spain, Springer-Verlag, 1992, pp. 78-83.
Callan et al., "TIPSTER Phase 2 Activities: The University of Massachusetts at Amherst," TIPSTER Text Phase II 24-Month Workshop, Tysons Corner, Virginia, May 5, 1996, 17 pages.
"In Route Effectiveness," Center for Intelligent Information Retrieval (CIIR) Industrial Advisory Board (IAB) Meeting, University of Massachusetts, Amherst, Massachusetts, Apr. 23, 1996, 10 pages.
Salton, Gerald, "The Smart Project in Automatic Document Retrieval," in Proceedings of the Fourteenth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, Chicago, Illinois, 1991, pp. 356-358.
Brown, E., An Approach for Improving Performance in Inference Network Based Information Retrieval, Umass Technical Report 94-73, University of Massachusetts, Amherst, Massachusetts, Sep., 1994, 33 pgs.
Brown, E., Execution Performance Issues in Full-Text Information Retrieval, Ph.D. dissertation, Umass Technical Report TR95-81, University of Massachusetts, Amherst, Massachusetts, Oct., 1995, 196 pgs.
Brown, E., "Fast Evaluation of Structured Queries for Information Retrieval," presented at The 18th International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, Jul. 9, 1995, 9 pgs.
Haines, David, Adaptive Query Modification in a Probabilistic Information Retrieval Model, Ph.D. Dissertation, Umass Technical Report, TR96-61, University of Massachusetts, Amherst, Massachusetts, Sep., 1996, 177 pgs.
Kalt, T., A New Probabilistic Model of Text Classification and Retrieval, Center for Intelligent Information Retrieval Technical Report, University of Massachusetts, Amherst, Massachusetts, Jan. 29, 1996, 9 pgs.
Krovetz, Robert, Word Sense Disambiguation for Large Text Databases, Ph.D. dissertation, University of Massachusetts, Amherst, Massachusetts, May, 1995, 116 pgs.
Lu, Z., Callan, J. and Croft, W.B., Applying Inference Networks to Multiple Collection Searching, Umass Technical Report TR96-42, University of Massachusetts, Amherst, Massachusetts, Mar., 1996, 20 pgs.
Xu, J., Broglio, J. and Croft, W.B., The Design and Implementation of a Part of Speech Tagger for English, Umass Technical Report, CMPSCI TR94-26, University of Massachusetts, Amherst, Massachusetts, Aug., 1994, 14 pgs.
Amsbury Wayne
Dataware Technologies, Inc.
Havan Thu-Thao
System and method for filtering a document stream