System and method for filtering a document stream

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Details

707/3, 707/104, G06F 17/30

Patent

active

061050237

ABSTRACT:
A system for filtering documents includes a document parser, a profile parser, and a comparator. The document parser accepts incoming documents as input and provides as output inverted lists of the terms contained in those documents. The profile parser accepts user queries as input and provides as output query nets representing the user queries. The comparator compares the inverted lists representing the documents against the query nets representing the user queries to determine whether an incoming document matches a user query. A related method for filtering incoming documents includes the steps of receiving an incoming document and parsing it to produce an inverted list of the terms contained in the incoming document. The inverted list is then used to retrieve user queries. Any user query matching fewer than a predetermined number of terms is immediately discarded. The remaining user queries are scored, and those with a score below a predetermined threshold are discarded. The user queries that remain are the queries that the incoming document matches.
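The method in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `parse_document`, `ProfileIndex`, `min_terms` and `min_score` are hypothetical, each query net is simplified to a flat set of terms, and the score (the fraction of a query's terms found in the document) is an assumed stand-in, since the abstract does not state the scoring function.

```python
import re
from collections import Counter, defaultdict


def parse_document(text):
    """Build an inverted list for one document: term -> occurrence count."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


class ProfileIndex:
    """Stores user queries (profiles), indexed by the terms they contain."""

    def __init__(self):
        self.profiles = {}                        # profile id -> set of query terms
        self.term_to_profiles = defaultdict(set)  # term -> ids of profiles using it

    def add_profile(self, profile_id, query):
        # Assumption: a query net is reduced to a flat bag of terms.
        terms = set(parse_document(query))
        self.profiles[profile_id] = terms
        for term in terms:
            self.term_to_profiles[term].add(profile_id)

    def filter(self, text, min_terms=2, min_score=0.5):
        """Return {profile id: score} for the profiles the document matches."""
        doc_terms = parse_document(text)

        # Use the document's inverted list to retrieve candidate profiles,
        # counting how many distinct query terms each one shares with it.
        hits = Counter()
        for term in doc_terms:
            for profile_id in self.term_to_profiles.get(term, ()):
                hits[profile_id] += 1

        matches = {}
        for profile_id, matched in hits.items():
            # Discard profiles matching fewer than the required number of terms.
            if matched < min_terms:
                continue
            # Assumed score: fraction of the profile's terms present in the document.
            score = matched / len(self.profiles[profile_id])
            # Discard profiles scoring below the threshold.
            if score >= min_score:
                matches[profile_id] = score
        return matches


index = ProfileIndex()
index.add_profile("alice", "inference network retrieval")
index.add_profile("bob", "patent law")
index.add_profile("carol", "network security firewall intrusion detection")

document = "An inference network model for document retrieval and network routing"
print(index.filter(document))
```

The term-count check runs before scoring, so a profile that shares only one incidental term with the document is dropped without being scored at all, which mirrors the order of steps in the abstract.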

REFERENCES:
patent: 5265065 (1993-11-01), Turtle
patent: 5418948 (1995-05-01), Turtle
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5671403 (1997-09-01), Shekita et al.
patent: 5717914 (1998-02-01), Husick et al.
patent: 5737734 (1998-04-01), Schultz
patent: 5742816 (1998-04-01), Barr et al.
Allan et al., "Inquery at TREC-5," presented at The Fifth Text Retrieval (TREC-5) Conference, National Institute of Standards and Technology, Gaithersburg, Maryland, Nov. 20, 1996, 14 pages.
Callan, Jamie, "Document Filtering with Inference Networks," presented at The Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland, Aug. 18, 1996, 8 pages.
Callan et al., "The INQUERY Retrieval System," In Proceedings of the Third International Conference on Database and Expert Systems Applications, Valencia, Spain, Springer-Verlag, 1992, pp. 78-83.
Callan et al., "TIPSTER Phase 2 Activities: The University of Massachusetts at Amherst," TIPSTER Text Phase II 24-Month Workshop, Tysons Corner, Virginia, May 5, 1996, 17 pages.
"In Route Effectiveness," Center for Intelligent Information Retrieval (CIIR) Industrial Advisory Board (IAB) Meeting, University of Massachusetts, Amherst, Massachusetts, Apr. 23, 1996, 10 pages.
Salton, Gerald, "The Smart Project in Automatic Document Retrieval," in Proceedings of the Fourteenth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, Chicago, Illinois, 1991, pp. 356-358.
Brown, E., An Approach for Improving Performance in Inference Network Based Information Retrieval, Umass Technical Report 94-73, University of Massachusetts, Amherst, Massachusetts, Sep., 1994, 33 pgs.
Brown, E., Execution Performance Issues in Full-Text Information Retrieval, Ph.D. dissertation, Umass Technical Report TR95-81, University of Massachusetts, Amherst, Massachusetts, Oct., 1995, 196 pgs.
Brown, E., "Fast Evaluation of Structured Queries for Information Retrieval," presented at The 18th International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, Jul. 9, 1995, 9 pgs.
Haines, David, Adaptive Query Modification in a Probabilistic Information Retrieval Model, Ph.D. Dissertation, Umass Technical Report, TR96-61, University of Massachusetts, Amherst, Massachusetts, Sep., 1996, 177 pgs.
Kalt, T., A New Probabilistic Model of Text Classification and Retrieval, Center for Intelligent Information Retrieval Technical Report, University of Massachusetts, Amherst, Massachusetts, Jan. 29, 1996, 9 pgs.
Krovetz, Robert, Word Sense Disambiguation for Large Text Databases, Ph.D. dissertation, University of Massachusetts, Amherst, Massachusetts, May, 1995, 116 pgs.
Lu, Z., Callan, J. and Croft, W.B., Applying Inference Networks to Multiple Collection Searching, Umass Technical Report TR96-42, University of Massachusetts, Amherst, Massachusetts, Mar., 1996, 20 pgs.
Xu, J., Broglio, J. and Croft, W.B., The Design and Implementation of a Part of Speech Tagger for English, Umass Technical Report, CMPSCI TR94-26, University of Massachusetts, Amherst, Massachusetts, Aug., 1994, 14 pgs.
