Locating meaningful stopwords or stop-phrases in...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

07409383

ABSTRACT:
A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.

REFERENCES:
patent: 6360215 (2002-03-01), Judd et al.
patent: 6477524 (2002-11-01), Taskiran et al.
patent: 6804662 (2004-10-01), Annau et al.
patent: 7039631 (2006-05-01), Finger, II
patent: 2003/0004914 (2003-01-01), McGreevy
patent: 2003/0069877 (2003-04-01), Grefenstette et al.
patent: 2003/0088562 (2003-05-01), Dillon et al.
patent: 2003/0115187 (2003-06-01), Bode et al.
patent: 2003/0233618 (2003-12-01), Wan
patent: 2004/0068697 (2004-04-01), Harik et al.
patent: 2004/0088308 (2004-05-01), Bailey et al.
patent: 2004/0215608 (2004-10-01), Gourlay
Chang et al.: Predicate Rewriting for Translating Boolean Queries in a Heterogeneous Information System, ACM Transactions on Information Systems, vol. 17, No. 1, Jan. 1999.
Co-pending U.S. Appl. No. 10/676,571, filed Sep. 30, 2003, titled “Method and Apparatus for Characterizing Documents Based on Clusters of Related Words,” Georges Harik et al., 84 page specification, 16 sheets of drawings.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Locating meaningful stopwords or stop-phrases in... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Locating meaningful stopwords or stop-phrases in..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Locating meaningful stopwords or stop-phrases in... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3998282

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.