Detecting spam documents in a phrase based information...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

07603345

ABSTRACT:
An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are the indexed according to their included phrases. A spam document is identified based on the number of related phrases included in a document.

REFERENCES:
patent: 5321833 (1994-06-01), Chang et al.
patent: 5495567 (1996-02-01), Iizawa et al.
patent: 5523946 (1996-06-01), Kaplan et al.
patent: 5696962 (1997-12-01), Kupiec
patent: 5724571 (1998-03-01), Woods
patent: 5754939 (1998-05-01), Herz et al.
patent: 5771378 (1998-06-01), Holt et al.
patent: 5826261 (1998-10-01), Spencer
patent: 5835087 (1998-11-01), Herz et al.
patent: 5915249 (1999-06-01), Spencer
patent: 5920854 (1999-07-01), Kirsch et al.
patent: 5956722 (1999-09-01), Jacobson et al.
patent: 5960383 (1999-09-01), Fleischer
patent: 5983216 (1999-11-01), Kirsch et al.
patent: 6070158 (2000-05-01), Kirsch et al.
patent: 6085186 (2000-07-01), Christianson et al.
patent: 6098034 (2000-08-01), Razin et al.
patent: 6178419 (2001-01-01), Legh-Smith et al.
patent: 6185550 (2001-02-01), Snow et al.
patent: 6185558 (2001-02-01), Bowman et al.
patent: 6298344 (2001-10-01), Inaba et al.
patent: 6349316 (2002-02-01), Fein et al.
patent: 6363377 (2002-03-01), Kravets et al.
patent: 6366911 (2002-04-01), Christy
patent: 6366933 (2002-04-01), Ball et al.
patent: 6415283 (2002-07-01), Conklin
patent: 6470307 (2002-10-01), Turney
patent: 6499030 (2002-12-01), Igata
patent: 6542888 (2003-04-01), Marques
patent: 6549895 (2003-04-01), Lai
patent: 6571240 (2003-05-01), Ho et al.
patent: 6594658 (2003-07-01), Woods
patent: 6596030 (2003-07-01), Ball et al.
patent: 6654739 (2003-11-01), Apte et al.
patent: 6684183 (2004-01-01), Korall et al.
patent: 6691106 (2004-02-01), Sathyanarayan
patent: 6741981 (2004-05-01), McGreevy
patent: 6741982 (2004-05-01), Soderstrom et al.
patent: 6741984 (2004-05-01), Zaiken et al.
patent: 6772150 (2004-08-01), Whitman et al.
patent: 6778980 (2004-08-01), Madan et al.
patent: 6820237 (2004-11-01), Abu-Hakima et al.
patent: 6823333 (2004-11-01), McGreevy
patent: 6832224 (2004-12-01), Gilmour
patent: 6839682 (2005-01-01), Blume et al.
patent: 6859800 (2005-02-01), Roche et al.
patent: 6862710 (2005-03-01), Marchisio
patent: 6886010 (2005-04-01), Kostoff
patent: 6910003 (2005-06-01), Arnold et al.
patent: 6978274 (2005-12-01), Gallivan et al.
patent: 6981040 (2005-12-01), Konig et al.
patent: 6983345 (2006-01-01), Lapir et al.
patent: 6997793 (2006-02-01), Ito
patent: 7028026 (2006-04-01), Yang et al.
patent: 7051023 (2006-05-01), Kapur et al.
patent: 7051024 (2006-05-01), Fein et al.
patent: 7058589 (2006-06-01), Leamon
patent: 7085771 (2006-08-01), Chung et al.
patent: 7089236 (2006-08-01), Stibel
patent: 7137065 (2006-11-01), Huang et al.
patent: 7139756 (2006-11-01), Cooper et al.
patent: 7149748 (2006-12-01), Stephan
patent: 7158983 (2007-01-01), Willse et al.
patent: 7171619 (2007-01-01), Bianco
patent: 7194483 (2007-03-01), Mohan et al.
patent: 7200802 (2007-04-01), Kawatani
patent: 7206389 (2007-04-01), Dumoulin et al.
patent: 7240064 (2007-07-01), Risvik et al.
patent: 7243092 (2007-07-01), Woehler et al.
patent: 7254580 (2007-08-01), Gharachorloo et al.
patent: 7263530 (2007-08-01), Hu et al.
patent: 7426507 (2008-09-01), Patterson
patent: 7454449 (2008-11-01), Plow et al.
patent: 7536408 (2009-05-01), Patterson
patent: 2001/0000356 (2001-04-01), Woods
patent: 2001/0021938 (2001-09-01), Fein et al.
patent: 2002/0042707 (2002-04-01), Zhao et al.
patent: 2002/0042793 (2002-04-01), Choi
patent: 2002/0046018 (2002-04-01), Marcu et al.
patent: 2002/0052901 (2002-05-01), Guo et al.
patent: 2002/0065857 (2002-05-01), Michalewicz et al.
patent: 2002/0078090 (2002-06-01), Hwang et al.
patent: 2002/0091671 (2002-07-01), Prokoph
patent: 2002/0138467 (2002-09-01), Jacobson et al.
patent: 2002/0143524 (2002-10-01), O'Neil et al.
patent: 2002/0147578 (2002-10-01), O'Neil et al.
patent: 2002/0174113 (2002-11-01), Kanie et al.
patent: 2002/0188587 (2002-12-01), McGreevy
patent: 2002/0188599 (2002-12-01), McGreevy
patent: 2003/0031996 (2003-02-01), Robinson
patent: 2003/0037041 (2003-02-01), Hertz
patent: 2003/0051214 (2003-03-01), Graham et al.
patent: 2003/0069877 (2003-04-01), Grefenstette et al.
patent: 2003/0078913 (2003-04-01), McGreevy
patent: 2003/0088627 (2003-05-01), Rothwell et al.
patent: 2003/0093790 (2003-05-01), Logan et al.
patent: 2003/0144995 (2003-07-01), Franz et al.
patent: 2003/0195877 (2003-10-01), Ford et al.
patent: 2004/0006736 (2004-01-01), Kawatani
patent: 2004/0034633 (2004-02-01), Rickard
patent: 2004/0052433 (2004-03-01), Henry et al.
patent: 2004/0064438 (2004-04-01), Kostoff
patent: 2004/0068396 (2004-04-01), Kawatani
patent: 2004/0133560 (2004-07-01), Simske
patent: 2004/0158580 (2004-08-01), Carmel et al.
patent: 2004/0186824 (2004-09-01), Delic et al.
patent: 2004/0186827 (2004-09-01), Anick et al.
patent: 2004/0225667 (2004-11-01), Hu
patent: 2004/0260692 (2004-12-01), Brill et al.
patent: 2005/0043940 (2005-02-01), Elder
patent: 2005/0060295 (2005-03-01), Gould et al.
patent: 2005/0060651 (2005-03-01), Anderson
patent: 2005/0071328 (2005-03-01), Lawrence
patent: 2005/0071741 (2005-03-01), Acharya et al.
patent: 2005/0154723 (2005-07-01), Liang
patent: 2005/0165778 (2005-07-01), Obata et al.
patent: 2005/0216564 (2005-09-01), Myers et al.
patent: 2005/0256848 (2005-11-01), Alpert et al.
patent: 2005/0278620 (2005-12-01), Baldwin et al.
patent: 2006/0018551 (2006-01-01), Patterson
patent: 2006/0020571 (2006-01-01), Patterson
patent: 2006/0020607 (2006-01-01), Patterson
patent: 2006/0031195 (2006-02-01), Patterson
patent: 2006/0036593 (2006-02-01), Dean et al.
patent: 2006/0106792 (2006-05-01), Patterson
patent: 2006/0143174 (2006-06-01), Dey et al.
patent: 2006/0143714 (2006-06-01), Peterson et al.
patent: 2006/0200464 (2006-09-01), Gideoni et al.
patent: 2008/0005064 (2008-01-01), Sarukkai
patent: 2008/0306943 (2008-12-01), Patterson et al.
Fetterly et al., “Detecting Phrase-Level Duplication on the World Wide Web”, SIGIR'05, Aug. 15-19, 2005, pp. 1-8.
Ntoulas et al., “Dectecting Spam Web Pages through Content Analysis”, WWW 2006, May 23-26, 2006, pp. 1-10.
Ahmed et al., “Word Stemming to Enhance Spam Filtering”, First Conference on Email and Anti-Spam (CEAS) 2004 Proceedings, Jul. 30-31, 2004, pp. 1-2, accessed online at <http://www.ceas.cc/papers-2004/167.pdf> on Sep. 30, 2008.
PCT International Search Report and Written Opinion, PCT/US06/02709, Jun. 25, 2007, 9 pages.
Examiner's First Report on Australian Patent Application No. 2005203237, Sep. 11, 2007, 2 Pages.
Examiner's First Report on Australian Patent Application No. 2005203240, Sep. 13, 2007, 2 Pages.
Examiner's First Report on Australian Patent Application No. 2005203238, Sep. 10, 2007, 2 Pages.
Examiner's First Report on Australian Patent Application No. 2005203239, Sep. 13, 2007, 2 Pages.
Caropreso, M. et al., “Statistical Phrases in Automated Text Categorization,” Internet Publication-Technical Report, May 26, 2000, pp. 1-18.
Jones, S. et al., “Topic-Based Browsing Within a Digital Library Using Keyphrases,” Proceedings of the Fourth ACM conference on Digital Libraries, Aug. 11-14, 1999, Berkeley, CA: ACM Press, (1999), 114-121.
Chang, C.T. K., et al., “Performance and Implications of Semantic Indexing in a Distributed Environment,” Proceedings of the 8th International Conference on Information Knowledge Management, New York, NY, USA (1999), 391-398.
English translation of First Office Action from the State Intellectual Property Office for Chinese Patent Application No. 200510085371.5, dated Apr. 2008.
English translation of First Office Action from the State Intellectual Property Office for Chinese Patent Application No. 200510085373.4, dated Mar. 2008.
Chen, Hsinchun et al., “Automatic Construction of Networks of Concepts Characterizing Document Databases”, IEEE Transactions on Systems, Man, and Cybernetics, vol. 22, No. 5,

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Detecting spam documents in a phrase based information... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Detecting spam documents in a phrase based information..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Detecting spam documents in a phrase based information... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4139421

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.