Information security – Access control or authentication – Network
Reexamination Certificate
2007-09-18
2007-09-18
Smithers, Matthew B (Department: 2137)
active
10454168
ABSTRACT:
The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with origination information as well as other information embedded in the body of the message that allows a recipient of the message to contact and/or respond to the sender of the message can be extracted as features. The features, or a subset thereof, can be normalized and/or deobfuscated prior to being employed as features of the machine learning systems. The (deobfuscated) features can be employed to populate a plurality of feature lists that facilitate spam detection and prevention. Exemplary features include an email address, an IP address, a URL, an embedded image pointing to a URL, and/or portions thereof.
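The kind of extraction and deobfuscation the abstract describes can be illustrated with a minimal Python sketch. It is not the patented method. The function names (`host_features`, `extract_features`), the feature prefixes (`host:`, `ip:`, `ip_prefix:`, `email:`, `email_domain:`), the regular expressions, and the choice of "portions" (domain suffixes and the first three octets of an IPv4 address) are all assumptions made for illustration.

```python
import ipaddress
import re
from urllib.parse import unquote, urlsplit

# Deliberately simple patterns; real messages need far more robust parsing.
URL_RE = re.compile(r"https?://[^\s\"'<>]+", re.IGNORECASE)
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")


def host_features(host):
    """Return features for a host: the full value plus coarser portions."""
    try:
        ipaddress.IPv4Address(host)
    except ValueError:
        labels = host.split(".")
        # Domain suffixes with at least two labels,
        # e.g. "www.example.com" and "example.com".
        return {"host:" + ".".join(labels[i:]) for i in range(len(labels) - 1)}
    # For an IPv4 address, also emit the first three octets as a coarser feature.
    return {"ip:" + host, "ip_prefix:" + host.rsplit(".", 1)[0]}


def extract_features(body):
    """Extract normalized contact features (URL hosts, IPs, e-mail addresses)."""
    features = set()
    for raw in URL_RE.findall(body):
        # urlsplit discards any "user@" prefix placed before the real host;
        # unquote and lower() undo %-escapes and mixed case.
        host = unquote(urlsplit(raw).hostname or "").lower()
        features |= host_features(host)
    # Blank out URLs so "user@host" tricks inside them are not read as e-mail.
    remainder = URL_RE.sub(" ", body)
    for address in EMAIL_RE.findall(remainder):
        local, _, domain = address.lower().rpartition("@")
        features.add("email:" + local + "@" + domain)
        features.add("email_domain:" + domain)
    return features


body = (
    "Reply to Sales@Example.ORG or visit http://%77%77%77.EXAMPLE.com/offer "
    'and <img src="http://www.bank.com@203.0.113.7/pixel.gif">'
)
features = extract_features(body)
```

In this sketch the percent-encoded host and the misleading `www.bank.com@` prefix both reduce to the values a recipient would actually contact. The resulting set could then be checked against hypothetical per-type lists (for example, a set of known spam hosts) or passed to a learning system as binary features.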
REFERENCES:
patent: 5377354 (1994-12-01), Scannell et al.
patent: 5619648 (1997-04-01), Canale et al.
patent: 5638487 (1997-06-01), Chigier
patent: 5704017 (1997-12-01), Heckerman et al.
patent: 5805801 (1998-09-01), Holloway et al.
patent: 5835087 (1998-11-01), Herz et al.
patent: 5884033 (1999-03-01), Duvall et al.
patent: 5905859 (1999-05-01), Holloway et al.
patent: 6003027 (1999-12-01), Prager
patent: 6023723 (2000-02-01), McCormick et al.
patent: 6047242 (2000-04-01), Benson
patent: 6052709 (2000-04-01), Paul
patent: 6072942 (2000-06-01), Stockwell et al.
patent: 6101531 (2000-08-01), Eggleston et al.
patent: 6112227 (2000-08-01), Heiner
patent: 6161130 (2000-12-01), Horvitz et al.
patent: 6167434 (2000-12-01), Pang
patent: 6192360 (2001-02-01), Dumais et al.
patent: 6199102 (2001-03-01), Cobb
patent: 6266692 (2001-07-01), Greenstein
patent: 6308273 (2001-10-01), Goertzel et al.
patent: 6314421 (2001-11-01), Sharnoff et al.
patent: 6321267 (2001-11-01), Donaldson
patent: 6327617 (2001-12-01), Fawcett
patent: 6330590 (2001-12-01), Cotten
patent: 6370526 (2002-04-01), Agrawal et al.
patent: 6393465 (2002-05-01), Leeds
patent: 6421709 (2002-07-01), McCormick et al.
patent: 6424997 (2002-07-01), Buskirk, Jr. et al.
patent: 6434600 (2002-08-01), Waite et al.
patent: 6453327 (2002-09-01), Nielsen
patent: 6477551 (2002-11-01), Johnson et al.
patent: 6484197 (2002-11-01), Donohue
patent: 6484261 (2002-11-01), Wiegel
patent: 6505250 (2003-01-01), Freund et al.
patent: 6546416 (2003-04-01), Kirsch
patent: 6592627 (2003-07-01), Agrawal et al.
patent: 6615242 (2003-09-01), Riemers
patent: 6633855 (2003-10-01), Auvenshine
patent: 6643686 (2003-11-01), Hall
patent: 6684201 (2004-01-01), Brill
patent: 6691156 (2004-02-01), Drummond et al.
patent: 6701440 (2004-03-01), Kim et al.
patent: 6728690 (2004-04-01), Meek et al.
patent: 6732149 (2004-05-01), Kephart
patent: 6732157 (2004-05-01), Gordon et al.
patent: 6732273 (2004-05-01), Byers
patent: 6742047 (2004-05-01), Tso
patent: 6751348 (2004-06-01), Buzuloiu et al.
patent: 6757830 (2004-06-01), Tarbotton et al.
patent: 6768991 (2004-07-01), Hearnden
patent: 6775704 (2004-08-01), Watson et al.
patent: 6779021 (2004-08-01), Bates et al.
patent: 6842773 (2005-01-01), Ralston et al.
patent: 6915334 (2005-07-01), Hall
patent: 6928465 (2005-08-01), Earnest
patent: 6971023 (2005-11-01), Makinson et al.
patent: 2001/0046307 (2001-11-01), Wong
patent: 2002/0016956 (2002-02-01), Fawcett
patent: 2002/0059425 (2002-05-01), Belfiore et al.
patent: 2002/0073157 (2002-06-01), Newman et al.
patent: 2002/0091738 (2002-07-01), Rohrabaugh et al.
patent: 2002/0184315 (2002-12-01), Earnest
patent: 2002/0199095 (2002-12-01), Bandini et al.
patent: 2003/0009698 (2003-01-01), Lindeman et al.
patent: 2003/0016872 (2003-01-01), Sun
patent: 2003/0037074 (2003-02-01), Dwork et al.
patent: 2003/0041126 (2003-02-01), Buford et al.
patent: 2003/0088627 (2003-05-01), Rothwell et al.
patent: 2003/0167311 (2003-09-01), Kirsch
patent: 2003/0200541 (2003-10-01), Cheng et al.
patent: 2003/0204569 (2003-10-01), Andrews et al.
patent: 2003/0229672 (2003-12-01), Kohn
patent: 2004/0003283 (2004-01-01), Goodman et al.
patent: 2004/0015554 (2004-01-01), Wilson
patent: 2004/0054887 (2004-03-01), Paulsen et al.
patent: 2004/0073617 (2004-04-01), Milliken et al.
patent: 2004/0093371 (2004-05-01), Burrows et al.
patent: 2004/0139160 (2004-07-01), Wallace et al.
patent: 2004/0139165 (2004-07-01), McMillan et al.
patent: 2004/0177120 (2004-09-01), Kirsch
patent: 2004/0199585 (2004-10-01), Wang
patent: 2004/0199594 (2004-10-01), Radatti et al.
patent: 2005/0015455 (2005-01-01), Liu
patent: 2006/0036701 (2006-02-01), Bulfer et al.
patent: 413 537 (1991-02-01), None
patent: 720 333 (1996-07-01), None
patent: 1376427 (2003-03-01), None
patent: 1376427 (2004-01-01), None
patent: WO96/35994 (1996-11-01), None
patent: 9967731 (1999-12-01), None
patent: WO 02/071286 (2002-09-01), None
patent: WO 2004/059506 (2004-07-01), None
Fabrizio Sebastiani. Machine Learning in Automated Text Categorization. ACM Computing Surveys, vol. 34 Issue 1, pp. 1-47, 2002.
I. Androutsopoulos, G. Paliouras, V. Karkaletsis, G. Sakkis, C.D. Spyropoulos, and P. Stamatopoulos. Learning to Filter Spam E-mail: A Comparison of a Naive Bayesian and a Memory-based Approach. 4th PKDD's Workshop on Machine Learning and Textual Information Access, 2000. 13 pages.
Thorsten Joachims. Transductive Inference for Text Classification using Support Vector Machines. Proceedings of the 16th International Conference on Machine Learning, 1999, 10 pages.
Cynthia Dwork, et al.; “Pricing Via Processing or Combatting Junk Mail”; Presented at Crypto '92; pp. 1-11.
Thorsten Joachims; “Text Categorization with Support Vector Machines: Learning with Many Relevant Features”; LS-8 Report 23, Nov. 1997, 18 pages.
Daphne Koller, et al.; “Hierarchically Classifying Documents Using Very Few Words”; In ICML-97: Proceedings of the Fourteenth International Conference on Machine Learning; San Francisco, CA: Morgan Kaufmann 1997; 9 pages.
Ellen Spertus; “Smokey: Automatic Recognition of Hostile Messages”; Proceedings of the Conference on Innovative Applications in Artificial Intelligence (IAAI), 1997, 8 pages.
Hinrich Schutze, et al.; “A Comparison of Classifiers and Document Representations for the Routing Problem”; Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, Jul. 9-13, 1995; pp. 229-237.
Yiming Yang, et al.; “A Comparative Study on Feature Selection in Text Categorization”; School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, and Verity, Inc., Sunnyvale, CA; 9 pages.
David D. Lewis, et al.; “A Comparison of Two Learning Algorithms for Text Categorization”; Third Annual Symposium on Document Analysis and Information Retrieval; Apr. 11-13, 1994; pp. 81-93.
Mehran Sahami; “Learning Limited Dependence Bayesian Classifiers”; In KDD-96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining; AAAI Press, 1996; Menlo Park, CA; pp. 335-338.
William W. Cohen; “Learning Rules that Classify E-Mail”; In the Proceedings of the 1996 AAAI Spring Symposium on Machine Learning in Information Access. Downloaded from William Cohen's web page: http://www.research.att.com/~wcohen/pubs.html.
Makoto Iwayama, et al., Hierarchical Bayesian Clustering for Automatic Text Classification, Natural Language, 1995, pp. 1322-1327.
David D. Lewis, An Evaluation of Phrasal and Clustered Representations on a Text Categorization Task, 15th Annual International SIGIR '92, Denmark 1992, pp. 37-50.
Daphne Koller, et al, Toward Optimal Feature Selection, Machine Learning Proc. of the Thirteenth International Conference, Morgan Kaufmann, 1996, 9 pages.
David Dolan Lewis, Representation and Learning in Information Retrieval, University of Massachusetts, 1992.
Tom Mitchell, Machine Learning, Carnegie Mellon University, Bayesian Learning, Chapter 6, pp. 180-184.
Y. H. Li, et al., Classification of Text Documents, The Com
Goodman Joshua T.
Gwozdz Daniel
Howell Nathan D.
Mehr John D.
Rounthwaite Robert L.
Amin Turocy & Calvin LLP
Microsoft Corporation
Smithers Matthew B