Electrical computers and digital processing systems: multicomput – Computer conferencing – Demand based messaging
Patent
1998-06-23
2000-12-12
Luu, Le Hien
Electrical computers and digital processing systems: multicomput
Computer conferencing
Demand based messaging
709205, 709207, 709240, 707 5, 707 6, G06F 1516, G06F 15173, G06F 1730
Patent
active
061611301
ABSTRACT:
A technique, specifically a method and apparatus that implements the method, which through a probabilistic classifier (370) and, for a given recipient, detects electronic mail (e-mail) messages, in an incoming message stream, which that recipient is likely to consider "junk". Specifically, the invention discriminates message content for that recipient, through a probabilistic classifier (e.g., a support vector machine) trained on prior content classifications. Through a resulting quantitative probability measure, i.e., an output confidence level, produced by the classifier for each message and subsequently compared against a predefined threshold, that message is classified as either, e.g., spam or legitimate mail, and, e.g., then stored in a corresponding folder (223, 227) for subsequent retrieval by and display to the recipient. Based on the probability measure, the message can alternatively be classified into one of a number of different folders, depicted in a pre-defined visually distinctive manner or simply discarded in its entirety.
REFERENCES:
patent: 5377354 (1994-12-01), Scannell et al.
patent: 5619648 (1997-04-01), Canale et al.
patent: 5638487 (1997-06-01), Chigier
patent: 5835087 (1998-11-01), Herz et al.
patent: 6003027 (1999-12-01), Prager
patent: 6023723 (2000-02-01), McCormick et al.
U.S. application No. 09/102,946, Dumais et al., filed Jun. 23, 1998.
Y.H. Lin et al, "Classification of Text Documents", Department of Computer Science and Engineering, Michigan State University, E. Lansing, Michigan, The Computer Journal, vol. 41, No. 8, 1998.
J. Takkinen et al, "CAFE: A Conceptual Model for Managing Information in Electronic Mail", Laboratory for Intelligent Information Systems, Department of Computer and Information Science, Linkoping University, Sweden, Conference on System Sciences, 1998 IEEE.
J. Palme et al, "Issues when designing filters in messaging systems", Department of Computer and Systems Sciences, Stockholm University, Royal Institute of Technology, Skeppargarten 73, S-115 30, Stockholm, Sweden, Computer Communications, 1996.
M. Iwayama et al, "Hierarchical Bayesian Clustering for Automatic Text Classification", Natural Language, 1995.
Thorsten Joachims, "Text Categorization with Support Vector Machines: Learning with Many Relevant Features", LS-8 Report 23, University of Dortmund, Computer Science Department, Nov. 1997.
Daphne Koller et al., "Hierarchically classifying documents using very few words", In ICML-97: Proceedings of the Fourteenth International Conference on Machine Learning, San Francisco, CA: Morgan Kaufmann, 1997.
Ellen Spertus, "Smokey: Automatic Recognition of Hostile Messages", Proceedings of the Conference on Innovative Applications in Artificial Intelligence (IAAI), 1997.
Hinrich Schutze et al, "A Comparison of Classifiers and Document Representations for the Routing Problem", Proceedings of the 18.sup.th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, Washington, Jul. 9-13, 1995, pp. 229-237.
Yiming Yang et al, "A Comparative Study on Feature Selection in Text Categorization", School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, and Verity, Inc., Sunnyvale, CA.
Yiming Yang et al, "An Example-Based Mapping Method for Text Categorization and Retrieval", ACM Transactions on Information Systems, vol. 12, No. 3, Jul. 1994, pp. 252-277.
David D. Lewis et al, "A Comparison of Two Learning Algorithms for Text Categorization", Third Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, Nevada, Apr. 11-13, 1994, pp. 81-93.
Mehran Sahami, "Learning Limited Dependence Bayesian Classifiers", In KDD-96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 335-338, Menlo Park, CA: AAAI Press, 1996.
William W. Cohen, "Learning Rules that Classify E-Mail", In the Proceedings of the 1996 AAAI Spring Symposium on Machine Learning in Information Access. Downloaded from William Cohen's web page: http://www.research.att.com//wwcohen/pub.html.
David D. Lewis, "An Evaluation of Phrasal and Clustered Representations on a Text Categorization Task", 15.sup.th Annual International SIGIR '92, Denmark, 1992, pp. 37-50.
Daphne Koller et al., "Toward Optimal Feature Selection", Machine Learning: Proc. of the Thirteenth International Conference, Morgan Kaufmann, 1996.
David Dolan Lewis, Ph.D., "Representation and learning in information retrieval", University of Massachusetts, 1992.
Tom M. Mitchell, "Machine Learning", Carnegie Mellon University, Bayesian Learning, Chapter 6, pp. 180-184.
Dumais Susan T.
Heckerman David E.
Horvitz Eric
Platt John C.
Sahami Mehran
Kang Paul
Luu Le Hien
Michaelson Peter L.
Microsoft Corporation
LandOfFree
Technique which utilizes a probabilistic classifier to detect "j does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Technique which utilizes a probabilistic classifier to detect "j, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Technique which utilizes a probabilistic classifier to detect "j will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-226385