URL-based content categorization

Data processing: database and file management or data structures – Database and file access – Preparing data for information retrieval

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S782000, C707S783000, C707S784000, C707S952000

Reexamination Certificate

active

08078625

ABSTRACT:
Content may be categorized by accessing a URL associated with the content, determining a set of n-grams contained in the URL, and determining a category of the content based on the set of n-grams.

REFERENCES:
patent: 5418951 (1995-05-01), Damashek
patent: 6161130 (2000-12-01), Horvitz et al.
patent: 6199103 (2001-03-01), Sakaguchi et al.
patent: 6330590 (2001-12-01), Cotten
patent: 6421709 (2002-07-01), McCormick et al.
patent: 6507866 (2003-01-01), Barchi
patent: 6976070 (2005-12-01), Hoashi et al.
patent: 7555523 (2009-06-01), Hartmann
patent: 2002/0116463 (2002-08-01), Hart
patent: 2002/0116641 (2002-08-01), Mastrianni
patent: 2002/0181703 (2002-12-01), Logan et al.
patent: 2002/0199095 (2002-12-01), Bandini et al.
patent: 2003/0225763 (2003-12-01), Guilak et al.
patent: 2005/0022031 (2005-01-01), Goodman et al.
patent: 2006/0149710 (2006-07-01), Koningstein et al.
Author: Gary Robinson; Title: A statistical approach to the spam problem; Date: year 2003; Publisher: ACM Digital Library; vol. 2003; Pertinent pp. 1-10 (as renumbered).
A Kolcz et al., “SVM-based Filtering of E-mail Spam with Content-specific Misclassification Costs,” TextDM' 2001 (IEE ICDM-2001 Workshop on Text Mining), San Jose, CA 2001.
A. McCallum and K. Nigam. A comparison of event models for Naïve Bayes text classification. In Proceedings of theAAAI-98 Workshop on Learning for Text Categorization, 1998.
C. Lin,Formulations of support vector machines: A note from an optimization point of view, Neural Computation, 13 (2001), pp. 307-317.
F. Provost and T. Fawcett. Robust Classification for Imprecise Environments.Machine Learning, 42:203-231, 2001.
G. Sakkis, I. Androutsopoulso, G. Paliouras, V. Karkaletsis, C. Spyropoulos, and P. Stamatopoulos. Stacking Classifiers for Anti-Spam Filtering of E-Mail. In L. Lee and D. Harman editors,Proceedings of the 6thConference on Empirical Methods in Natural Language Processing(EMNLP 2001), pp. 44-50. Carnegie Mellon University, 2001.
G. Schohn and D. Cohn. Less is more:Active learning with support vector machines. In Proc. 17thInternational Conf. on Machine Learning, pp. 839-846. Morgan Kaufmann, San Francisco, CA, 2000.
G. Weiss and F. Provost. The effect of class distribution on classifier learning: an empirical study. Technical Report ML-TR-44, Department of Computer Science, Rutgers University, 2002.
H. Drucker et al., “Support Vector Machines for Spam Categorization,” IEEE Transactions on Neural Networks, vol. 10, No. 5, Sep. 1999.
H. Kaitarai. Filtering Junk E-Mail.: A Performance Comparison between Genetic Programming and Naïve Bayes. Technical Report (presented at the Fall 1999 Meeting of the CMU Text Learning Group), Department of Electrical and Computer Engineering, University of Waterloo, Nov. 1999.
http://www.paulgraham.com/better.html, Better Bayesian Filtering, Jan. 2003, pp. 1-11.
I. Androutsopoulos, G. Paliouras, V. Karkaletsis, G. Sakkis, C. Spyropoulos, and P. Stamatopoulos. Learning to Filter Spam E-Mail: A Comparison of a Naïve Bayesian and a Memory-Based Approach. In H. Zaragoza, P. Gallinari, and M. Rajman, editors,Proceedings of the Workshop on Machine Learning and Textual Information Access, 4thEuropean Conference on Principles and Practice of Knowledge Discovery in Databases(PKDD 2000), pp. 1-13. 2000.
I. Androutsopoulos, J. Koutsias, K. Chandrinos, G. Paliouras, and C. Spyropoulos. An Evaluation of Naïve Bayesian Anti-Spam Filtering. In G. Potamias, V. Moustakis, and M. van Someren, editors,Proceedings of the Workshop on Machine Learning in the New Information Age: 11thEuropean Conference on Machine Learning(EMCL 2000), pp. 9-17. 2000.
I. Androutsopoulos, J. Koutsias, K. Chandrinos, G. Paliouras, and C. Spyropoulos. An Experimental Comparison of Naïve Bayesian and Keyword-Based Anti-Spam Filtering with Encrypted Personal E-mail Messages. In N. Belkin, P. Ingwersen, and M. Leong, editors,Proceedings of the 23rdAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR 2000), pp. 160-167. 2000.
J. D. M. Rennie. Improving multi-class text classification with Naïve Bayes. Technical Report AITR-2001-004, Massachusetts Institute of Technology, 2001.
J. Diederich, J. Kindermann, E. Leopold, and G. Paass. Authorship Attribution with Support Vector Machines. InProceedings of the Learning Workshop, Snowbird, Utah, 2000.
J. G. Hidalgo, M. M. López, and E. P. Sanz. Combining Text and Heuristics for Cost-Sensitive Spam Filtering. InProceedings of the Fourth Computational Natural Language Learning Workshop, CoNLL-2000, Lisbon, Portugal, 2000. Association for Computational Linguistics.
J. Platt,Fast training of support vector machines using sequential minimal optimization, in Advances in Kernel Methods—Support Vector Learning, B. Schölkopf, C. Burges, and A. Smola, eds., MIT Press, 1999.
I. Provost. Naïve-Bayes vs . Rule-Learning in Classification of Email. Technical report, Dept. of Computer Sciences at the U. of Texas at Austin, 1999.
M. Hearst et al., Support Vector Machines, IEE Intelligent Systems, Jul./Aug. 1998.
M. Sahami, S. Dumais, D. Heckerman, and E. Horvitz. A Bayesian Approach to Filtering Junk E-Mail. InProceedings of the AAAI-98 Workshop on Learning for Text Categorization, 1998.
P. Bennett,Assessing the calibration of naïve bayes' posterior estimates, tech. report, Dept. of Computer Science, School of Science, Carnegie Mellon University 2000.
P. Domingos,MetaCost: A general method for making classifiers cost-sensitive, in Proceedings of the Fifth International Conference of Knowledge Discovery and Data Mining, ACM Press, 1999, pp. 155-164.
P. Pantel and D. Lin. SpamCop: A Spam Classification & Organization Program. InLearning for Text Categorization: Papers from the 1998 Workshop, Madison, Wisconsin, 1998. AAAI Technical Report WS-98-05.
R. Segal and J. Kephart. Incremental Learning in SwiftFile. In Proceedings of the Seventh International Conference on Machine Learning, 2000.
S. Dumais, J. Platt, D. Heckerman, and M. Sahami,Inductive learning algorithms and representations for text categorization, in Proceedings of 7thInternational Conference on Information and Knowledge Management, 1998, pp. 229-237.
S. Eyheramendy, D. Lewis, and D. Madigan. On the Naïve Bayes model for text categorization. InProceedings of the Ninth International Workshop on Artcial Intelligence and Statistics, 2003.
T. Joachims, “Text Categorization with Support Vector Machines: Learning with Many Relevant Features”, University of Dortmund, Computer Science Dept., LS-8 Report 23, 1998.
Y. Yang and J. P. Pedersen. A comparative study on feature selection in text categorization. InProceedings of the Fourteenth International Conference on Machine Learning(ICML '97), pp. 412-420, 1997.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

URL-based content categorization does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with URL-based content categorization, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and URL-based content categorization will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4303055

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.