Domain name statistical classification using character-based...

Data processing: artificial intelligence – Knowledge processing system – Knowledge representation and reasoning technique

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

08005782

ABSTRACT:
Systems and methods of classifying domain names are disclosed. Character-based n-grams are derived from a domain name in order to classify such domain name in one or more categories. In one aspect, a geometrical approach is used. Domain name character-based n-grams are mapped to vector points in a multidimensional space. The relationship between a domain name vector point and vector points of other domain names is used as an indicator of the classification of the domain name vector point. In another aspect, a statistical approach is used. Relative frequencies of one or more character-based n-grams in various classifications are used as indicators. Each character-based n-gram can be associated with a respective probability that indicates a likelihood that the character-based n-gram is found in a domain name of a given classification. Such a probability can serve as an estimator of a classification of a new domain name having such character-based n-gram.

REFERENCES:
patent: 5452442 (1995-09-01), Kephart
patent: 6131082 (2000-10-01), Hargrave et al.
patent: 6266664 (2001-07-01), Russell-Falla et al.
patent: 6560596 (2003-05-01), Margulies et al.
patent: 6578032 (2003-06-01), Chandrasekar et al.
patent: 6826576 (2004-11-01), Lulich et al.
patent: 6947918 (2005-09-01), Brill
patent: 7133860 (2006-11-01), Iizuka et al.
patent: 2002/0035611 (2002-03-01), Dooley
patent: 2003/0233232 (2003-12-01), Fosler-Lussier et al.
patent: 2004/0162895 (2004-08-01), Mok et al.
patent: 2004/0167982 (2004-08-01), Cohen et al.
patent: 2005/0234953 (2005-10-01), Zhang et al.
patent: 2005/0289168 (2005-12-01), Green et al.
patent: 2006/0059337 (2006-03-01), Poyhonen et al.
patent: 2006/0089924 (2006-04-01), Raskutti et al.
patent: 2006/0095404 (2006-05-01), Adelman et al.
patent: 2006/0106866 (2006-05-01), Green et al.
patent: 2006/0112040 (2006-05-01), Oda
patent: 2006/0149710 (2006-07-01), Koningstein et al.
patent: 2006/0212142 (2006-09-01), Madani et al.
patent: 2006/0212413 (2006-09-01), Rujan et al.
patent: 2006/0229899 (2006-10-01), Hyder et al.
patent: 2006/0287988 (2006-12-01), Mason
patent: 2007/0022419 (2007-01-01), Subbarao et al.
patent: 2007/0094500 (2007-04-01), Shannon et al.
patent: 2009/0043721 (2009-02-01), Reznik et al.
patent: 08-221447 (1996-08-01), None
patent: 2000-231559 (2000-08-01), None
patent: 10-2002-0011671 (2002-02-01), None
Zhang, et al. “The Role of URLs in Objectionable Web Content Categorization”, Proc. 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 1-7.
Paul N. Bennett et al. “Detecting Action-Items in E-mail”, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, 2005.
Min-Yen Kan et al. “Fast webpage classification using URL features”, Proceedings of the 14th ACM international conference on Information and knowledge management, 2005.
Xiaogang Peng et al. “Automatic Web Page Classification in a Dynamic and Hierarchical Way”, IEEE International Conference on Data Mining, 2002, pp. 386-393.
Helmut Berger et al. “On the Impact of Document Representation on Classifier Performance in e-Mail Categorization”, 2005.
International Search Report for PCT/US2008/072668 mailed Aug. 8, 2008. 10 Pages.
International Search Report and Written Opinion for PCT/US2008/072666 mailed Jan. 21, 2009, 13 pages.
Zhang, et al. “The Role of URLs in Objectionable Web Content Categorization”, Proc. 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 1-7.
Kanaris, et al. “Spam Detection Using Character N-Grams”, SETN 2006, LNAI 3955, 2006, pp. 95-104.
International Preliminary Report on Patentability for PCT/US2008/072668 mailed Feb. 25, 2010, 6 pages.
Asirvatham, et al., “Web Page Classification based on Document Structure”, retrieved from <http://www.iiit.net/ students/stud—pdfs/kranthi1.pdf> on Aug. 21, 2007, 10 pages.
Zhang, et al., “The Role of URLs in Objectionable Web Content Categorization”, retrieved from <http://ieeexplore.ieee.org/ie15/4061321/4061322/04061377.pdf?isNumber=&htry=7> on Aug. 21, 2007, 7 pages.
Kan, et al., “Fast Webpage Classification using URL Features”, retrieved from <http://wing.comp.nus.edu.sg/meurlin/ nustrc8—05.pdf> on Aug. 21, 2007, 9 pages.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Domain name statistical classification using character-based... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Domain name statistical classification using character-based..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Domain name statistical classification using character-based... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2738022

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.