Domain name geometrical classification using character-based...

Data processing: artificial intelligence – Neural network – Learning task

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

08041662

ABSTRACT:
Character-based n-grams are derived from a domain name in order to classify such domain name in pre-established categories. Domain name character-based n-grams are mapped to vector points in a multidimensional space, where the number of dimensions is the number of different n-grams that can exist for an n-character combination. The relationship between the domain name vector point and the vector points of the various other domain names is used to classify the domain name vector point. The classification system can use statistical methods using relative frequencies of character-based n-grams in various classifications as indicators. A dictionary set of character-based n-grams can be derived from one or more domain names and associated with probability indicating the likelihood that the character-based n-gram is found in a domain name of a given classification. Such probability can be an estimator of a classification of a new domain name having such character-based n-gram.

REFERENCES:
patent: 5452442 (1995-09-01), Kephart
patent: 6131082 (2000-10-01), Hargrave et al.
patent: 6266664 (2001-07-01), Russell-Falla et al.
patent: 6560596 (2003-05-01), Margulies et al.
patent: 6578032 (2003-06-01), Chandrasekar et al.
patent: 6826576 (2004-11-01), Lulich et al.
patent: 6947918 (2005-09-01), Brill
patent: 7133860 (2006-11-01), Iizuka et al.
patent: 2002/0035611 (2002-03-01), Dooley
patent: 2003/0233232 (2003-12-01), Fosler-Lussier et al.
patent: 2004/0162895 (2004-08-01), Mok et al.
patent: 2004/0167982 (2004-08-01), Cohen et al.
patent: 2005/0234953 (2005-10-01), Zhang et al.
patent: 2005/0289168 (2005-12-01), Green et al.
patent: 2006/0059337 (2006-03-01), Poyhonen et al.
patent: 2006/0089924 (2006-04-01), Raskutti et al.
patent: 2006/0095404 (2006-05-01), Adelman et al.
patent: 2006/0106866 (2006-05-01), Green et al.
patent: 2006/0112040 (2006-05-01), Oda
patent: 2006/0149710 (2006-07-01), Koningstein et al.
patent: 2006/0212142 (2006-09-01), Madani et al.
patent: 2006/0212413 (2006-09-01), Rujan et al.
patent: 2006/0229899 (2006-10-01), Hyder et al.
patent: 2006/0287988 (2006-12-01), Mason
patent: 2007/0022419 (2007-01-01), Subbarao et al.
patent: 2007/0094500 (2007-04-01), Shannon et al.
patent: 2009/0043720 (2009-02-01), Reznik et al.
patent: 08-221447 (1996-08-01), None
patent: 2000-231559 (2000-08-01), None
patent: 10-2002-0011671 (2002-02-01), None
Zhang, et al. “The Role of URLs in Objectionable Web Content Categorization”, Proc. 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 1-7.
Kanaris et al. “Spam Detection Using Character N-Grams”, SETN 2006, LNAI 3955, 2006, pp. 95-104.
Paul N. Bennett et al. “Detecting Action-Items in E-mail”, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, 2005.
Min-Yen Kan et al. “Fast webpage classification using URL features”, Proceedings of the 14th ACM international conference on Information and knowledge management, 2005.
Xiaogang Peng et al. “Automatic Web Page Classification in a Dynamic and Hierarchical Way”, IEEE International Conference on Data Mining, 2002, pp. 386-393.
Helmut Berger et al. “On the Impact of Document Representation on Classifier Performance in e-Mail Categorization”, 2005.
International Search Report for PCT/US2008/072666 mailed Jan. 21, 2009. 13 Pages.
International Search Report for PCT/US2008/072668 mailed Aug. 8, 2008, 10 Pages.
Asirvatham, et al., “Web Page Classification based on Document Structure”, retrieved from <http://www.iiit.net/students/stud—pdfs/kranthi1.pdf> on Aug. 21, 2001, 10 pages.
Zhang, et al., “The Role of URLs in Objectionable Web Content Categorization”, retrieved from <http://ieeexplore.ieee.org/iel5/4061321/4061322/04061377.pdf?isNumber=&htry=7> on Aug. 21, 2007, 7 pages.
Kan, et al., “Fast Webpage Classification using URL Features”, retrieved from <http://wing.comp.nus.edu.sg/meurlin
ustrc8—05.pdf> on Aug. 21, 2007, 9 pages.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Domain name geometrical classification using character-based... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Domain name geometrical classification using character-based..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Domain name geometrical classification using character-based... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4292603

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.