Training procedure for N-gram-based statistical content...

Data processing: database and file management or data structures – Database and file access – Preparing data for information retrieval

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S758000

Reexamination Certificate

active

07917522

ABSTRACT:
A training procedure for N-gram based statistical document classification has been disclosed. In one embodiment, a set of N-grams is selected out of a second set of N-grams, each of the N-grams having a sequence of N bytes, where N is an integer. Then a statistical content classification model is generated based on occurrences of the N-grams, if any, in a set of training documents and a set of validation documents. The statistical content classification model is provided to content filters to classify content.

REFERENCES:
patent: 5678041 (1997-10-01), Baker et al.
patent: 6003030 (1999-12-01), Kenner et al.
patent: 6061692 (2000-05-01), Thomas et al.
patent: 6092038 (2000-07-01), Kanevsky et al.
patent: 6272456 (2001-08-01), de Campos
patent: 6502125 (2002-12-01), Kenner et al.
patent: 6691156 (2004-02-01), Drummond et al.
patent: 6772214 (2004-08-01), McClain et al.
patent: 6981029 (2005-12-01), Menditto et al.
patent: 7031910 (2006-04-01), Eisele
patent: 7089246 (2006-08-01), O'Laughlen
patent: 7194464 (2007-03-01), Kester et al.
patent: 2003/0225763 (2003-12-01), Guilak et al.
patent: 2005/0086252 (2005-04-01), Jones et al.
patent: 2005/0273450 (2005-12-01), McMillen et al.
patent: 0155873 (2000-01-01), None
Kareem Darwish et al., Probabilistic structured query methods, 2003, ACM, 338-344.
Chih-Ping Wei et al., A mining-based category evolution approach to managing online document categories, Aug. 7, 2002, IEEE, 1-10.
Kosinov, S., “Evaluation of N-grams conflation approach in text-based information retrieval”, 2001, IEEE, pp. 136-142.
Tsatsanis et al. “Object and texture classification using higher order statistics”, Jul. 1992, IEEE, pp. 733-750.
Notice of Allowance and Fees Due mailed Apr. 29, 2010 for U.S. Appl. No. 11/881,770, filed Jul. 27, 2007, 10 pages.
Office Action mailed Jan. 5, 2010 for U.S. Appl. No. 11/881,770, filed Jul. 27, 2007, 10 pages.
“SurfControl Web Filter”, accessed at: http://www.surfcontrol.com/products/web/ on Jun. 23, 2004, 2 pages.
“Protecting Children Online at School”, Bess Internet Filtering Products, accessed at: http://www.n2h2.com/products/bess—home/php on Jun. 23, 2004, 1 page.
How N2H2 Filtering Works, Bess Internet Filtering Products, accessed at: http://www.n2h2.com/products/bess.php?device=categories on Jun. 23, 2004, 1 page.
“Available Features”, Bess Internet Filtering Products, access at http://www.n2h2.com/products/bess.php?device=features on Jun. 23, 2004, 5 pages.
“Filtering Categories”, Bess Internet Filtering Products, accessed at: http://www.n2h2.com/products/bess.php?device=categories on Jun. 23, 2004, 8 pages.
Websense, Inc., “Websense Enterprise”, 2004, 4 pages.
“The Growing Risks of Internet Abuse”, accessed at: http://www.websense.com/products on Jun. 24, 2004, 7 pages.
Office Action mailed Aug. 19, 2009 for U.S. Appl. No. 11/881,770, filed Jul. 27, 2007, 8 pages.
“N-gram”, accessed at: http://en.wikipedia.org/wiki/Ngram on Apr. 25, 2007, 5 pages.
SurfControl Web Filtering Solutions, accessed at: http://www.surfcontrol.com/Print.aspx?id=375&mid=4 on Jun. 11, 2007, 6 pages.
Aho et al., “Efficient String Matching: An Aid to Bibliographic Search”, Association of Computing Machinery, Inc., 1975, 8 pages.
SonicWall Content Filtering Service User's Guide, 2003, pp. 1-22.
SonicWall Content Filtering Service—Standard Administrator's Guide, 2003, pp. 1-24.
SonicWall Content Filter List Administrator's Guide, 2001, pp. 1-20.
SonicWall Content Filtering Service—Premium Administrator's Guide, 2003, pp. 1-30.
“An Introduction to Filtering: What to Look for When Purchasing an Internet Filtering Solution”, A White Paper from N2H2, Inc., 2003, 14 pages.
Burke, “Content Security: The Business Value of Blocking Unwanted Content”, White Paper, 2003 IDC, #3770, 12 pages.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Training procedure for N-gram-based statistical content... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Training procedure for N-gram-based statistical content..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Training procedure for N-gram-based statistical content... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2646069

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.