Spam identification using an algorithm based on histograms...

Electrical computers and digital processing systems: multicomput – Computer conferencing – Demand based messaging

Reexamination Certificate

Details

C709S203000, C709S213000

Reexamination Certificate

active

08001195

ABSTRACT:
A system, method and computer program product for identifying spam in email messages, including (a) identifying unique words and all their variations in the text of the email; (b) filtering noise words from the text; (c) determining how many times each unique word, or any of its morphological variations, is found in the text; (d) assigning an identifier to each unique word in the text based on the number of times the unique word is found; (e) creating a lexical vector of the text based on all the assigned identifiers; (f) generating a histogram based on the lexical vector; (g) comparing the histogram against the histograms of lexical vectors corresponding to known spam texts stored in a database; and (h) identifying the email text as spam if the histograms coincide within a certain threshold.
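
Steps (a) through (h) of the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the noise-word list, the suffix-stripping `normalize` helper, the choice of using the occurrence count itself as the identifier, the histogram-intersection `similarity` measure, and the default `threshold` are all hypothetical, since the abstract does not specify them.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative noise-word list; the abstract does not give one.
NOISE_WORDS = {"a", "an", "the", "and", "or", "of", "to", "in", "is", "for", "on", "with"}


def normalize(word):
    """Crude stand-in for morphological normalization (assumption):
    strip a few common English suffixes so variants share one form."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def lexical_vector(text):
    """Steps (a)-(e): tokenize, drop noise words, merge variants, count,
    and use each unique word's count as its identifier (assumption)."""
    words = re.findall(r"[a-z]+", text.lower())
    stems = [normalize(w) for w in words if w not in NOISE_WORDS]
    counts = Counter(stems)
    return sorted(counts.values(), reverse=True)


def histogram(vector):
    """Step (f): map each identifier to how many unique words carry it."""
    return Counter(vector)


def similarity(h1, h2):
    """Histogram intersection scaled to [0, 1] (assumed comparison measure)."""
    overlap = sum(min(h1[k], h2[k]) for k in h1.keys() | h2.keys())
    total = max(sum(h1.values()), sum(h2.values()))
    return overlap / total if total else 0.0


def is_spam(text, known_spam_histograms, threshold=0.9):
    """Steps (g)-(h): flag the text if it matches any stored spam histogram."""
    h = histogram(lexical_vector(text))
    return any(similarity(h, known) >= threshold for known in known_spam_histograms)


# Hypothetical sample texts, invented for this illustration.
spam = "Buy cheap pills now! Cheap pills, best pills. Buy now"
variant = "Buying cheap pill now! Cheap pill, best pill. Buys now"
ham = "Meeting moved to Tuesday, please confirm the agenda"

known = [histogram(lexical_vector(spam))]
print(is_spam(variant, known))  # reworded spam still matches
print(is_spam(ham, known))      # ordinary message does not
```

Because this sketch compares only the distribution of word counts, two unrelated texts with the same count profile would also match; a practical system would presumably need a richer identifier scheme than the one assumed here.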

REFERENCES:
patent: 7299261 (2007-11-01), Oliver et al.
patent: 7555523 (2009-06-01), Hartmann
