System and method for identifying and categorizing messages...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000, C707S793000, C707S793000, C709S206000, C376S249000

active

07577656

ABSTRACT:
A system and method for identifying messages in a message store is provided. At least part of metadata associated with and at least part of content contained in each of a plurality of messages in a message store are encoded by generating a metadata sequence and a content sequence for each message. The messages are grouped into sets by similar metadata sequences and similar content sequences. The messages in each set are compared. Each such message not matching any other such message in the set is marked as a unique message. Each such message matching at least one other such message in the set is marked as an exact duplicate message. Each such message including a subset of at least one other such message in the set is marked as a near duplicate message.
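
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Message` fields, the helper names, the use of SHA-256 digests as the "sequences", the reduction of "similar" metadata to an identical normalized subject line, and the reading of "near duplicate" as a body wholly contained in another message of the same set are all assumptions made for illustration.

```python
import hashlib
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    # Hypothetical message shape; the abstract does not name specific fields.
    msg_id: str
    subject: str
    body: str


def _digest(text: str) -> str:
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def normalize_subject(subject: str) -> str:
    """Lowercase the subject and strip reply/forward prefixes (assumed rule)."""
    s = subject.strip().lower()
    changed = True
    while changed:
        changed = False
        for prefix in ("re:", "fw:", "fwd:"):
            if s.startswith(prefix):
                s = s[len(prefix):].strip()
                changed = True
    return s


def normalize_body(body: str) -> str:
    """Collapse whitespace and lowercase so trivial differences do not matter."""
    return " ".join(body.split()).lower()


def metadata_sequence(msg: Message) -> str:
    return _digest(normalize_subject(msg.subject))


def content_sequence(msg: Message) -> str:
    return _digest(normalize_body(msg.body))


def classify(messages):
    """Return {msg_id: 'unique' | 'exact_duplicate' | 'near_duplicate'}."""
    # Group messages into sets; here a set is every message that shares
    # the same metadata sequence (a simplification of "similar").
    groups = defaultdict(list)
    for msg in messages:
        groups[metadata_sequence(msg)].append(msg)

    labels = {}
    for group in groups.values():
        bodies = {m.msg_id: normalize_body(m.body) for m in group}
        digests = {m.msg_id: content_sequence(m) for m in group}
        for m in group:
            others = [o.msg_id for o in group if o.msg_id != m.msg_id]
            mine = bodies[m.msg_id]
            if any(digests[o] == digests[m.msg_id] for o in others):
                labels[m.msg_id] = "exact_duplicate"
            elif mine and any(mine in bodies[o] for o in others):
                # Assumed reading: this body is contained in a longer message.
                labels[m.msg_id] = "near_duplicate"
            else:
                labels[m.msg_id] = "unique"
    return labels
```

Comparing messages only within a set keeps the pairwise work proportional to the size of each set rather than to the whole message store, which appears to be the motivation for grouping before comparison.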
