Automatic generation of embedded signatures for duplicate...

Data processing: database and file management or data structures – Database and file access – Search engines

Reexamination Certificate

Details

C707S750000, C707S758000, C707S770000, C715S209000, C715S234000, C715S258000

active

07979413

ABSTRACT:
In accordance with an aspect of the invention, a method and system are disclosed for constructing an embedded signature in order to facilitate post-facto detection of leakage of sensitive data. The leakage detection mechanism involves: 1) identifying at least one set of words in an electronic document containing sensitive data, the set of words having a low frequency of occurrence in a first collection of electronic documents; and 2) transmitting a query to search a second collection of electronic documents for any electronic document that contains the set of words having a low frequency of occurrence. This leakage detection mechanism has at least the following advantages: a) it is tamper-resistant; b) it avoids the need to add a watermark to the sensitive data; c) it can be used to locate the sensitive data even if the leakage occurred before the embedded signature was ever identified; and d) it can be used to detect an embedded signature regardless of whether the data is being presented statically or dynamically.
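
The two steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes that "low frequency of occurrence" is measured as document frequency in the first collection, and it uses an in-memory list in place of a real search engine for the second collection. All function names (`select_signature`, `build_query`, `find_leaks`) and the `AND` query syntax are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def document_frequencies(collection):
    """Count, for each word, how many documents in the collection contain it."""
    df = Counter()
    for doc in collection:
        df.update(set(tokenize(doc)))
    return df


def select_signature(document, reference_collection, size=3):
    """Step 1: pick the `size` words of the sensitive document that are
    rarest in the reference (first) collection.

    Assumption: rarity is document frequency; ties are broken
    alphabetically so the result is deterministic.
    """
    df = document_frequencies(reference_collection)
    words = set(tokenize(document))
    ranked = sorted(words, key=lambda w: (df[w], w))
    return ranked[:size]


def build_query(signature):
    """Step 2a: turn the signature into a conjunctive query string.

    The quoted-terms-joined-by-AND syntax is a placeholder; a real
    search engine would need its own query format.
    """
    return " AND ".join(f'"{w}"' for w in signature)


def find_leaks(signature, search_collection):
    """Step 2b: stand-in for the search engine. Returns the indices of
    documents in the second collection that contain every signature word."""
    needed = set(signature)
    return [
        i for i, doc in enumerate(search_collection)
        if needed <= set(tokenize(doc))
    ]


reference = [
    "the quarterly report is due",
    "the report covers sales",
    "sales are due to rise",
]
sensitive = "the quarterly zebrafish report mentions quokka sales"
signature = select_signature(sensitive, reference, size=3)
query = build_query(signature)
hits = find_leaks(signature, [
    "sales report",
    "a blog post that mentions a quokka and a zebrafish",
    "zebrafish only",
])
```

Because the signature consists of words already present in the document, nothing is added to the sensitive data, which is consistent with advantages (b) and (c) in the abstract: a copy leaked before the signature was chosen still contains the same rare words.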

REFERENCES:
patent: 5081608 (1992-01-01), Tamura et al.
patent: 5982956 (1999-11-01), Lahmi
patent: 6470307 (2002-10-01), Turney
patent: 6820237 (2004-11-01), Abu-Hakima et al.
patent: 7184570 (2007-02-01), Rhoads
patent: 7296089 (2007-11-01), Krishnamurthy et al.
patent: 7313251 (2007-12-01), Rhoads
patent: 7369677 (2008-05-01), Petrovic et al.
patent: 7369678 (2008-05-01), Rhoads
patent: 7672971 (2010-03-01), Betz et al.
patent: 7725475 (2010-05-01), Alspector et al.
patent: 7801893 (2010-09-01), Gulli et al.
patent: 7853554 (2010-12-01), Wan
patent: 2002/0023058 (2002-02-01), Taniguchi et al.
patent: 2003/0154381 (2003-08-01), Ouye et al.
patent: 2004/0172394 (2004-09-01), Smolsky
patent: 2005/0086205 (2005-04-01), Franciosa et al.
patent: 2005/0120245 (2005-06-01), Torisaki et al.
patent: 2007/0174296 (2007-07-01), Gibbs et al.
patent: 2008/0028474 (2008-01-01), Horne et al.
patent: 2008/0228754 (2008-09-01), Frank et al.
patent: 2008/0243825 (2008-10-01), Staddon et al.
Yookyung Jo, Carl Lagoze and C. Lee Giles, “Detecting Research Topics via the Correlation Between Graphs and Texts”, KDD'07, Aug. 12-15, 2007, San Jose, California, USA (ACM 2007) (pp. 1-10).
Deepayan Chakrabarti, Ravi Kumar and Kunal Punera, “Page-Level Template Detection via Isotonic Smoothing”, International World Wide Web Conference Committee (IW3C2), WWW 2007, May 8-12, 2007, Banff, Alberta, Canada, Track: Data Mining, Session: Identifying Structure in Web Pages (pp. 61-70).
The Snow Home Page, downloaded from http://www.darkside.com.au/snow/index.html on May 29, 2008.
Brassil, J., et al., “Electronic Marking and Identification Techniques to Discourage Document Copying”, IEEE Journal on Selected Areas in Communications, v. 13, No. 8, Oct. 1995.
Low, S.H., et al., “Document Marking and Identification Using Both Line and Word Shifting”, INFOCOM'95, 14th Ann. Jt. Conf. IEEE Comp. & Comm. Soc. Apr. 2, 1995, p. 853.
