Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-07-25
2000-02-22
Black, Thomas G.
Data processing: database and file management or data structures
Database design
Data structure types
706 45, 707104, 707501, G06F 1700
Patent
active
060291672
ABSTRACT:
A method and apparatus for retrieving similar or identical textual passages among different documents is disclosed. Normal discourse structures along with textual content attributes are used to encode a known passage with "marker sequences" that give a characterizing "signature" to the passage. The encoded known passage is then evaluated against similarly encoded passages appearing in a database of documents. If it is determined that there is a possible match between the encoded known passage and an encoded passage in a database document, a sequential string search is performed to determine whether the two passages are likely to be similar or identical. If the sequential string search records a probable match between the known passage and the database passage, the database passage is displayed for further review.
REFERENCES:
patent: 5418951 (1995-05-01), Damashek
patent: 5590317 (1996-12-01), Iguchi et al.
patent: 5649183 (1997-07-01), Berkowitz et al.
patent: 5706496 (1998-01-01), Noguchi et al.
patent: 5752051 (1998-05-01), Cohen
Black Thomas G.
Claritech Corporation
Harper Blaney
Jung David Yiuk
LandOfFree
Method and apparatus for retrieving text using document signatur does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for retrieving text using document signatur, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for retrieving text using document signatur will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-529130