Patent
1998-02-13
2000-07-18
Treat, William M.
Data processing: database and file management or data structures
Database design
Data structure types
382/161, G06F 7/00
active
060920653
ABSTRACT:
The present invention groups character sequences by identifying a sequence of characters. A set of internal repeats in said sequence of characters is identified by a pattern discovery technique. For at least one internal repeat belonging to the set of internal repeats, it is determined whether the internal repeat corresponds to a group of character sequences. If so, first data that identifies the sequence of characters and second data that associates the sequence of characters with the group of character sequences are stored in persistent memory. The pattern discovery mechanism discovers patterns in a sequence of characters in two phases. In a sampling phase, templates (preferably proper templates) corresponding to the sequence of characters are generated. Patterns corresponding to the templates are then generated and stored in memory. In a convolution phase, the patterns stored in memory are combined to identify a set of maximal patterns.
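The two phases described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that a "proper template" is a 0/1 mask that starts and ends with 1, that "." marks a don't-care position and never occurs in the input, and that a pattern is maximal when no more specific pattern occurs the same number of times. All function names and the parameters `num_literals` (at least 2), `max_width` and `min_support` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

WILDCARD = "."  # assumed don't-care symbol; must not appear in the input


def proper_templates(num_literals, max_width):
    """Yield 0/1 masks that start and end with 1 and hold num_literals ones."""
    for width in range(num_literals, max_width + 1):
        for picks in combinations(range(1, width - 1), num_literals - 2):
            mask = [0] * width
            mask[0] = mask[-1] = 1
            for i in picks:
                mask[i] = 1
            yield tuple(mask)


def sample(sequence, num_literals, max_width, min_support):
    """Sampling phase: elementary patterns mapped to their start offsets."""
    offsets = defaultdict(list)
    for mask in proper_templates(num_literals, max_width):
        for start in range(len(sequence) - len(mask) + 1):
            window = sequence[start:start + len(mask)]
            pattern = "".join(c if bit else WILDCARD
                              for c, bit in zip(window, mask))
            offsets[pattern].append(start)
    return {p: o for p, o in offsets.items() if len(o) >= min_support}


def literal_prefix(pattern, n):
    """Shortest prefix of pattern that holds n non-wildcard characters."""
    count = 0
    for i, c in enumerate(pattern):
        count += c != WILDCARD
        if count == n:
            return pattern[:i + 1]
    return None


def literal_suffix(pattern, n):
    """Shortest suffix of pattern that holds n non-wildcard characters."""
    rev = literal_prefix(pattern[::-1], n)
    return rev[::-1] if rev is not None else None


def convolve(elementary, num_literals, min_support):
    """Convolution phase: glue patterns whose suffix and prefix overlap."""
    by_prefix = defaultdict(list)
    for q in elementary:
        by_prefix[literal_prefix(q, num_literals - 1)].append(q)
    found = dict(elementary)
    frontier = list(elementary)
    while frontier:
        p = frontier.pop()
        overlap = literal_suffix(p, num_literals - 1)
        shift = len(p) - len(overlap)
        for q in by_prefix[overlap]:
            q_offsets = set(elementary[q])
            kept = [x for x in found[p] if x + shift in q_offsets]
            merged = p + q[len(overlap):]
            if len(kept) >= min_support and merged not in found:
                found[merged] = kept
                frontier.append(merged)
    return found


def is_covered(p, q):
    """True if p, aligned at some shift inside q, is implied by q."""
    return any(
        all(a == WILDCARD or a == b for a, b in zip(p, q[d:d + len(p)]))
        for d in range(len(q) - len(p) + 1)
    )


def discover(sequence, num_literals, max_width, min_support):
    """Run both phases and keep only the maximal patterns."""
    elementary = sample(sequence, num_literals, max_width, min_support)
    found = convolve(elementary, num_literals, min_support)
    return {
        p: o for p, o in found.items()
        if not any(q != p and len(found[q]) == len(o) and is_covered(p, q)
                   for q in found)
    }


def assign_groups(sequence_id, repeats, group_of_pattern):
    """Pair the sequence with each group whose known pattern it repeats."""
    return [(sequence_id, group_of_pattern[p])
            for p in repeats if p in group_of_pattern]
```

For example, `discover("XABCDYABCDZ", 2, 2, 2)` reports the single internal repeat `ABCD` at offsets 1 and 6, and `assign_groups` turns a repeat that matches a known group pattern into a (sequence, group) record that could then be written to persistent storage. The worklist in `convolve` can grow quickly on long inputs, so this sketch is meant for illustration only.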
REFERENCES:
patent: 5742811 (1998-04-01), Agrawal et al.
patent: 5787425 (1998-07-01), Bigus
patent: 5799268 (1998-08-01), Boguraev
patent: 5909681 (1999-06-01), Passera et al.
Rigoutsos et al., "Searching in Parallel for Similar Strings [Biological Sequences]", Computational Science and Engineering, IEEE, vol. 1, iss. 2, pp. 60-75.
Califano et al., "Flash: A Fast Look-up Algorithm for String Homology", Proceedings of the Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, pp. 353-359, Jun. 15-17, 1993.
Agrawal et al., "Mining Sequential Patterns", Proceedings of the Eleventh International Conference on Data Engineering, IEEE, pp. 3-14, Mar. 6-10, 1995.
Chen et al., "Data Mining: An Overview from a Database Perspective", Transactions on Knowledge and Data Engineering, IEEE, vol. 8, iss. 6, pp. 866-883, Dec. 1996.
Floratos, Aristidis
Rigoutsos, Isidore
International Business Machines Corporation
Sbrollini, Jay P.
Treat, William M.
Method and apparatus for discovery, clustering and classificatio