Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2005-01-11
2005-01-11
Corrielus, Jean M. (Department: 2172)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000, C706S012000, C706S014000, C706S020000
Reexamination Certificate
active
06842751
ABSTRACT:
A data classification method and apparatus are disclosed for labeling unknown objects. The disclosed data classification system employs a model selection technique that characterizes domains and identifies the degree of match between the domain meta-features and the learning bias of the algorithm under analysis. An improved concept variation meta-feature or an average weighted distance meta-feature, or both, are used to fully discriminate learning performance, as well as conventional meta-features. The “concept variation” meta-feature measures the amount of concept variation or the degree of lack of structure of a concept. The present invention extends conventional notions of concept variation to allow for numeric and categorical features, and estimates the variation of the whole example population through a training sample. The “average weighted distance” meta-feature of the present invention measures the density of the distribution in the training set. While the concept variation meta-feature is high for a training set comprised of only two examples having different class labels, the average weighted distance can distinguish between examples that are too far apart or too close to one other.
REFERENCES:
patent: 5465321 (1995-11-01), Smyth
patent: 5742738 (1998-04-01), Koza et al.
patent: 5835901 (1998-11-01), Duvoisin et al.
patent: 5884294 (1999-03-01), Kadar et al.
patent: 5970482 (1999-10-01), Pham et al.
patent: 6058385 (2000-05-01), Koza et al.
patent: 6301579 (2001-10-01), Becker
patent: 6356884 (2002-03-01), Thaler
patent: 6728689 (2004-04-01), Drissi et al.
“Information Extraction as a Bisis for High-Precision Text Classification”—Ellen Riloff and Wendy Lehnert—ACM Tranaction on Information Systems, vol. 12, No. 3, Jul. 1994, (pps: 296-333).
Rish Irina
Vilalta Ricardo
Ly Anh
Percello, Esq. Louis J.
Ryan & Mason & Lewis, LLP
LandOfFree
Methods and apparatus for selecting a data classification... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Methods and apparatus for selecting a data classification..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Methods and apparatus for selecting a data classification... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3399073