Clustering based text classification

Data processing: artificial intelligence – Neural network – Learning task

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

10921477

ABSTRACT:
Systems and methods for clustering-based text classification are described. In one aspect text is clustered as a function of labeled data to generate cluster(s). The text includes the labeled data and unlabeled data. Expanded labeled data is then generated as a function of the cluster(s). The expanded label data includes the labeled data and at least a portion of unlabeled data. Discriminative classifier(s) are then trained based on the expanded labeled data and remaining ones of the unlabeled data.

REFERENCES:
patent: 5297042 (1994-03-01), Morita
patent: 5361628 (1994-11-01), Marko et al.
patent: 5418948 (1995-05-01), Turtle
patent: 5442778 (1995-08-01), Pedersen et al.
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5694592 (1997-12-01), Driscoll
patent: 5812134 (1998-09-01), Pooser et al.
patent: 5819258 (1998-10-01), Vaithyanathan et al.
patent: 5845278 (1998-12-01), Kirsch et al.
patent: 5987460 (1999-11-01), Niwa et al.
patent: 6003027 (1999-12-01), Prager
patent: 6006225 (1999-12-01), Bowman et al.
patent: 6112202 (2000-08-01), Kleinberg
patent: 6167398 (2000-12-01), Wyard et al.
patent: 6169986 (2001-01-01), Bowman et al.
patent: 6188776 (2001-02-01), Covell et al.
patent: 6226408 (2001-05-01), Sirosh
patent: 6298351 (2001-10-01), Castelli et al.
patent: 6400828 (2002-06-01), Covell et al.
patent: 6470307 (2002-10-01), Turney
patent: 6556983 (2003-04-01), Altschuler et al.
patent: 6560600 (2003-05-01), Broder
patent: 6615209 (2003-09-01), Gomes et al.
patent: 6628821 (2003-09-01), Covell et al.
patent: 6697998 (2004-02-01), Damerau et al.
patent: 6711585 (2004-03-01), Copperman et al.
patent: 6742003 (2004-05-01), Heckerman et al.
patent: 6760714 (2004-07-01), Caid et al.
patent: 6772120 (2004-08-01), Moreno et al.
patent: 6892193 (2005-05-01), Bolle et al.
patent: 6944602 (2005-09-01), Cristianini
patent: 7136876 (2006-11-01), Adar et al.
patent: 2002/0015366 (2002-02-01), Sawano
patent: 2002/0178153 (2002-11-01), Nishioka et al.
patent: 2003/0046389 (2003-03-01), Thieme
patent: 2003/0065632 (2003-04-01), Hubey
patent: 2003/0110181 (2003-06-01), Schuetze et al.
patent: 2003/0200198 (2003-10-01), Chandrasekar et al.
patent: 2003/0208482 (2003-11-01), Kim et al.
patent: 2003/0226100 (2003-12-01), Farahat et al.
patent: 2003/0233370 (2003-12-01), Barabas et al.
patent: 2004/0010331 (2004-01-01), Terada et al.
patent: 2004/0117189 (2004-06-01), Bennett
patent: 2004/0249808 (2004-12-01), Azzarn et al.
patent: 2005/0015366 (2005-01-01), Carrasco et al.
patent: 2005/0055321 (2005-03-01), Fratkina et al.
patent: 2005/0097188 (2005-05-01), Fish
patent: 2005/0216443 (2005-09-01), Morton et al.
patent: 10029644 (2002-01-01), None
patent: 0809197 (1997-11-01), None
patent: 1320042 (2003-06-01), None
patent: WO9749048 (1997-12-01), None
patent: WO9948028 (1999-09-01), None
CBC: clustering based text classification requiring minimal labeled data Hua-Jun Zeng; Xuan-Hui Wang; Zheng Chen; Hongjun Lu; Wei-Ying Ma; Data Mining, 2003. ICDM 2003. Third IEEE International Conference on Nov. 19-22, 2003 pp. 443-450 Digital Object Identifier 10.1109/ICDM.2003.1250951.
Labeling unclustered categorical data into clusters based on the important attribute values Hung-Leng Chen; Kun-Ta Chuang;Ming-Syan Chen; Data Mining, Fifth IEEE International Conference on Nov. 27-30, 2005 Page(s):8 pp. Digital Object Identifier 10.1109/ICDM.2005.85.
Text classification with enhanced semi-supervised fuzzy clustering Keswani, G.; Hall, L.O.; Fuzzy Systems, 2002. Fuzz-IEEE'02. Proceedings of the 2002 IEEE International Conference on vol. 1, May 12-17, 2002 pp. 621-626 Digital Object Identifier 10.1109/Fuzz.2002.1005064.
Biased clustering methods for image classification Santos, R.; Ohashi, T.; Yoshida, T.; Ejima, T.; Computer Graphics, Image Processing, and Vision, 1998. Proceedings. SIBGRAPI '98. International Symposium on Oct. 20-23, 1998 pp. 278-285 Digital Object Identifier 10.1109/SIBGRA.1998.722761.
Constrained minimun cut for classification using labeled and unlabeled data Li, C.H.; Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the IEEE Computer Society Conference on vol. 2, 2001 pp. II-597-II-602 vol. 2 Digital Object Identifier 10.1109/CVPR.2001.991017.
Clustering unlabeled data with SOMs improves classification of labeled real-world data Dara, R.; Kremer, S.C.; Stacey , D.A.; Neural Networks, 2002. IJCNN '02. Proceedings of the 2002 International Joint Conference on vol. 3, May 12-17, 2002 pp. 2237-2242 Digital Object Identifier 10.1109/IJCNN.2002.1007489.
Homotopy-Based Semi-Supervised Hidden Markov Tree Analysis Dasgupta, N.; Shihao Ji; Carin, L.; Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on vol. 2, May 14-19, 2006 pp. ll-97-ll-100
A robust semi-supervised EM-based clustering algorithm with a reject option Saint-Jean, C.; Freicot, C.; Pattern Recognition, 2002. Proceedings. 16th International Conference on vol. 3, Aug. 11-15, 2002 pp. 399-402 vol.3.
Cluster analysis using self-organizing maps and image processing techniques Costa, J.A.F.; de Andrade Netto, M.L.; Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on vol. 5, Oct. 12-15, 1999 pp. 367-372 vol. 5.
Learning with constrained and unlabelled data Lange, T.; Law, M.H.C.; Jain, A.K.; Buhmann, J.M.; Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on vol. 1, Jun. 20-25, 2005 pp. 731-738 vol. 1.
Clustering-training for Data Stream Mining Shuang Wu; Chunyu Yang; Jie Zhou; Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on Dec. 2006 pp. 653-656.
Semi-supervised image database categorization using pairwise constraints Grira, N.; Crucianu, M.; Boujemaa, N.; Image processing, 2005. ICIP 2005. IEEE International Conference on vol. 3, Sep. 11-14, 2005 pp. III-1228-III-1231.
Semi-Supervised Clustering of Corner-Oriented Attributed Graphs Jin Tang; Chunyan Zhang; Bin Luo; Hybrid Intelligent Systems, 2006. HIS '06. Sixth International Conference on Dec. 2006 pp. 33-33.
Categorization and keyword identification of unlabeled documents Ning Kang; Domeniconi, C.; Barbara, D.; Data Mining, Fifth IEEE International Conference on Nov. 27-30, 2005 p. 4 pp.
Subspace Clustering and Label Propagation for Active Feedback in Image Retrieval Tao Qin; Tie-Yan Liu; Xu-Dong Zhang; Wei-Ying Ma; Hong-Jiang Zhang; Multimedia Modelling Conference, 2005. MMM 2005. Proceedings of the 11th International Jan. 12-14, 2005 pp. 172-179.
Robust classifiers by mixed adaptation Gutfinger, D.; Sklansky, J.; Pattern Analysis and Machine Intelligence, IEEE Transactions on vol. 13, Issue 6, Jun. 1991 pp. 552-567.
Equilibrium-Based Support Vector Machine for Semisupervised Classification Lee D. ; Lee J. ; IEEE Transactions on Neural Networks : Accepted for future publication vol. pp, Issue 99, 2007 pp. 1-1.
Semi-supervised feature selection via multiobjective optimization Handl, J.; Knowles, J.; Neural Networks, 2006. IJCNN '06. International Joint Conference on Jul. 16-21, 2006 pp. 3319-3326.
A Generalized K-Means Algorithm with Semi-Supervised Weight Coefficients Morii, F.; Pattern Recognition, 2006. ICPR 2006. 18th International Conference on vol. 3, Aug. 20-24, 2006 pp. 198-201.
Efficient Margin-Based Query Learning on Action Classification Shimosaka, M.; Mori, T.; Sato, T.; Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on Oct. 2006 pp. 2778-2784.
A comparison between criterion functions for linear classifiers, with an application to neural nets Barnard, E.; Casasent, D.; Systems, Man and Cybernetics, IEEE Transactions on vol. 19, Issue 5, Sep.-Oct. 1989 pp. 1030-1041 Digital Object Identifier 10.1109/21.44018.
Boosting Classifiers for Music Genre Classification Bagci, U.; Erzin, E., Signal Processing and Communications Applications, 2006 IEEE 14thApr. 17-19, 2006 pp. 1-3 Digital Object Identifier 10.1109/SIU.2006.1659881.
Human Balance Estimation using a Wireless 3D Acceleration Sensor Network Simila, H.: Kaartinen J.; Lindholm, M.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Clustering based text classification does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Clustering based text classification, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Clustering based text classification will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3929739

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.