Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2005-02-09
2008-12-02
Vy, Hung T (Department: 2163)
Data processing: database and file management or data structures
Database design
Data structure types
Reexamination Certificate
active
07461056
ABSTRACT:
A method for extracting key terms and associated key terms for use in text mining is provided. The method includes receiving unstructured text documents, such as emails over a customer service system. Term candidates are extracted based on identifying consecutive word strings satisfying a context independency threshold. Term candidates are weighted using mutual information to generate a list of weighted terms. The weighted terms are then recounted. Terms are associated based on Chi-square values. Associated terms can then be used for information retrieval. A user interface can be personalized with individual user profiles.
REFERENCES:
patent: 5953718 (1999-09-01), Wical
patent: 6460034 (2002-10-01), Wical
patent: 6718325 (2004-04-01), Chandra
patent: 7043476 (2006-05-01), Robson
patent: 7240062 (2007-07-01), Andersen et al.
patent: 2003/0066025 (2003-04-01), Garner et al.
patent: 2003/0074368 (2003-04-01), Schuetze et al.
patent: 2003/0110181 (2003-06-01), Schuetze et al.
patent: 2003/0233224 (2003-12-01), Marchisio et al.
patent: 2004/0167888 (2004-08-01), Kayahara et al.
patent: 2007/0174041 (2007-07-01), Yeske
Martin, Rajman et al. , Text Mining-knowledge extraction from unstructured textual data , Mar. 31, 2004, Laboratory Computer Science Dpt Swiss Federal Institute of Technology (http://liawww.epfl.ch/Publications/Archive/RajmanBesancon98a.pdf).
Mikio Yamamoto et al., Using Suffix Arrays to compute Term frequency and Document Frequency for All substrings in a Corpus , Apr. 28, 2000 (http://research.microsoft.com/users/church/wwwfiles/CL—suffix—array.pdf).
In-Ho Kang, Query Type Classification for Web Document Retrieval, Jul. 28-Aug. 1, 2003, ACM.
“Text Data Minining With Optimized Pattern Discovery” Hiroki Arimura, Jul. 16, 2000.
“Hierarchy-Conscisous Data Structures for String Analysis” Carlo Fantozzi, Jun. 6, 2002.
“On Classification and Regression”, S. Morishita, Proc. DS'98, LNAI 1532, 1998.
L. F. Chien 1999. PAT-Tree-Based Adaptive Keyphrase Extraction for Intelligent Chinese Information Retrieval. Information Processing and Management, vol. 35, No. 4, pp. 501-521, no months, year.
M. Yamamoto and K. Church, 2001. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus. Computational Linguistics, vol. 27:1, pp. 1-30, MIT Press, no month.
Cao Yunbo
Li Hang
Martin Benjamin
Ribet Olivier
Koehler Steven M.
Microsoft Corporation
Vy Hung T
Westman Champlin & Kelly P.A.
LandOfFree
Text mining apparatus and associated methods does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Text mining apparatus and associated methods, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Text mining apparatus and associated methods will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4028994