Text mining apparatus and associated methods

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

07461056

ABSTRACT:
A method for extracting key terms and associated key terms for use in text mining is provided. The method includes receiving unstructured text documents, such as emails over a customer service system. Term candidates are extracted based on identifying consecutive word strings satisfying a context independency threshold. Term candidates are weighted using mutual information to generate a list of weighted terms. The weighted terms are then recounted. Terms are associated based on Chi-square values. Associated terms can then be used for information retrieval. A user interface can be personalized with individual user profiles.

REFERENCES:
patent: 5953718 (1999-09-01), Wical
patent: 6460034 (2002-10-01), Wical
patent: 6718325 (2004-04-01), Chandra
patent: 7043476 (2006-05-01), Robson
patent: 7240062 (2007-07-01), Andersen et al.
patent: 2003/0066025 (2003-04-01), Garner et al.
patent: 2003/0074368 (2003-04-01), Schuetze et al.
patent: 2003/0110181 (2003-06-01), Schuetze et al.
patent: 2003/0233224 (2003-12-01), Marchisio et al.
patent: 2004/0167888 (2004-08-01), Kayahara et al.
patent: 2007/0174041 (2007-07-01), Yeske
Martin, Rajman et al. , Text Mining-knowledge extraction from unstructured textual data , Mar. 31, 2004, Laboratory Computer Science Dpt Swiss Federal Institute of Technology (http://liawww.epfl.ch/Publications/Archive/RajmanBesancon98a.pdf).
Mikio Yamamoto et al., Using Suffix Arrays to compute Term frequency and Document Frequency for All substrings in a Corpus , Apr. 28, 2000 (http://research.microsoft.com/users/church/wwwfiles/CL—suffix—array.pdf).
In-Ho Kang, Query Type Classification for Web Document Retrieval, Jul. 28-Aug. 1, 2003, ACM.
“Text Data Minining With Optimized Pattern Discovery” Hiroki Arimura, Jul. 16, 2000.
“Hierarchy-Conscisous Data Structures for String Analysis” Carlo Fantozzi, Jun. 6, 2002.
“On Classification and Regression”, S. Morishita, Proc. DS'98, LNAI 1532, 1998.
L. F. Chien 1999. PAT-Tree-Based Adaptive Keyphrase Extraction for Intelligent Chinese Information Retrieval. Information Processing and Management, vol. 35, No. 4, pp. 501-521, no months, year.
M. Yamamoto and K. Church, 2001. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus. Computational Linguistics, vol. 27:1, pp. 1-30, MIT Press, no month.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Text mining apparatus and associated methods does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Text mining apparatus and associated methods, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Text mining apparatus and associated methods will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4028994

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.