Information retrieval and text mining using distributed...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

07152065

ABSTRACT:
The use of latent semantic indexing (LSI) for information retrieval and text mining operations is adapted to work on large heterogeneous data sets by first partitioning the data set into a number of smaller partitions having similar concept domains. A similarity graph network is generated in order to expose links between concept domains which are then exploited in determing which domains to query as well as in expanding the query vector. LSI is performed on those partitioned data sets most likely to contain information related to the user query or text mining operation. In this manner LSI can be applied to datasets that heretofore presented scalability problems. Additionally, the computation of the singular value decomposition of the term-by-document matrix can be accomplished at various distributed computers increasing the robustness of the retrieval and text mining system while decreasing search times.

REFERENCES:
patent: 4839853 (1989-06-01), Deerwester et al.
patent: 5301109 (1994-04-01), Landauer et al.
patent: 2002/0026456 (2002-02-01), Bradford
patent: 2003/0037073 (2003-02-01), Tokuda et al.
Berry, M., et al., “Using Linear Algebra for Intelligent Information Retrieval,” SIAM Review 37(4): pp. 573-595, Dec. 1994.
Steinbach, M., et al., “A Comparison of Document Clustering Techniques,” Technical Report 00-034, Department of Computer Science and Engineering, University of Minnesota, 2000.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Information retrieval and text mining using distributed... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Information retrieval and text mining using distributed..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Information retrieval and text mining using distributed... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3681284

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.