Method and system for optimally searching a document...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000, C707S793000, C704S001000

Reexamination Certificate

active

06847966

ABSTRACT:
A term-by-document matrix is compiled from a corpus of documents representative of a particular subject matter that represents the frequency of occurrence of each term per document. A weighted term dictionary is created using a global weighting algorithm and then applied to the term-by-document matrix forming a weighted term-by-document matrix. A term vector matrix and a singular value concept matrix are computed by singular value decomposition of the weighted term-document index. The k largest singular concept values are kept and all others are set to zero thereby reducing to the concept dimensions in the term vector matrix and a singular value concept matrix. The reduced term vector matrix, reduced singular value concept matrix and weighted term-document dictionary can be used to project pseudo-document vectors representing documents not appearing in the original document corpus in a representative semantic space. The similarities of those documents can be ascertained from the position of their respective pseudo-document vectors in the representative semantic space.

REFERENCES:
patent: 4839853 (1989-06-01), Deerwester et al.
patent: 5301109 (1994-04-01), Landauer et al.
patent: 5642518 (1997-06-01), Kiyama et al.
patent: 5675819 (1997-10-01), Schuetze
patent: 5873056 (1999-02-01), Liddy et al.
patent: 5895464 (1999-04-01), Bhandari et al.
patent: 5950189 (1999-09-01), Cohen et al.
patent: 5953718 (1999-09-01), Wical
patent: 5983237 (1999-11-01), Jain et al.
patent: 6101492 (2000-08-01), Jacquemin et al.
patent: 6138116 (2000-10-01), Kitagawa et al.
patent: 6356864 (2002-03-01), Foltz et al.
patent: 6615208 (2003-09-01), Behrens et al.
Deerwester et al., “Indexing by Latent Semantic Analysis,” Date Unknown.
Kolda, T.G. et al., “A Semidiscrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieval,” Date Unknown.
Berry, Michael, “Latent Semantic Indexing,” Date Unknown.
Foltz, Peter W., “Using Latent Semantic Indexing for Information Filtering,” Date Unknown.
Author Unknown, “Latent Semantic Indexing (LSI),” Date Unknown.
Author Unknown, “Indexing and Custering,” http://www.media.mit.edu/˜emnett/research/slides/sld011.htm; Date Unknown.
Author Unknown, Slide Presentation (Slides 1-12), http://www.cs.rip,edu/˜sidbel/4962/class10, Date Unknown.
Author Unknown, “Latent Semantic Indexing,” Date Unknown.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and system for optimally searching a document... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and system for optimally searching a document..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for optimally searching a document... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3408251

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.