Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2005-01-25
Robinson, Greta (Department: 2177)
C707S793000, C707S793000, C707S793000, C704S001000
active
06847966
ABSTRACT:
A term-by-document matrix, which records the frequency of occurrence of each term in each document, is compiled from a corpus of documents representative of a particular subject matter. A weighted term dictionary is created using a global weighting algorithm and is then applied to the term-by-document matrix to form a weighted term-by-document matrix. A term vector matrix and a singular value concept matrix are computed by singular value decomposition of the weighted term-by-document matrix. The k largest singular concept values are kept and all others are set to zero, thereby reducing the term vector matrix and the singular value concept matrix to k concept dimensions. The reduced term vector matrix, the reduced singular value concept matrix, and the weighted term dictionary can be used to project pseudo-document vectors, representing documents that do not appear in the original document corpus, into a representative semantic space. The similarity of those documents can then be ascertained from the positions of their respective pseudo-document vectors in the representative semantic space.
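The steps named in the abstract can be sketched in Python with NumPy. This is an illustrative sketch only, not the patented implementation: the abstract does not say which global weighting algorithm is used, so the entropy-based weight below is an assumption, as are all function names, the toy corpus, and the choice k = 2. Pseudo-documents are projected with the common fold-in formula (weighted term vector times the reduced term matrix, divided by the kept singular values), which is likewise assumed rather than taken from the patent text.

```python
import numpy as np

def term_document_counts(docs, vocab):
    """Term-by-document matrix: rows are terms, columns are documents."""
    index = {term: i for i, term in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(docs)))
    for j, doc in enumerate(docs):
        for token in doc.lower().split():
            if token in index:
                counts[index[token], j] += 1.0
    return counts

def global_entropy_weights(counts):
    """One weight per term (assumed scheme): 1 + sum_j p_ij*log(p_ij) / log(n_docs).
    Requires at least two documents so that log(n_docs) is non-zero."""
    n_docs = counts.shape[1]
    totals = counts.sum(axis=1, keepdims=True)
    p = np.divide(counts, totals, out=np.zeros_like(counts), where=totals > 0)
    plogp = p * np.log(np.where(p > 0, p, 1.0))  # contributes 0 where p == 0
    return 1.0 + plogp.sum(axis=1) / np.log(n_docs)

def fit_lsi(docs, vocab, k):
    """Weight the matrix, decompose it, and keep the k largest singular values."""
    counts = term_document_counts(docs, vocab)
    weights = global_entropy_weights(counts)
    weighted = weights[:, None] * counts
    U, s, _ = np.linalg.svd(weighted, full_matrices=False)
    return weights, U[:, :k], s[:k]

def project(text, vocab, weights, U_k, s_k):
    """Pseudo-document vector for text that was not in the original corpus."""
    q = term_document_counts([text], vocab)[:, 0]
    return (weights * q) @ U_k / s_k

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

# Hypothetical toy corpus with two subject areas.
corpus = ["car engine repair", "engine car wheel",
          "bread flour oven", "oven bread yeast"]
vocab = sorted({t for d in corpus for t in d.split()})
weights, U_k, s_k = fit_lsi(corpus, vocab, k=2)

a = project("car wheel", vocab, weights, U_k, s_k)
b = project("engine repair", vocab, weights, U_k, s_k)
c = project("flour yeast", vocab, weights, U_k, s_k)
print(cosine(a, b), cosine(a, c))
```

In this toy corpus "car wheel" and "engine repair" share no terms, yet their pseudo-document vectors point the same way in the reduced space, while "flour yeast" lands in the other concept dimension. That is the behavior the abstract relies on when it compares documents by the position of their vectors rather than by literal term overlap.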
REFERENCES:
patent: 4839853 (1989-06-01), Deerwester et al.
patent: 5301109 (1994-04-01), Landauer et al.
patent: 5642518 (1997-06-01), Kiyama et al.
patent: 5675819 (1997-10-01), Schuetze
patent: 5873056 (1999-02-01), Liddy et al.
patent: 5895464 (1999-04-01), Bhandari et al.
patent: 5950189 (1999-09-01), Cohen et al.
patent: 5953718 (1999-09-01), Wical
patent: 5983237 (1999-11-01), Jain et al.
patent: 6101492 (2000-08-01), Jacquemin et al.
patent: 6138116 (2000-10-01), Kitagawa et al.
patent: 6356864 (2002-03-01), Foltz et al.
patent: 6615208 (2003-09-01), Behrens et al.
Deerwester et al., “Indexing by Latent Semantic Analysis,” Date Unknown.
Kolda, T.G. et al., “A Semidiscrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieval,” Date Unknown.
Berry, Michael, “Latent Semantic Indexing,” Date Unknown.
Foltz, Peter W., “Using Latent Semantic Indexing for Information Filtering,” Date Unknown.
Author Unknown, “Latent Semantic Indexing (LSI),” Date Unknown.
Author Unknown, “Indexing and Custering,” http://www.media.mit.edu/~emnett/research/slides/sld011.htm; Date Unknown.
Author Unknown, Slide Presentation (Slides 1-12), http://www.cs.rip,edu/~sidbel/4962/class10, Date Unknown.
Author Unknown, “Latent Semantic Indexing,” Date Unknown.
Sommer Matthew S.
Thompson Kevin B.
Baker & Botts L.L.P.
Engenium Corporation
Lewis Cheryl
Robinson Greta
Method and system for optimally searching a document...