Image analysis – Image transformation or preprocessing – Image storage or retrieval
Reexamination Certificate
1999-10-15
2001-10-02
Couso, Jose L. (Department: 2621)
Image analysis
Image transformation or preprocessing
Image storage or retrieval
C707S793000, C358S403000
Reexamination Certificate
active
06298174
ABSTRACT:
FIELD OF THE INVENTION
This invention relates generally to the field of information storage and retrieval, or “information visualization”. More particularly, the invention relates to a novel method for text-based information retrieval and analysis through the creation of a visual representation for complex, symbolic information. This invention also relates to a method of stored information analysis that (i) requires no human pre-structuring of the problem (ii) is subject to independent, (iii) is adaptable to multi-media information, and (iv) is constructed on a framework of visual presentation and human interaction.
DESCRIPTION OF THE PRIOR ART
Current visualization approaches demonstrate effective methods for visualizing mostly structured and/or hierarchical information such as organization charts, directories, entity-attribute relationships, and the like. Mechanisms to permit free text visualizations have not yet been perfected. The idea that open text fields themselves or raw prose might be candidates for information visualization is novel. The need to read and assess large amounts of text that is retrieved through graph theory or figural displays as “visual query” tools on document bases puts severe limits on the amount of text information that can be processed by any analyst for any purpose. At the same time, the amount of “open source” digital information is increasing exponentially. Whether it be for market analysis, global environmental assessment, international law enforcement or intelligence for national security, the analyst task is to peruse large amounts of data to detect and recognize informational ‘patterns’ and pattern irregularities across the various sources.
True text visualizations that would overcome these time and attentional constraints must represent textual content and meaning to the analyst without them having to read it in the manner that text normally requires. These visualizations would instead result from a content abstraction and spatialization of the original text document that would transform it into a new visual representation conveying information by image instead of prose.
Prior researchers have attempted to create systems for analysis of large text-based information data bases. Such systems have been built on Boolean queries, document lists and time consuming human involvement in sorting, editing and structuring. The simplification of Boolean function expressions is a particularly well-known example of prior systems. For example, in U.S. Pat. No. 5,465,308, a method and apparatus for pattern recognition utilizes a neural network to recognize two dimensional input images which are sufficiently similar to a database of previously stored two dimensional images. Images are first image processed and subjected to a Fourier transform which yields a power spectrum. An in-class to out-of-class study is performed on a typical collection of images in order to determine the most discriminatory regions of the Fourier transform. Feature vectors are input to a neural network, and a query feature vector is applied to the neural network to result in an output vector, which is subjected to statistical analysis to determine if a sufficiently high confidence level exists to indicate that a successful identification has been made.
SUMMARY OF THE INVENTION
The SPIRE (Spatial Paradigm for Information Retrieval and Exploration) software supports text-based information retrieval and analysis through the creation of a visual representation for complex, symbolic information. A primary goal of SPIRE is to provide a fundamentally new visual method for the analysis of large quantities of information. This method of analysis involves information retrieval, characterization and examination, accomplished without human pre-structuring of the problem or pre-sorting of the information to be analyzed. The process produces a visual representation of results.
More specifically, the novel process provides a method of determining and displaying the relative content and context of a number of related documents in a large document set. The relationships of a plurality of documents are presented in a three-dimensional landscape with the relative size and height of a peak in the three-dimensional landscape representing the relative significance of the relationship of a topic, or term, and the individual document in the document set. The steps of the process are:
(a) constructing an electronic database of a plurality of documents to be analyzed;
(b) creating a plurality of high dimensional vectors, one for each of the plurality of documents, such that each of the high dimensional vectors represents the relative relationship of the individual documents to the term, or topic attribute;
(c) arranging the high dimensional vectors into clusters, with each of the clusters representing a plurality of documents grouped by relative significance of their relationship to a topic attribute;
(d) calculating centroid coordinates as the center of mass of each cluster, the centroid coordinates being stored or projected in a two-dimensional plane;
(e) constructing a vector for each document, with each vector containing the distance from the document to each centroid coordinate in high-dimensional space;
(f) creating a plurality of term (or topic) layers, each of the term layers corresponding to a descriptive term (or topic) applied to each cluster, and identifying x,y coordinates for each document associated with each term layer; and
(g) creating a z coordinate associated with each term layer for each x,y coordinate by applying a smoothing function to the x,y coordinates for each document, and superimposing upon one another all of the term layers.
REFERENCES:
patent: 5301105 (1994-04-01), Cummings, Jr.
patent: 5317507 (1994-05-01), Gallant
patent: 5619709 (1997-04-01), Caid et al.
Lantrip David B.
Pennock Kelly A.
Pottier Marc C.
Schur Anne
Thomas James J.
Battelle (Memorial Institute)
Couso Jose L.
Dang Duy M.
Sterne Kessler Goldstein & Fox P.L.L.C.
LandOfFree
Three-dimensional display of document set does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Three-dimensional display of document set, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Three-dimensional display of document set will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2568056