Image analysis – Applications – 3-d or stereo imaging analysis
Reexamination Certificate
2001-09-26
2003-06-24
Johns, Andrew W. (Department: 2621)
Image analysis
Applications
3-d or stereo imaging analysis
C382S305000, C382S306000, C382S307000, C382S309000, C707S793000, C707S793000, C707S793000, C707S793000
Reexamination Certificate
active
06584220
ABSTRACT:
FIELD OF THE INVENTION
This invention relates generally to the field of information storage and retrieval, or “information visualization”. More particularly, the invention relates to a novel method for text-based information retrieval and analysis through the creation of a visual representation for complex, symbolic information. This invention also relates to a method of stored information analysis that (i) requires no human pre-structuring of the problem (ii) is subject independent, (iii) is adaptable to multi-media information, and (iv) is constructed on a framework of visual presentation and human interaction.
DESCRIPTION OF THE PRIOR ART
Current visualization approaches demonstrate effective methods for visualizing mostly structured and/or hierarchical information such as organization charts, directories, entity-attribute relationships, and the like. Mechanisms to permit free text visualizations have not yet been perfected. The idea that open text fields themselves or raw prose might be candidates for information visualization is novel. The need to read and assess large amounts of text that is retrieved through graph theory or figural displays as “visual query” tools on document bases puts severe limits on the amount of text information that can be processed by any analyst for any purpose. At the same time, the amount of “open source” digital information is increasing exponentially. Whether it be for market analysis, global environmental assessment, international law enforcement or intelligence for national security, the analyst task is to peruse large amounts of data to detect and recognize informational ‘patterns’ and pattern irregularities across the various sources.
True text visualizations that would overcome these time and attentional constraints must represent textual content and meaning to the analyst without them having to read it in the manner that text normally requires. These visualizations would instead result from a content abstraction and spatialization of the original text document that would transform it into a new visual representation conveying information by image instead of prose.
Prior researchers have attempted to create systems for analysis of large text-based information data bases. Such systems have been built on Boolean queries, document lists and time consuming human involvement in sorting, editing and structuring. The simplification of Boolean function expressions is a particularly well-known example of prior systems. For example, in U.S. Pat. No. 5,465,308, a method and apparatus for pattern recognition utilizes a neural network to recognize two dimensional input images which are sufficiently similar to a database of previously stored two dimensional images. Images are first image processed and subjected to a Fourier transform which yields a power spectrum. An in-class to out-of-class study is performed on a typical collection of images in order to determine the most discriminatory regions of the Fourier transform. Feature vectors are input to a neural network, and a query feature vector is applied to the neural network to result in an output vector, which is subjected to statistical analysis to determine if a sufficiently high confidence level exists to indicate that a successful identification has been made.
SUMMARY OF THE INVENTION
The SPIRE (Spatial Paradigm for Information Retrieval and Exploration) software supports text-based information retrieval and analysis through the creation of a visual representation for complex, symbolic information. A primary goal of SPIRE is to provide a fundamentally new visual method for the analysis of large quantities of information. This method of analysis involves information retrieval, characterization and examination, accomplished without human pre-structuring of the problem or pre-sorting of the information to be analyzed. The process produces a visual representation of results.
More specifically, the novel process provides a method of determining and displaying the relative content and context of a number of related documents in a large document set. The relationships of a plurality of documents are presented in a three-dimensional landscape with the relative size and height of a peak in the three-dimensional landscape representing the relative significance of the relationship of a topic, or term, and the individual document in the document set. The steps of the process are:
(a) constructing an electronic database of a plurality of documents to be analyzed;
(b) creating a plurality of high dimensional vectors, one for each of the plurality of documents, such that each of the high dimensional vectors represents the relative relationship of the individual documents to the term, or topic attribute;
(c) arranging the high dimensional vectors into clusters, with each of the clusters representing a plurality of documents grouped by relative significance of their relationship to a topic attribute;
(d) calculating centroid coordinates as the center of mass of each cluster, the centroid coordinates being stored or projected in a two-dimensional plane;
(e) constructing a vector for each document, with each vector containing the distance from the document to each centroid coordinate in high-dimensional space;
(f) creating a plurality of term (or topic) layers, each of the term layers corresponding to a descriptive term (or topic) applied to each cluster, and identifying x,y coordinates for each document associated with each term layer; and
(g) creating a z coordinate associated with each term layer for each x,y coordinate by applying a smoothing function to the x,y coordinates for each document, and superimposing upon one another all of the term layers.
REFERENCES:
patent: 5301105 (1994-04-01), Cummings, Jr.
patent: 5317507 (1994-05-01), Gallant
patent: 5619709 (1997-04-01), Caid et al.
patent: 5864855 (1999-01-01), Ruocco et al.
patent: 6298174 (2001-10-01), Lantrip et al.
patent: 6304675 (2001-10-01), Osbourn et al.
Omiecinski et al. (“Performance Analysis of a Concurrent File Reorganization Algorithm for Record Clustering”, IEEE Transactions on Knowledge and Data Engineering, vol. 6, No. 2, Apr. 1994, pp. 248-257).*
Bobbie (“Clustering Relations of Large Databases for Parallel Querying”, System Science IEEE, Proceedings of the Twenty-Seventh Annual Hawaii International Conference on System Science 1994, pp. 246-252).*
Liebeherr et al. (“The Effect of Index Partition Scheme on the Performance of Distributed Query Processing”, IEEE Transactions on Knowledge and Data Engineering, vol. 5, No. 3, Jun. 1993, pp. 510-522).
Lantrip David B.
Pennock Kelly A.
Pottier Marc C.
Schur Anne
Thomas James J.
Battelle (Memorial Institute)
Dang Duy M.
Johns Andrew W.
Milde & Hoffberg LLP
LandOfFree
Three-dimensional display of document set does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Three-dimensional display of document set, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Three-dimensional display of document set will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3090375