Dynamic content organization in information retrieval systems

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

C707S793000, C707S793000, C707S793000, C705S035000, C705S002000, C706S045000

active

06236987

ABSTRACT:

BACKGROUND
1. Field of Invention
The present invention relates generally to information retrieval systems and methods, and more particularly, to the dynamic organization of content retrieved in response to user input queries.
2. Background of the Invention
Conventional information retrieval systems typically support one of two query paradigms, topic navigation or full text retrieval, or a limited combination of both. In a full text retrieval system, queries containing any keywords are processed to produce documents or other content that contain these keywords (or their synonyms and other variants) or that otherwise best satisfy the query. Typically, the output content is organized as a simple list, arranged alphabetically, chronologically, or by some other sort criterion. These types of information retrieval systems are common in every type of information domain, such as document management systems, library catalogs, search engines for the World Wide Web, relational databases, and the like.
The problem with this type of query and retrieval paradigm is that it fails to provide to the user a useful arrangement of the returned set of documents and content in terms of the meaning or nature of the content itself. More particularly, it fails to organize the content according to a set of topics pertinent to the returned content. The lack of a topic organization makes it difficult for the user to evaluate the overall query results, and to further navigate or explore the search results for content of interest. This problem is especially significant when dealing with novice or casual users of an information database. These users are unlikely to specify their queries with a high degree of precision, and are also unlikely to know the range and variety of different types of documents available in the database. The absence of a topic arrangement of query results makes it difficult for such users to explore both the documents that satisfy the query, and other documents which may be of interest but which did not satisfy the original query. At best, full text systems allow the user to refine or generalize the query by conjoining or disjoining additional keywords to the original query. However, the problem remains that the resulting documents will have no topic arrangement.
To overcome these types of problems, topic based query systems have been employed. In a topic system, a collection of documents is organized under a hierarchy of topics and subtopics. Each topic is associated with a number of documents that are about that topic. The user navigates the topic hierarchy in a strictly linear fashion from topic to subtopic. When a topic of interest is found, the user can review the documents associated with that topic.
The problem with this type of information retrieval system is that the selection of topics is unlikely to include topics that match every user's potential interests. In particular, users often search for documents that satisfy two or more unrelated concepts which have no equivalent topic in the topic hierarchy. For example, a general purpose document collection may contain groups of topics such as:
Topic        Subtopics . . .
Art          American
             Ancient Art
             Asian
             . . .
             Museums      America
                          Asia
                          Europe       Louvre
                          . . .
and
Animals      Mammals
             Insects
             Reptiles     Crocodiles
                          Frogs
                          Snakes
                          . . .
Each of these topics would be associated with its own set of documents, which may or may not overlap with the documents associated with other topics. The user is typically constrained to view documents under a single topic at a time. However, the user may have an interest in finding documents that are about both art museums and snakes. Since the topic hierarchy does not contain this precise intersection of topics, the user is unable to easily locate documents of interest, and must instead review all of the documents associated with “museums” and separately all of the documents associated with “snakes” to determine if any of them match this particular combination of topics.
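The combination the user wants can be pictured as a set intersection over per-topic document sets. The following is a minimal sketch only: the document ids, the `docs_by_topic` mapping, and the `intersect_topics` helper are hypothetical names introduced for illustration and do not come from the specification.

```python
# Hypothetical data: topic name -> ids of documents associated with that topic.
# All ids are invented for illustration.
docs_by_topic = {
    "Museums": {"d1", "d2", "d3", "d7"},
    "Snakes": {"d3", "d5", "d7", "d9"},
    "Louvre": {"d2", "d7"},
}


def intersect_topics(docs_by_topic, *topics):
    """Return the documents associated with every one of the given topics."""
    # An unknown topic contributes an empty set, so the result is empty.
    sets = [docs_by_topic.get(topic, set()) for topic in topics]
    return set.intersection(*sets) if sets else set()


both = intersect_topics(docs_by_topic, "Museums", "Snakes")
print(sorted(both))  # ['d3', 'd7']
```

A conventional topic browser exposes only the individual sets, so the user ends up performing this intersection by hand while reading both lists.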
One reason for this deficiency of conventional topic based systems is that the user is unable to specify a query which is the intersection of multiple topics in the topic hierarchy. For a topic hierarchy containing N topics, the number of possible topic intersections grows exponentially with N, on the order of 2^N when each intersection is counted as an unordered combination of topics. Since the more useful topic hierarchies will have hundreds or thousands of topics, it is computationally infeasible to determine a priori every possible topic intersection and which documents are associated with each intersection.
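To make the scale concrete, the sketch below counts intersections under the assumption that an intersection is an unordered combination of two or more topics, which gives 2^N - N - 1 combinations for N topics. The function names are hypothetical and the counting convention is an assumption, not something the specification defines; whatever the exact count, the growth is far too fast to precompute.

```python
from itertools import combinations
from math import comb


def count_intersections(n, min_size=2):
    """Count unordered combinations of at least min_size topics out of n."""
    return sum(comb(n, k) for k in range(min_size, n + 1))


def enumerate_intersections(topics, min_size=2):
    """Yield every combination of at least min_size topics (small inputs only)."""
    for k in range(min_size, len(topics) + 1):
        yield from combinations(topics, k)


print(count_intersections(3))    # 4
print(count_intersections(10))   # 1013
print(count_intersections(100) == 2**100 - 101)  # True
```

Enumerating is practical for a handful of topics, as in `list(enumerate_intersections(["Art", "Museums", "Snakes"]))`, but not for a hierarchy with hundreds of entries.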
Other systems provide a combination of topic and full text retrieval. In these systems, a full text query is processed to identify various topics in the topic hierarchy that match the query, or portions of it, and these topics and their documents are displayed to the user. However, if the located topics are not actually what the user is interested in, then a new query must be specified, and the process repeated. The user has no ability to modify the topics of the query directly to obtain a more refined intersection of topics, again due to the problem of the large number of topic intersections.
Accordingly, it is desirable to provide a system and method of query analysis and information retrieval that dynamically generates a topic organization of the content located in response to a user query, allowing for navigation and exploration of that content. Further, it is desirable to provide a system that offers the flexibility of full text retrieval in its ability to generalize and refine a search, and the organizational benefits of navigation and querying in a topic hierarchy.
SUMMARY OF THE INVENTION
The present invention overcomes the limitations of conventional information retrieval systems and methods by combining the refinement and generalization capabilities of a full text retrieval system with the navigational benefits of a topic hierarchy. In particular, the present invention allows for conceptual navigation through a topic hierarchy using queries with arbitrarily complex topic intersections, and by allowing the user to iteratively modify the topics or keywords used to query the topic hierarchy. The present invention further provides a variety of different topic arrangements and organizes the content resulting from a current query into these arrangements, which enable the user to easily explore the content of a document collection using concepts and ideas, and not merely keywords.
In one aspect, an information retrieval system and method in accordance with the present invention operates upon a document collection of documents of any type, including text, graphics, video, audio, multimedia and any other form of computer readable information. Each document is associated with one or more topics. The topics have arbitrary semantic relationships with one another, particularly including topic-subtopic relationships, where a subtopic is a semantic refinement of a topic. The information retrieval system receives a current query including various query terms. The current query may be an actual query input by a user, or a modification or expansion of a user query. The query terms may be any keywords including topic terms.
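The data model described in this paragraph can be sketched roughly as follows. This is an illustrative assumption, not the patented implementation: the `Document` class, the `PARENT` map, the sample documents, and the naive whole-word matching rule are all hypothetical. A query term matches a document if it appears as a word in the text, or names one of the document's topics or an ancestor of one of them.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    doc_id: str
    text: str
    topics: set = field(default_factory=set)


# Hypothetical subtopic -> parent topic relationships.
PARENT = {"Museums": "Art", "Louvre": "Museums",
          "Reptiles": "Animals", "Snakes": "Reptiles"}


def ancestors(topic):
    """Yield the ancestors of a topic, nearest first."""
    while topic in PARENT:
        topic = PARENT[topic]
        yield topic


def matches(doc, term):
    """True if term is a word of the text, a topic of doc, or an ancestor topic."""
    term = term.lower()
    if term in doc.text.lower().split():
        return True
    return any(
        name.lower() == term
        for topic in doc.topics
        for name in (topic, *ancestors(topic))
    )


def select(collection, query_terms):
    """Return the documents that match every query term."""
    return [d for d in collection if all(matches(d, q) for q in query_terms)]


docs = [
    Document("d1", "Snake motifs in the Egyptian galleries", {"Louvre", "Snakes"}),
    Document("d2", "Opening hours and tickets", {"Louvre"}),
    Document("d3", "Caring for pet snakes", {"Snakes"}),
]
print([d.doc_id for d in select(docs, ["museums", "snakes"])])  # ['d1']
```

Because topic terms are resolved through the hierarchy, the query term "museums" reaches a document filed only under "Louvre", which a plain keyword match on the text would miss.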
The current query is processed to select an initial set of documents from the entirety of the document collection. In accordance with the present invention, the information retrieval system organizes the set of documents according to the various topics associated with the documents contained therein into a dynamically created topic arrangement of topics and related subtopics. The topic arrangement generally organizes the documents in the set by selecting a set of topics and/or related subtopics that can be used to either refine (narrow) or generalize (broaden) the user's query. These topic arrangements are dynamically created by selection of specific topics from the topic hierarchy that optimally satisfy various criteria as to the quality of their ability to refine, generalize, cover, or distinguish the documents in the document set with respect to other documents in th
