Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-09-18
1999-05-18
Kulik, Paul V.
Data processing: database and file management or data structures
Database design
Data structure types
707 3, 707 5, G06F17/30
Patent
active
059059800
ABSTRACT:
The present invention provides a document processing apparatus, word extracting apparatus, word extracting method and storage medium for storing a word extracting program, capable of appropriately presenting effective associate words to the user. A retrieving element executes retrieval of documents based on a retrieval condition inputted through a retrieval condition inputting element. A keyword designating element designates an arbitrary word among the words included in the retrieved documents as an associate-word-searching word and designates other words as candidates for an associate word. A simultaneous appearance probability calculating element calculates a simultaneous appearance probability of the associate-word-searching word and one of the candidates for the associate word in any of the retrieved documents. A first independent appearance probability calculating element obtains an independent appearance probability of the associate-word-searching word in each of all documents. A second independent appearance probability calculating element calculates an independent appearance probability of each of the candidates for the associate word in each of all documents. A calculating element calculates the sum or product of the independent appearance probability of the associate-word-searching word and the independent appearance probability of each of the candidates for the associate word. An associate word extracting element extracts a word according to the ratio of the simultaneous appearance probability to the sum or product calculated by the calculating element.
REFERENCES:
patent: 5265065 (1993-11-01), Turtle
patent: 5418948 (1995-05-01), Turtle
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5576954 (1996-11-01), Driscoll
patent: 5694559 (1997-12-01), Hobson et al.
patent: 5694592 (1997-12-01), Driscoll
patent: 5737734 (1998-04-01), Schultz
patent: 5749081 (1998-05-01), Whiteis
Kwok "A Network Approuch to Probabilistic Information Retrieval" ACM Transactions on Information Systems, vol. 13, No. 3, pp. 324-353, Jul. 1995.
Syu et al. "A Competition-Based Connectionist Model for Information Retrieval Using a Merged Thesaurus" CIKM, 94, pp. 164-170, Mar. 1994.
Verma et al. "Evaluation of Overflow Probabilities in Resource Management" IEEE Database, ICC 92, pp. 1212-1216, Aug. 1992.
Tseng et al. "A Probabilitistic A Approuch to Query Processing in Heterogeneous Database Systems" IEEE Database, pp. 176-183, Jul. 1992.
Haruno et al., "Bilingual Text Alignment Using Statistical and Dictionary Information," Information Processing Society of Japan, SIG Notes, 96-NL-112, pp. 23-30, 1996.
Ohmori et al., "Automated Formation of bilingual Dictionary Using Statistical Information," Proceedings of the Second Annual Meeting of the Association for Natural Language Processing, pp. 49-52, 1996.
Masuichi Hiroshi
Tateno Masakazu
Umemoto Hiroshi
Fuji 'Xerox Co., Ltd.
Kulik Paul V.
Wallace, Jr. Michael J.
LandOfFree
Document processing apparatus, word extracting apparatus, word e does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Document processing apparatus, word extracting apparatus, word e, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Document processing apparatus, word extracting apparatus, word e will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1769074