Data processing: database and file management or data structures – Database design – Data structure types
Patent
1996-03-29
1999-07-13
Feild, Joseph H.
Data processing: database and file management or data structures
Database design
Data structure types
G06F 1724
Patent
active
059241080
ABSTRACT:
An author-oriented document summarizer for a word processor is described. The document summarizer performs a statistical analysis to generate a list of ranked sentences for consideration in the summary. The summarizer counts how frequently content words appear in a document and produces a table correlating the content words with their corresponding frequency counts. Phrase compression techniques are used to produce more accurate counts of repeatedly used phrases. A sentence score for each sentence is derived by summing the frequency counts of the content words in a sentence and dividing that tally by the number of the content words in the sentence. The sentences are then ranked in order of their sentence scores. Concurrent with the statistical analysis, during the same pass through the document the summarizer performs a cue-phrase analysis to weed out sentences with words or phrases that have been pre-identified as potential problem phrases. The cue-phrase analysis compares sentence phrases with a pre-compiled list of words and phrases and sets conditions on whether the sentences containing them can be used in the summary. Following the cue-phrase analysis, the summarizer creates a summary containing the higher ranked sentences. The summary may also include a conditioned sentence if the conditions established for inclusion of the sentence have been satisfied. The summarizer then inserts the sentence at the beginning of the document before the start of the text.
REFERENCES:
patent: 4965763 (1990-10-01), Zamora
patent: 5689716 (1997-11-01), Chen
patent: 5778397 (1998-07-01), Kupiec et al.
Sumita, Ono, Chino, Ukita, and Amano, "A Discourse Structure Analyzer for Japanese Text," Proceedings of the International Conference of Fifth Generation Computer Systems 1992, pp. 1133-1140.
H.P. Luhn, The Automatic Creation of Literature Abstracts, IBM Journal, Apr. 1958, pp. 159-165.
Kenji Ono, Kazuo Sumita, Seiji Miike, "Abstract Generation Based On Rhetorical Structure Extraction," Proceedings of the 15.sup.th International Conference on Computational Linguistics, vol. 1, at pp. 344-348, for a conference held Aug. 5-9, 1994 in Kyoto, Japan.
"Test Summarisation", BT Laboratories, retrieved from BT Web site at www.bt.com.
"Short Cuts", Science Technology section, The Economist, Dec. 17.sup.th, 1994, pp. 85-86.
Salton, Allan, Buckley, and Singhal, "Automatic Analysis, Theme Generation, and Summarization of Machine-Readable Texts", Science, vol. 264, Jun. 3, 1994, pp. 1421-1426.
Newspaper Excerpt on Produce release from Visual Recall 2.0 by Jessica Davis.
Cokus Shawn J.
Dolan William B.
Fein Ronald A.
Fries Edward J.
Messerly John
Feild Joseph H.
Kindred Alford W.
Microsoft Corporation
LandOfFree
Document summarizer for word processors does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Document summarizer for word processors, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Document summarizer for word processors will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2288614