Apparatus for and method of summarising text

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000, C707S793000

Reexamination Certificate

active

10797107

ABSTRACT:
Apparatus for identifying topics of document data has:a word ranker (171) for ranking words that are present in or representative of the content of the document data;a co-occurrence ranker (172) for ranking co-occurrences of words that are present in or representative of the content of the document data;a phrase ranker (170) for ranking phrases in the document data;a word selector (174) for selecting the highest ranking words;a co-occurrence identifier (176) for identifying which of the highest ranking co-occurrences contain at least one of the highest ranking words;a phrase identifier (177) for identifying the phrases containing at least one word from the identified co-occurrences;a phrase selector (178) for selecting the highest ranking one or ones of the identified phrases as the topic or topics of the document data; andan output device (40) for outputting data relating to the selected topics.

REFERENCES:
patent: 5644776 (1997-07-01), DeRose et al.
patent: 5708825 (1998-01-01), Sotomayor
patent: 5918240 (1999-06-01), Kupiec et al.
patent: 5924108 (1999-07-01), Fein et al.
patent: 6167368 (2000-12-01), Wacholder
patent: 6185592 (2001-02-01), Boguraev et al.
patent: 6353824 (2002-03-01), Boguraev et al.
patent: 6594658 (2003-07-01), Woods
patent: 6789230 (2004-09-01), Katariya et al.
patent: 2002/0174101 (2002-11-01), Fernley et al.
patent: 2002/0184267 (2002-12-01), Nakao
patent: 2003/0028558 (2003-02-01), Kawatani
patent: 2003/0069877 (2003-04-01), Grefenstette et al.
patent: 2004/0029085 (2004-02-01), Hu et al.
patent: 1 033 662 (2000-09-01), None
patent: 1 071 023 (2001-01-01), None
patent: 1 217 533 (2002-06-01), None
patent: 1 233 349 (2002-08-01), None
patent: 2 332 544 (1999-06-01), None
J. Vergne, “Tutorial: Trends in Robust Parsing”, Jul. 29, 2000, Coling 2000, available at http:users.info.unicaen.fr/˜jvergne/tutorialColing2000.html and http:users.info.unicaen.fr/˜jvergne/RobustParsingCourseSlides.pdf.zip.
G.A. Miller et al., “Introduction to WordNet: An On-line Lexical Database”, Cognitive Science Laboratory, Princeton University, Aug. 1993, available at http://www.cogsci.princeton.edu/˜wn/.
R. Barzilay et al., “Using Lexical Chains for Text Summarization”, Proceedings of the Intelligent Scalable Text Summarization Workshop, Madrid, ACU 1997.
R. Barzilay, “Lexical Chains for Summarization”, M.Sc. Thesis, Ben-Gurion University of the Negev, Department of Mathematics & Computer Science, Nov. 30, 1997.
H.P Edmundson, “New Methods in Automatic Extracting”, pp. 23-33, 36, 37, 40, and 41 of “Advances in Automatic Text Summarisation” by I. Mani et al., 1999.
F.C. Johnson et al., “The application of linguistic processing to automatic abstract generation”, Journal of Document & Text Management, vol. 1, No. 3, 1993, pp. 215-241.
B. Boguraev et al., “Dynamic Presentation of Document Content for Rapid On-Line Skimming”, AAAI Spring 1998 Symposium on Intelligent Text Summarization.
I. Mani et al., “Multi-document Summarization by Graph Search and Matching”, available at http://arxiv.org.PS—cache/cmp-lg/pdf/9712/9712004.pdf, Proceedings of the Fourteenth National Conference on Artificial Intelligence (AAAI'97), Dec. 10, 1997, pp. 622-628.
D. Lawrie et al., “Finding Topic Words for Hierarchical Summarization”, SIGIR '01, Sep. 9-12, 2001, New Orleans, LA, pp. 349-357 and presentation slides.
Roget's on-line thesaurus, available at http://humanities.uchicago.edu/orgs/ARTEL/forms—unrest/Roget.html.actual (date unknown).
Swan, R., et al., “TimeMines: Constructing Timelines with Statistical Models of Word Usage”, ACM SIGKDD 2000, Workshop on Text Mining, pp. 73-80, CIIR Technical Report IR-213, (2000) available at http://ciir.cs.umass.edu.
“NP Chunking”, available at http://lcg-www.uia.ac.be/˜erikt/research
p-chunking.html, Sep. 13, 2002.
I. Mani et al., “Automatic Summarization”, Chapter 4, John Benjamin Publishing Company, 2001, pp. 77-89.
S. Evert et al., “Methods for Qualitative Evaluation of Lexical Association Measures”, Proceedings of the 39thAnnual Meeting of the Association for Computational Linguistics, Toulouse, France, 2001.
D. Lin et al., Presentation slides entitled “Collocation”, Department of Computing Science, University of Alberta, Feb. 5, 2002.
T. Dunning, “Accurate Methods for the Statistics of Surprise and Coincidence”, Computational Linguistics 19(1), 1993, available at http://citeseer.nj.nec.com/dunning93accurate/html.
“The Festival Speech Synthesis System”, Section 9.1 of the manual for that system, Nov. 20, 2002, available at http://www.cstr.ed.ac.uk/projects/festival/manual/festival—9.html.
M.A. Hearst, “Multi-Paragraph Segmentation of Expository Text”, 32ndAnnual Meeting of the Association for Computational Linguistics in 1994, available at http://citeseer.nj.nec.com/hearst94multiparagraph.html.
Armour, T., presentation slides entitled “Asymmetric Threat Initiative”, 2000, available at http://www.darpa.mil/darpatech2000/presentations/iso.pdf/4armouratb&wrev1.pdf.
Armour, T., “Asymmetric Threat Initiative”, 2000, available at http://www.darpa.mil/body/dar—archives.html.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Apparatus for and method of summarising text does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Apparatus for and method of summarising text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus for and method of summarising text will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3885186

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.