Data processing: presentation processing of document – operator i – Presentation processing of document – Text summarization or condensation
Reexamination Certificate
2001-03-26
2009-10-20
Hutton, Doug (Department: 2176)
Data processing: presentation processing of document, operator i
Presentation processing of document
Text summarization or condensation
C705S003000, C705S004000, C705S005000, C707S793000, C707S793000, C704S245000
Reexamination Certificate
active
07607083
ABSTRACT:
Text summarizers using relevance measurement technologies and latent semantic analysis techniques provide accurate and useful summarization of the contents of text documents. Generic text summaries may be produced by ranking and extracting sentences from original documents; broad coverage of document content and decreased redundancy may simultaneously be achieved by constructing summaries from sentences that are highly ranked and different from each other. In one embodiment, conventional Information Retrieval (IR) technologies may be applied in a unique way to perform the summarization; relevance measurement, sentence selection, and term elimination may be repeated in successive iterations. In another embodiment, a singular value decomposition technique may be applied to a terms-by-sentences matrix such that all the sentences from the document may be projected into the singular vector space; a text summarizer may then select sentences having the largest index values with the most important singular vectors as part of the text summary.
REFERENCES:
patent: 6029195 (2000-02-01), Herz
patent: 6356864 (2002-03-01), Foltz et al.
patent: 6523026 (2003-02-01), Gillis
patent: 6611825 (2003-08-01), Billheimer et al.
patent: 6865572 (2005-03-01), Boguraev et al.
patent: 2002/0002450 (2002-01-01), Nunberg et al.
patent: 2002/0078090 (2002-06-01), Hwang et al.
patent: 2001-014341 (2001-01-01), None
Goldstein et al. “Summarizing Text Documents: Sentence Selection and Evaluation Metrics” published Aug. 1999 by ACM Press pp. 121-128.
Furnas et al. “Information Retrieval using a Singular Value Decomposition Model of Latent Sematic Structure” Published 1988 by ACM Press pp. 465-480.
Goldstein et al. “Multi-Document Summarization By Sentence Extraction” published Apr. 2000 by ACM Press pp. 40-48.
William W. Cohen “Data Integration Using Similarity Joins and a Word-Based Information Representation Language” By At& T Lab- Research, Shannon Laboratory, Published by ACM, vol. 18; No. 3, Jul. 2000, pp. 288-321.
Chang et al., “Next generation content representation, creation and search for new media applications in education,”IEEE Proceedings, special issue on Multimedia Signal Processing, vol. 86, pp. 884-904 May 1998.
Baldwin et al., “Dynamic coreference-based summarization” inProceedings of the Third Conference on Empirical Methods in Natural Lanuage Processing(EMNLPS), (Granada, Spain), Jun. 1998.
Sanderson, “Accurate user directed summarization from existing tools,” inProceedings of the 7thInternational Conference on Information and Knowledge Management(CIKM98), 1998.
Hovy et al., “Automated text summarization in SUMMARIST,” inProceedings of the TIPSTER Workshop, (Baltimore, MD), 1998.
http://www.SRA.com.
Buckley et al., “The smart/empire TIPSTER IR system,” inProceedings of TIPSTER Phase III Workshop, 1999.
Goldstein et al., “Summarizing text documents: Sentence selection and evaluation metrics,”inProceedings of ACM SIGIR '99, (Berkeley, CA), Aug. 1999.
Dumais, “Improving the retrieval of information from external sources,”Behavior Research Methods, Instruments, and Computers, vol. 23, 1991.
Press et al.,Numerical Recipes in C: The Art of Scientific Computing, Cambridge, England: Cambridge University Press, 2 ed., 1992.
Deerwester et al., “Indexing by latent semantic analysis,”Journal of the American Society for Informatio Science, vol. 41, pp. 391-407, 1990.
Berry et al., “Using linear algebra for intelligent information retrieval,” Tech. Rep. UT-CS-94-270, University of Tennessee, Computer Science Department, Dec. 1994.
Berger et al., “A maximum entropy approach to natural language processing,”Journal of Computational Linguistics, vol. 22, 1996.
Japanese Patent Application Publication No. 6-215049, published Aug. 5, 1994.
Japanese Patent Application Publication No. 9-319768, published Dec. 12, 1997.
Japanese Patent Application Publication No. 11-102372, published Apr. 13, 1999.
Japanese Patent Application Publication No. 11-259521, published Sep. 24, 1999.
Hirao, Tsutomu: Abstracting of texts based on importance of terms. Journal of Information Processing, Japan, Information Processing Society of Japan, May 15, 1998, vol. 98, No. 34, p. 41-47.
Gong Yihong
Liu Xin
Hutton Doug
NEC Corporation
Sughrue & Mion, PLLC
Tran Quoc A
LandOfFree
Test summarization using relevance measures and latent... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Test summarization using relevance measures and latent..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Test summarization using relevance measures and latent... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4117880