Document categorisation system

Data processing: presentation processing of document – operator i – Operator interface – On-screen workspace or object

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S736000, C707S737000, C707S738000, C707S739000, C707S740000

Reexamination Certificate

active

07971150

ABSTRACT:
A document categorization system, including a clusterer for generating clusters of related electronic documents based on features extracted from the documents, and a filter module for generating a filter on the basis of the clusters to categorize further documents received by the system. The system may include an editor for manually browsing and modifying the clusters. The categorization of the documents is based on n-grams, which are used to determine significant features of the documents. The system includes a trend analyzer for determining trends of changing document categories over time, and for identifying novel clusters. The system may be implemented as a plug-in module for a spreadsheet application for permitting one-off or ongoing analysis of text entries in a worksheet.

REFERENCES:
patent: 5418951 (1995-05-01), Damashek
patent: 5448473 (1995-09-01), Takeuchi et al.
patent: 5548697 (1996-08-01), Zortea
patent: 5675710 (1997-10-01), Lewis
patent: 5745893 (1998-04-01), Hill et al.
patent: 5819258 (1998-10-01), Vaithyanathan et al.
patent: 5857179 (1999-01-01), Vaithyanathan et al.
patent: 5864855 (1999-01-01), Ruocco et al.
patent: 5948058 (1999-09-01), Kudoh et al.
patent: 5953718 (1999-09-01), Wical
patent: 6012058 (2000-01-01), Fayyad et al.
patent: 6032146 (2000-02-01), Chadha et al.
patent: 6122628 (2000-09-01), Castelli et al.
patent: 6128613 (2000-10-01), Wong et al.
patent: 6167397 (2000-12-01), Jacobson et al.
patent: 6233575 (2001-05-01), Agrawal et al.
patent: 6289337 (2001-09-01), Davies et al.
patent: 6298351 (2001-10-01), Castelli et al.
patent: 6360215 (2002-03-01), Judd et al.
patent: 6424971 (2002-07-01), Kreulen et al.
patent: 6430547 (2002-08-01), Busche et al.
patent: 6442545 (2002-08-01), Feldman et al.
patent: 6446061 (2002-09-01), Doerre et al.
patent: 6502081 (2002-12-01), Wiltshire et al.
patent: 6654742 (2003-11-01), Kobayashi et al.
patent: 6665681 (2003-12-01), Vogel
patent: 6701314 (2004-03-01), Conover et al.
patent: 6766035 (2004-07-01), Gutta
patent: 6922706 (2005-07-01), Kurtzberg et al.
patent: 7003519 (2006-02-01), Biettron et al.
patent: 7194471 (2007-03-01), Nagatsuka et al.
patent: 2002/0059161 (2002-05-01), Li
patent: 2002/0138460 (2002-09-01), Cochrane et al.
patent: 704810 (1996-04-01), None
patent: 704810 (1996-04-01), None
patent: 1024437 (2000-08-01), None
patent: 1024437 (2000-08-01), None
patent: WO97/08604 (1997-03-01), None
patent: WO 98/58344 (1998-12-01), None
Salton, “The SMART Retrieval System—Experiments in Automatic Document Processing,” Prentice-Hall, New Jersey, (1971).
Rasmussen, E. “Clustering Algorithms,” Information Retneva/, W. B. Frake and R. Baeza-Yates ed., Prentice-Hall, New Jersey, (1992).
Raskutti, B. et al., “An Evaluation of Cdteria for Measuring the Quality of Clusters,” Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, pp. 905-910 (1999).
Cohen, J. D., “Highlights: Language and Domain Independent Automatic Indexing Terms for Abstracting,” Journal of the American Society for
forrnation Science, 46(3): 162-174 (1995).
Rasmussen, E. “Clustering Algorithms,”Information Retrieval, W. B. Frake and R. Baeza-Yates ed., Prentice-Hall, New Jersey, (1992).
Cohen, J. D., “Highlights: Language and Domain Independent Automatic Indexing Terms for Abstracting,”Journal of the American Society for Information Science, 46(3):162-174 (1995).
Baeza-Yates, R. et al., “Chapter 5: Modern Information Retrieval, Query Operations”,Modern Information Retrieval, Harlow, Addison-Wesley, GB, (1999) pp. 117-139 Article No. XP002311981.
Baeza-Yates, R. et al., “Chapter 7: Moderen Information Retrieval, Text Operations”,Modern Information Retrieval, Harlow: Addison-Wesley, GB (Jan. 1, 1999) pp. 163-190 Article No. 002287648.
Griffiths, Alan et al., “Hierarchic Agglomerative Clustering Methods for Automatic Document Classification”,Department of Information Studies, University of Sheffield, Western Bank, Sheffield UK, The Journal of Documentation, (Sep. 1984) vol. 40, No. 3 pp. 175-205.
Iwayama, Makoto, et al., “Cluster-Based Text Categorization: A Comparison of Category Search Strategies”,IN: Proceedings of the 18th Annual Interntional ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, Seattle, WA (1995) pp. 273-280.
Joachims, Thorsten, “Text Categorization With Support Vector Machines: Learning With Many Relevant Features”,Machines Learning, European Conference on Machine Learning Proceedings(Apr. 21, 1998) pp. 137-142 Article No. XP002119808.
Kattenborn, Herbert, “Introducing in Excel 97: Department for Administrative and Clinical data—Hospital of the University of JLU-Giessen”, URL: http://www.uniklinikum-giessen.de/res58/excel97skript.pdf German Title—Einfuhrung in Excel97 Abt. fur Klinische and Administrative Datenverarbeitung—Klinikum der JLU-Universitat GieBen printed from Internet on Jan. 14, 2010) pp. 1-150.
Osuna, Edgar, et al., “An Improved Training Algorithm for Support Vector”,Neural Networks for Signal Processing, Proceedings of the IEEE Signal Processing Society Workshop(1997) pp. 276-285 Article No. XP002119807.
Willett, Peter, “Recent Trends in Hierarchic Document Clustering: A Critical Review”,Department of Information Studies, University of Sheffield, Great Britian, Information Proceeding&Management, vol. 24, No. 5 (Jan. 1998) pp. 577-597 Article No. XP000573921.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Document categorisation system does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Document categorisation system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Document categorisation system will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2716042

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.