System and method for extracting information from text using...

Data processing: speech signal processing – linguistics – language – Linguistics – Natural language

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S705000, C715S230000, C715S256000

Reexamination Certificate

active

07912705

ABSTRACT:
A fact extraction tool set (“FEX”) finds and extracts targeted pieces of information from text using linguistic and pattern matching technologies, and in particular, text annotation and fact extraction. Text annotation tools break a text, such as a document, into its base tokens and annotate those tokens or patterns of tokens with orthographic, syntactic, semantic, pragmatic and other attributes. A user-defined “Annotation Configuration” controls which annotation tools are used in a given application. XML is used as the basis for representing the annotated text. A tag uncrossing tool resolves conflicting (crossed) annotation boundaries in an annotated text to produce well-formed XML from the results of the individual annotators. The fact extraction tool is a pattern matching language which is used to write scripts that find and match patterns of attributes that correspond to targeted pieces of information in the text, and extract that information.

REFERENCES:
patent: 5890103 (1999-03-01), Carus
patent: 6108698 (2000-08-01), Tenev et al.
patent: 6279017 (2001-08-01), Walker
patent: 6385630 (2002-05-01), Ejerhed
patent: 6442545 (2002-08-01), Feldman et al.
patent: 6714939 (2004-03-01), Saldanha et al.
patent: 6910003 (2005-06-01), Arnold et al.
patent: 2001/0018697 (2001-08-01), Kunitake et al.
patent: 2002/0013694 (2002-01-01), Murata et al.
patent: 2002/0103775 (2002-08-01), Quass et al.
patent: 2002/0165717 (2002-11-01), Solmer et al.
patent: 2002/0177991 (2002-11-01), Ejerhed
patent: 2003/0007397 (2003-01-01), Kobayashi et al.
patent: 2003/0154070 (2003-08-01), Tokuda et al.
patent: 2003/0158723 (2003-08-01), Masuichi et al.
patent: 2003/0167162 (2003-09-01), Simpson et al.
patent: 2003/0229854 (2003-12-01), Lemay
patent: 2004/0078190 (2004-04-01), Fass et al.
patent: 2004/0243556 (2004-12-01), Ferrucci et al.
patent: 2004/0243645 (2004-12-01), Broder et al.
patent: 2005/0066271 (2005-03-01), Uchiyama et al.
patent: 2005/0154979 (2005-07-01), Chidlovskii et al.
Ravichandran, D. and Hovy, E. 2001. Learning surface text patterns for a Question Answering system. In Proceedings of the 40th Annual Meeting on Association For Computational Linguistics (Philadelphia, Pennsylvania, Jul. 7-12, 2002). Annual Meeting of the ACL. Association for Computational Linguistics, Morristown, NJ, 41-47.
Litkowski, K. C. (2003a). Question Answering Using XML-Tagged Documents. In E. M. Voorhees & L.P. Buckland (eds.), The Eleventh Text Retrieval Conference (TREC 2002). NIST Special Publication 500-251. Gaithersburg, MD., 122-131.
Collard, M.L.; Kagdi, H.H.; Majestic, J.I., “An XML-based lightweight C++ fact extractor,” Program Comprehension, 2003. 11th IEEE International Workshop on, vol., No., pp. 134-143, May 10-11, 2003.
Simov, K., Osenova, P., Slavecheva, M., Kolkovsha, S., Balabanova, E., Doikoff, D., Ivanova, K., Simov, A., & Kouylekov, M. (2002). Building a linguistically interpreted corpus of Bulgarian:the BulTreeBank. In Proceedings of Third International Conference on Language Resources and Evaluation LREC-2002 (pp. 1729-1736). Las Palmas de Gran Canaria, Spain.
Krauthammer M., Johnson S.B., Hripcsak G., Campbell D.A., and Friedman C. Representing nested semantic information in a linear string of text using XML. Proc AMIA Symp. 405-9, 2002.
R. Grishman et al., “Message Understanding Conference—6: A Brief History” (Nov. 1996). “Proceedings of the 16th International Conference on Computational Linguistics,” Copenhagen, (Jun. 1996), pp. 466-471.
D. Appelt et al., “Introduction to Information Extraction Technology, A Tutorial Prepared for IJCAI-99,” “Proceedings of the 16th International Joint Conference on Artificial Intelligence” (Jul. 31-Aug. 6, 1999), pp. 1-41.
B. Glasgow, “MITA: An Information Extraction Approach to Analysis of Free-form Text in Life Insurance Applications,” “AI Magazine,” 19(1):59-71, 1998.
R. Feldman et al., “Text Mining at the Term Level,” “Proc. of the 2nd European Symposium on Principles of Data Mining and Knowledge Discovery” (Nantes, France, Sep. 1998), pp. 1-9.
H. Cunningham, “Software Architecture for Language Engineering,” Ph.D. Thesis, Department of Computer Science, University of Sheffield (Jun. 2000), p. i (Abstract), Table of Contents, List of Figures, Chapter 7, and Appendix A.
H. Cunningham et al., “Experience of using GATE for NLP R&D,” “Proceedings of the Workshop on Using Toolsets and Architectures to Build NLP Systems at COLING-2000,” (Luxembourg 2000), pp. 1-8.
R. Grishman, “Real-Time Event Extraction for Infectious Disease Outbreaks,” “Proceedings of Human Language Technology Conference (HLT) (2002),” pp. 1-4.
B. Crysmann, “An Integrated Architecture for Shallow and Deep Processing,” Proceedings of ACL-2002, “Association for Computational Linguistics 40th Anniversary Meeting” (Jul. 2002), pp. 1-8.
H. Cunningham, “Developing Language Processing Components with GATE (a User Guide) for GATE version 2.1 beta 1 (Aug. 2002,” University of Sheffield (2001-2002) Table of Contents, and Chapters 5 and 6.
K. Bontcheva, “Using Human Language Technology for automatic Annotation and Indexing of Digital Library Content,” “Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries” (2002), pp. 613-625.
R. Feldman, “Mining biomedical literature using information extraction,” “Current Drug Discovery” (Oct. 2002) pp. 19-23.
H. Cunningham et al., “GATE: an Architecture for Development of robust HLT Applications,” “Proceedings of ACL 2002” (2002), pp. 1-8.
D. Maynard, “Architectural Elements of Language Engineering Robustness,” “Journal of Natural Language Engineering—Special Issue on Robust Methods in Analysis of Natural Language Data” 1 (1):1-20 (2002).
J. Hobbs et al., “FASTUS: Extracting Information from Natural-Language Texts,” “Finite State Devices for Natural Language Processing” (MIT Press 2000), pp. 1-22.
S. Miller et al. “A Novel Use of Statistical Parsing to Extract Information from Text,” 6th Applied Natural Language Processing Conference (2000), pp. 1-8.
J. Hobbs et al., “FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text,” “Finite State Devices for Natural Language Processing” (MIT Press 1996), pp. 1-22.
“FindEngineTM white paper, Version 1.0,” Hapax Information Systems AB (Sep. 2001) pp. 1-14.
B. Baldwin, “EAGLE: An Extensible Architecture for General Linguistic Engineering,” “Proceedings of RIAO '97” (Jun. 1997), pp. 271-283.
Mitchell Marcus, Grace Kim, Mary Ann Marcinkiewicz, Robert Maclntyre, Ann Bies, Mark Ferguson, Karen Katz, Britta Schasberger, The Penn Treebank: annotating predicate argument structure. Proceedings of the workshop on Human Language Technology, Mar. 8-11, 1994, Plainsboro, NJ [doi>10.3115/1075812.1075835].
H. Cunningham, D. Maynard, V. Tabian, C. Ursu and K. Bontceva: “Developing Language Processing Compnents with GATE.” GATE v2.0 User Guide, University of Sheffield, 2002. http:/
rrc.mitre.org/NRRC/02—results/tao.pdf.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for extracting information from text using... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for extracting information from text using..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for extracting information from text using... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2737536

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.