Patent
1995-06-06
1998-08-18
Horabik, Michael
395/50, 395/54, 395/75, G06F 15/18
active
057969264
ABSTRACT:
A system is provided for learning extraction patterns (grammar) for use in connection with an information extraction system. The learning system learns extraction patterns from examples of texts and events. The patterns can then be used to recognize similar events in other input texts. The learning system builds new extraction patterns by recognizing local syntactic relationships between the sets of constituents within individual sentences that participate in events to be extracted. The learning system generalizes extraction patterns it has learned previously through simple inductive learning of sets of words that can be treated synonymously within the patterns. Sets of patterns for a sample extraction task perform nearly at the level of a hand-built dictionary of patterns.
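The generalization step described in the abstract, in which sets of words come to be treated synonymously within a pattern, might be sketched as follows. This is only an illustrative sketch, not the patented method: the pattern representation (a tuple of word sets) and the names make_pattern, generalize, and matches are all hypothetical, and the rule of merging two patterns that differ in exactly one position is an assumption chosen for simplicity.

```python
from typing import FrozenSet, List, Optional, Sequence, Tuple

# Hypothetical representation: a pattern is a sequence of slots,
# and each slot is the set of words accepted at that position.
Slot = FrozenSet[str]
Pattern = Tuple[Slot, ...]


def make_pattern(*slots: Sequence[str]) -> Pattern:
    """Build a pattern from one list of accepted words per position."""
    return tuple(frozenset(words) for words in slots)


def generalize(a: Pattern, b: Pattern) -> Optional[Pattern]:
    """Merge two patterns that differ in exactly one slot.

    The differing slots are unioned, so the words in them are
    afterwards treated as interchangeable. Returns None when the
    patterns are not similar enough to merge (assumed rule).
    """
    if len(a) != len(b):
        return None
    differing = [i for i, (x, y) in enumerate(zip(a, b)) if x != y]
    if len(differing) != 1:
        return None
    i = differing[0]
    return a[:i] + (a[i] | b[i],) + a[i + 1:]


def matches(pattern: Pattern, tokens: List[str]) -> bool:
    """Return True if the pattern matches any contiguous token window."""
    n = len(pattern)
    return any(
        all(tokens[start + k] in pattern[k] for k in range(n))
        for start in range(len(tokens) - n + 1)
    )


# Two patterns learned from separate examples (invented data).
p1 = make_pattern(["was"], ["named"], ["president"])
p2 = make_pattern(["was"], ["appointed"], ["president"])
merged = generalize(p1, p2)
sentence = "smith was appointed president of acme".split()
```

Here merged accepts either "named" or "appointed" in its second position, so it recognizes the sample sentence, which p1 alone does not.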
REFERENCES:
patent: 5034898 (1991-07-01), Lu
patent: 5212821 (1993-05-01), Gorin
patent: 5222197 (1993-06-01), Teng
patent: 5355510 (1994-10-01), Sekine
patent: 5481650 (1996-01-01), Cohen
patent: 5487135 (1996-01-01), Freeman
patent: 5504840 (1996-04-01), Hiji
E. Brill, "Some advances in transformation-based part of speech tagging," In Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94), pp. 722-727 (1994).
Chinchor et al., "MUC-5 evaluation metrics," In Proceedings of the Fifth Message Understanding Conference (MUC-5) Morgan Kaufmann, San Mateo, CA (1993).
R. J. Hall, "Learning by failing to explain," Machine Learning, 3(1) pp. 45-77 (1988).
Hobbs et al., "FASTUS: A system for extracting information from natural-language text," Technical Report No. 519, SRI International (1992).
J.R. Hobbs, "The generic information extraction system," In Proceedings of the Fifth Message Understanding Conference (MUC-5) Morgan Kaufmann, San Mateo, CA (1993).
Lehnert et al., "UMass/Hughes: Description of the CIRCUS system used for MUC-5," In Proceedings of the Fifth Message Understanding Conference (MUC-5) Morgan Kaufmann, San Mateo, CA (1993).
George Miller, "Five papers on WordNet," International Journal of Lexicography 3 pp. 235-312 (1990).
Mitchell et al., "Explanation-based generalization: A Unifying view," Machine Learning 1 (1986).
M. Pazzani, "Learning to predict and explain: An integration of similarity-based, theory driven, and explanation-based learning," Journal of the Learning Sciences 1(2) pp. 153-199 (1991).
E. Riloff, "Automatically constructing a dictionary for information extraction tasks," In Proceedings of the Eleventh National Conference on Artificial Intelligence (AAAI-93) pp. 811-816 (1993).
Soderland et al., "Wrap-Up: A trainable discourse module for information extraction," Journal of Artificial Intelligence Research (JAIR) 2 pp. 131-158 (1994).
K. VanLehn, "Learning one subprocedure per lesson," Artificial Intelligence 31(1) pp. 1-40 (1987).
Allen Kenneth R.
Horabik Michael
Price Waterhouse LLP
Wong Albert K.