Scalable semi-structured named entity detection

Data processing: database and file management or data structures – Database design – Database and data structure management

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S759000, C707S766000, C707S769000, C707S770000, C707S822000

Reexamination Certificate

active

08073877

ABSTRACT:
Disclosed are methods and apparatus for performing named entity recognition. A set of candidates and corresponding contexts are obtained, each of the set of candidates being a potential seed example of an entity. The contexts of at least a portion of the set of candidates are compared with contexts of a set of seed examples of the entity such that a subset of the set of candidates are added to the set of the seed examples. A set of rules are created from the set of seed examples obtained in the comparing step. A final set of seed examples of the entity is generated by executing the set of rules against the set of candidates.

REFERENCES:
patent: 7558778 (2009-07-01), Carus et al.
patent: 7574048 (2009-08-01), Shilman et al.
patent: 7672833 (2010-03-01), Blume et al.
patent: 2010/0227301 (2010-09-01), Yu et al.
patent: 2010/0318537 (2010-12-01), Surendran et al.
“Named entity recognition”, downloaded from http://en.wikipedia.org/wiki/Named—entity—recognition on Jan. 20, 2009.
McCallum, A. et al., “Early Results for Named Entity Recognition with Conditional Random fields, Feature Induction and Web-Enhanced Lexicons”, downloaded from http://www.cs.umass.edu/˜mccallum-conll2003 on Jan. 20, 2009.
P. Turney, “Mining the Web for Synonyms: PMI-IR Versus LSA on TOEFL”, in Proceedings of the Twelfth European Conference on Machine Learning, 2001.
E. Agichtein, et al., “Snowball: Extracting Relations from Large Plain-Text Collections”, in Proceedings of the Fifth ACM Conference on Digital Libraries, 2000.
D. Appelt, et al., “FASTUS: A Finite-State Processor for Information Extraction from Real-World Text”, In Proceedings of IJCAI-93, 1993.
D. E. Appelt, et al., “SRI International FASTUS System MUC-6 Test Results and Analysis”. In Proceedings of the Sixth Message Understanding Conference (MUC-6), pp. 237-248, San Francisco, 1995.
S. Baluja, et al., “Applying Machine Learning for High Performance Named-Entity Extraction”, Computational Intelligence, 16(4), Nov. 2000.
O. Bender, et al., “Maximum Entropy Models for Named Entity Recognition”, In Seventh Conference on Natural Language Learning (CoNLL-03), 2003.
D. M. Bikel, et al., “An Algorithm that Learns What's in a Name”, Machine Learning, 34(1-3):211-231, 1999.
S. Brin, “Extracting Patterns and Relations from the World Wide Web”, In Proceedings of the First International Workshop on the Web and Databases, 1998.
Y.-Y. Chen, et al., “Efficient Query Processing in Geographic Web Search Engines”, In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, 2006.
M. Collins, et al., “Unsupervised Models for Named Entity Classification”, In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999.
C. Elkan, et al., “Learning Classifiers from Only Positive and Unlabeled Data”, In Proceeding of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008.
O. Etzioni, et al., “Unsupervised Named-Entity Extraction from the Web: An Experimental Study”, Artificial Intelligence, 165(1):91-134, 2005.
R.-E. Fan,e t al., “Liblinear: A Library for Large Linear Classification”, Journal of Machine Learning Research, 9:1871-1874,2008.
R. Farkas, et al., “An Iterative Method for the De-Identification of Structured Medical Text”, In Proceeding of AMIA I2B2NLP Workshop, 2006.
R. Grishman, et al., “Design of the MUC-6 Evaluation”, In Proceedings of the Sixth Message Understanding Conference (MUC-6), pp. 1-11, San Francisco, 1995.
W.S. Lee, et al., “Learning with Positive and Unlabeled Examples Using Weighted Logistic Regression”, In Proceedings of the Twentieth International Conference on Machine Learning, 2003.
M. Paimg id="CUSTOM-CHARACTER-00001" he="3.13mm" wi="2.46mm" file="US08073877-20111206-P00001.TIF" alt="custom character" img-content="character" img-format="tif" ?ca, “Organizing and Searching the World Wide Web of Facts—Step Two: Harnessing the Wisdom of the Crowds”, in WWW '07: Proceedings of the 16th International Conference on World Wide Web, 2007.
M. Paimg id="CUSTOM-CHARACTER-00002" he="3.13mm" wi="2.46mm" file="US08073877-20111206-P00001.TIF" alt="custom character" img-content="character" img-format="tif" ?ca, et al., “The Role of Documents vs. Queries in Extracting Class Attributes from Text”, In CIKM '07: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007.
J. Parikh, et al., “Unity: Relevance Feedback Using User Query Logs”, In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006.
M. Paimg id="CUSTOM-CHARACTER-00003" he="3.13mm" wi="2.46mm" file="US08073877-20111206-P00001.TIF" alt="custom character" img-content="character" img-format="tif" ?ca, et al., “Organizing and Searching the World Wide Web of Facts—Step One: The One-Million Fact Extraction Challenge”, In Proceedings of the 21st National Conference on Artificial Intelligence, 2006.
E. Riloff, “Automatically Constructing a Dictionary for Information Extraction Tasks”, In Proceedings of the Eleventh National Conference on Artificial Intelligence, 1993.
E. Riloff, et al., “Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping”,In Proceedings of the Sixteenth National Conference on Artificial Intelligence, 1999.
G. Salton, et al., “Term Weighting Approaches in Automatic Text Retrieval”, Technical Report, Ithaca, NY, USA, 1987.
B. Scholkopf, et al., “Estimating the Support of a High-Dimensional Distribution”, Neural Comput., 13(7):1443-1471, 2001.
M. Thelen, et al., “A Bootstrapping Method for Learning Semantic Lexicons Using Extraction Pattern Contexts”, In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, 2002.
V. Von Brzeski, et al., “Leveraging Context in User-Centric Entity Detection Systems”, In CIKM, pp. 691-700, ACM, 2007.
H. Yu, “Single-Class Classification with Mapping Convergence”, Machine Learning, 61(1-3):49-69, 2006.
H. Yu, et al., “Pebl: Web Page Classification Without Negative Examples”, IEEE Transactions on Knowledge and Data Engineering, 16:70-81, 2004.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Scalable semi-structured named entity detection does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Scalable semi-structured named entity detection, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Scalable semi-structured named entity detection will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4260840

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.