Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Reexamination Certificate
2004-04-16
2008-12-09
Hudspeth, David R. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
Reexamination Certificate
active
07464024
ABSTRACT:
A parser is provided that parses a Chinese text stream at the character level and builds a syntactic structure of Chinese character sequences. A character-based syntactic parse tree contains word boundaries, part-of-speech tags, and phrasal structure information. Syntactic knowledge constrains the system when it determines word boundaries. A deterministic procedure is used to convert word-based parse trees into character-based trees. Character-level tags are derived from word-level part-of-speech tags and word-boundary information is encoded with a positional tag. Word-level parts-of-speech become a constituent label in character-based trees. A maximum entropy parser is then built and tested.
REFERENCES:
Xue et al (“Building a Large-Scale Annotated Chinese Corpus”, Proceedings of the 19th International Conference on Computational Linguistics, 2002).
Luo (“A Maximum Entropy Chinese Character-Based Parser”, Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, Jul. 2003, pp. 192-199).
Luo Xiaoqiang
Ward Robert Todd
Dougherty Anne
Hudspeth David R.
International Business Machines - Corporation
Neway Samuel G
Stewart Mari A.
LandOfFree
Chinese character-based parser does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Chinese character-based parser, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Chinese character-based parser will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4032599