Chinese character-based parser

Data processing: speech signal processing – linguistics – language – Linguistics – Natural language

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

07464024

ABSTRACT:
A parser is provided that parses a Chinese text stream at the character level and builds a syntactic structure of Chinese character sequences. A character-based syntactic parse tree contains word boundaries, part-of-speech tags, and phrasal structure information. Syntactic knowledge constrains the system when it determines word boundaries. A deterministic procedure is used to convert word-based parse trees into character-based trees. Character-level tags are derived from word-level part-of-speech tags and word-boundary information is encoded with a positional tag. Word-level parts-of-speech become a constituent label in character-based trees. A maximum entropy parser is then built and tested.

REFERENCES:
Xue et al (“Building a Large-Scale Annotated Chinese Corpus”, Proceedings of the 19th International Conference on Computational Linguistics, 2002).
Luo (“A Maximum Entropy Chinese Character-Based Parser”, Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, Jul. 2003, pp. 192-199).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Chinese character-based parser does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Chinese character-based parser, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Chinese character-based parser will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4032599

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.