Reexamination Certificate
2005-04-12
Chawan, Vijay (Department: 2654)
Data processing: speech signal processing, linguistics, language
Linguistics
Dictionary building, modification, or prioritization
C704S009000, C704S001000, C704S260000, C715S252000
active
06879951
ABSTRACT:
A Chinese word segmentation apparatus processes a Chinese sentence input to a computer. A character-to-phonetic converter of the segmentation apparatus first converts the Chinese sentence into a phonetic symbol string by referring to a character phonetic dictionary and a dictionary for characters with different pronunciations. A candidate word selector then refers to a system dictionary, using the phonetic symbols as indexing terms, to retrieve all possible candidate characters or words in the phonetic symbol string together with relevant information such as frequency of use. Unfeasible candidate characters or words are discarded. Next, an optimum candidate character string decider builds a candidate word network using the starting and ending positions of each candidate character or word in the input sentence as indexing terms. By referring to semantic and syntax information portions, frequency of use prioritization, word length prioritization, semantic similarity prioritization and syntax prioritization are combined to obtain a total estimate for selecting the optimum route. Finally, a word segmentation marking portion adds word segmentation markers to the input sentence according to the optimum route, completing word segmentation.
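The pipeline described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The dictionaries, the toneless phonetic keys, the function names and the scoring formula are all hypothetical, chosen only for illustration. The sketch combines just two of the four priorities (frequency of use and word length) and omits the semantic and syntax priorities as well as the handling of characters with several pronunciations.

```python
import math

# Hypothetical toy character-to-phonetic dictionary (one reading per character).
CHAR_PHONETIC = {"我": "wo", "们": "men", "喜": "xi", "欢": "huan", "北": "bei", "京": "jing"}

# Hypothetical system dictionary: phonetic key -> list of (word, frequency of use).
SYSTEM_DICT = {
    ("wo",): [("我", 900)],
    ("men",): [("们", 300), ("门", 400)],
    ("wo", "men"): [("我们", 800)],
    ("xi",): [("喜", 100), ("西", 500)],
    ("huan",): [("欢", 80)],
    ("xi", "huan"): [("喜欢", 700)],
    ("bei",): [("北", 300)],
    ("jing",): [("京", 200)],
    ("bei", "jing"): [("北京", 900)],
}
TOTAL = sum(freq for entries in SYSTEM_DICT.values() for _, freq in entries)


def to_phonetic(sentence):
    """Convert each character to its phonetic symbol (KeyError if unknown)."""
    return [CHAR_PHONETIC[ch] for ch in sentence]


def candidates(sentence, max_len=4):
    """Look up candidates by phonetic key and discard unfeasible ones."""
    phon = to_phonetic(sentence)
    found = []
    for start in range(len(sentence)):
        for end in range(start + 1, min(len(sentence), start + max_len) + 1):
            for word, freq in SYSTEM_DICT.get(tuple(phon[start:end]), []):
                # Homophones whose characters differ from the input are unfeasible.
                if word == sentence[start:end]:
                    found.append((start, end, word, freq))
    return found


def segment(sentence, length_weight=1.0):
    """Find the best-scoring route through the candidate word network."""
    n = len(sentence)
    by_start = {}  # the network, indexed by starting position
    for start, end, word, freq in candidates(sentence):
        by_start.setdefault(start, []).append((end, word, freq))
    # best[i] = (score of best route ending at position i, back-pointer)
    best = [(-math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for start in range(n):
        score_here = best[start][0]
        if score_here == -math.inf:
            continue  # position not reachable by any route
        for end, word, freq in by_start.get(start, []):
            # Assumed total estimate: log relative frequency plus a length bonus.
            score = score_here + math.log(freq / TOTAL) + length_weight * (len(word) - 1)
            if score > best[end][0]:
                best[end] = (score, (start, word))
    if best[n][0] == -math.inf:
        raise ValueError("no complete segmentation found")
    words, pos = [], n
    while pos > 0:  # follow back-pointers from the end of the sentence
        pos, word = best[pos][1]
        words.append(word)
    return "/".join(reversed(words))  # "/" stands in for the segmentation marker


print(segment("我们喜欢北京"))  # prints 我们/喜欢/北京
```

In this toy data the homophones 门 and 西 are retrieved by phonetic key and then discarded because they do not match the input characters. Indexing the network by starting position lets a single left-to-right pass find the highest-scoring route.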
REFERENCES:
patent: 4777600 (1988-10-01), Saito et al.
patent: 4937745 (1990-06-01), Carmon
patent: 5257938 (1993-11-01), Tien
patent: 5319552 (1994-06-01), Zhong
patent: 6014615 (2000-01-01), Chen
patent: 6587819 (2003-07-01), Lu
patent: 0271619 (1988-06-01), None
patent: 11-66061 (1999-03-01), None
English Language Abstract of JP-11-66061.
“Automatic Word Identification in Chinese Sentences by the Relaxation Technique”, Charng-Kang Fan et al., Proceedings of National Computer Symposium (1987).
Chawan Vijay
Greenblum & Bernstein P.L.C.
Matsushita Electric Industrial Co., Ltd.