Radical definition and dictionary creation for a handwriting...

Image analysis – Pattern recognition – Ideographic characters

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S187000, C382S229000

Reexamination Certificate

active

06539113

ABSTRACT:

TECHNICAL FIELD
The present invention relates generally to data processing systems and, more particularly, to radical definition and dictionary creation in a handwriting recognition system.
BACKGROUND OF THE INVENTION
Kanji is a Japanese system of writing that utilizes characters borrowed or adapted from Chinese writing. The elements of grammar in Kanji are known as “Kanji characters.” The phrase “elements of grammar” refers to units of a given natural language that are capable of comprising parts of speech. For example, the elements of grammar in the English language are words. As such, each Kanji character is a higher order linguistic symbol that is analogous to a word in the English language. That is, natural languages tend to have three levels of linguistic elements. The lowest of these levels depends on the specific alphabet used and is associated with the sounds of the spoken language. For example, the first and lowest level of linguistic elements in the English language comprises letters. The third level of linguistic elements is the highest level and contains linguistic elements conveying full creative expression. In the English language, the third level comprises sentences. It is the second level of linguistic elements to which the phrase “elements of grammar” refers. This second level is an intermediate level of linguistic elements and in the English language, the second level comprises words. In Japanese, the second level comprises Kanji characters.
Kanji characters typically comprise radicals. A “radical” is a part of a Kanji character, much like letters are part of a word. Oftentimes, a radical is itself a Kanji character. For example,
FIG. 1
depicts a Kanji character
100
that comprises two radicals
102
and
104
. Radical
102
is the “day” radical and radical
104
is the “month” radical. When combined, the resulting Kanji character
100
means “open.” There is a well-known, standard set of 214 radicals that are referred to as “traditional radicals.”
FIGS. 2A and 2B
depict the set of traditional radicals
200
. Within the set of traditional radicals
200
, each radical is enumerated from
1
-
214
with alternative drawings indicated with either parenthesis or brackets (e.g., “(32)”).
Some conventional computer systems for recognizing Kanji handwriting have focused on recognizing traditional radicals in order to recognize a Kanji character. This technique is known as “radical recognition.” These conventional systems have attained higher accuracy in recognizing Kanji characters over previous systems, and have reduced the amount of data that must be stored when performing Kanji character recognition. However, the conventional radical recognition approach suffers from a few drawbacks. First, it is difficult to determine which radicals of the traditional radicals should be used. Some of the traditional radicals are individual (“atomic”) radicals and others are combinations of atomic radicals. Hence, a decision must be made whether to use the atomic radicals, the combination radicals, or both. A second drawback is that after the set of radicals is determined, each radical typically must be manually entered into a database and mapped onto the Kanji characters that utilize the radicals. This procedure is time consuming. The third drawback stems from the conventional approach being nonextensible. That is, the conventional approach cannot be used with non Kanji-based languages. Also, after the radicals are mapped onto the Kanji characters, if the system is to be extended to recognize new Kanji characters, the set of radicals and the set of Kanji characters that are recognized usually have to be augmented manually, which is a time consuming task. That is, the additional Kanji characters have to be entered manually into the system and associated with their component radicals. Augmenting the set of Kanji characters that are recognized is a likely possibility since there are over 500,000 Kanji characters and most Kanji handwriting recognition systems only recognize a few thousand. Based upon these drawbacks, it is desirable to improve conventional radical recognition systems.
SUMMARY OF THE INVENTION
The system described herein automatically defines a set of radicals to be used in a Kanji character handwriting recognition system and automatically creates a dictionary of the Kanji characters that are recognized by the system. As a result, the system described herein facilitates the development of Kanji handwriting recognition systems and attains a higher accuracy over conventional systems when recognizing Kanji handwriting. Additionally, the system described herein is fully extensible and can therefore be extended with little effort to recognize different languages. Moreover, if the system described herein is used for Kanji character recognition, it can be extended easily to recognize additional radicals and Kanji characters. In performing its functionality, the system described herein first obtains representative handwriting samples for each Kanji character that is to be recognized by the system. The system described herein then evaluates the samples to identify a set of subparts (“radicals”) that are common to at least two of the Kanji characters. These radicals represent component roots (“visual components”) from which the characters are formed. Each Kanji character is formed by one or more of these radicals. The radicals that are identified by the system described herein are not constrained to any preset definition (e.g., the traditional set of radicals). Thus, the radicals utilized by the system described herein may include some of the traditional radicals or may include none of the traditional radicals. After identifying the set of radicals, the system described herein generates a dictionary with a mapping of each Kanji character that is to be recognized by the system to its component radicals. After the set of radicals and the dictionary have been created, these components can be utilized during handwriting recognition. When performing handwriting recognition, the system described herein identifies the radicals within the handwriting and then uses the mapping to determine which Kanji character the handwriting most closely matches.
In accordance with a first aspect of the present invention, a method for generating radicals of Kanji characters is practiced in a computer system. This method provides for receiving sample handwriting data from at least one user comprising a plurality of Kanji characters with each Kanji character comprising at least one radical that is a common component of at least two Kanji characters. Further, the method provides for examining the sample handwriting data to automatically create a set of radicals from the sample handwriting data.
In accordance with a second aspect of the present invention, a computer system for recognizing Kanji characters is provided. In accordance with the second aspect of the present invention, the computer comprises an analyzer component for receiving sample handwritten data comprising a plurality of Kanji characters and for automatically defining a set of radicals from the sample handwriting data and a recognizer component for receiving handwriting user input indicating an intended Kanji character and for comparing the received handwriting user input to the set of radicals to determine the intended Kanji character.


REFERENCES:
patent: 3979722 (1976-09-01), Sakoe
patent: 4365235 (1982-12-01), Greanias et al.
patent: 4410916 (1983-10-01), Pratt et al.
patent: 4542526 (1985-09-01), Satoh et al.
patent: 4559615 (1985-12-01), Goo et al.
patent: 4573196 (1986-02-01), Crane et al.
patent: 4628532 (1986-12-01), Stone et al.
patent: 4630309 (1986-12-01), Karow
patent: 4653107 (1987-03-01), Shojima et al.
patent: 4672677 (1987-06-01), Yamakawa
patent: 4680804 (1987-07-01), Kuzunuki et al.
patent: 4680805 (1987-07-01), Scott
patent: 4685142 (1987-08-01), Ooi et al.
patent: 4701960 (1987-10-01), Scott
patent: 4718103 (1988-01-01), Shojima et al.
patent: 4972496 (1990-11-01), Sklarew
patent: 4979226 (1990-12-01), Sato
pat

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Radical definition and dictionary creation for a handwriting... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Radical definition and dictionary creation for a handwriting..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Radical definition and dictionary creation for a handwriting... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3061879

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.