Character recognizing apparatus, method, and storage medium

Image analysis – Pattern recognition – Context analysis or word recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S310000

Reexamination Certificate

active

06636636

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The invention relates to a post-process for the purpose of improvement of recognition precision of characters.
The invention intends to select a proper character from character recognition candidates by using a chain probability of a plurality of characters which are continuously inputted.
2. Related Background Art
Among conventional character recognizing apparatuses, there is an apparatus comprising: a pattern matching section for comparing an inputted unknown character pattern with standard patterns which have been prepared as a recognition dictionary in the apparatus, thereby selecting a character code of the standard pattern having high similarity; and a post-processing section for performing a word collating process, a context process, and the like by using recognition candidates obtained from the pattern matching section, thereby outputting a most probable recognition result as a character train.
As a post-process using the context process, an N-gram statistic process to which a chain probability of each character in a character train is applied can be mentioned. The N-gram statistic process uses the chain probability of the following character when a certain character train is given. Particularly, the N-gram statistic process is called a Bi-gram statistic process when the given character train is constructed by two characters and is called a Tri-gram statistic process when it is constructed by three characters.
For example, the Bi-gram statistic process is generally reflected to an on-line character recognition post-process in the following manner.
When the user inputs “xi” (&xgr;), first, the handwritings of “x” and “i” are matching processed by the pattern matching section which has a dictionary in which a standard pattern of each character has been stored and discriminates the similar character every character in accordance with a shape of the input pattern. It is now assumed that “x” and “y” were selected for one input pattern “x” and “;” and “i” were selected for one input pattern “i” as recognition candidates in accordance with the order from the candidate of high similarity for each input pattern and they were outputted as candidate characters, respectively.
Subsequently, all of the possible combinations of the respective recognition candidates are formed. In this example, four combinations of “x;”, “xi”, “y;”, and “yi” exist. Among them, however, the combination in which the chain probability due to the Bi-gram statistic process using the Bi-gram statistic data which has previously been formed is the highest among those four character trains is “xi”. Therefore, a final recognition result is outputted as “xi”.
In case of executing the N-gram statistic process as a post-process as mentioned above, it is necessary to preliminarily calculate N-gram statistics data by using sample texts such as newspapers and the like, to store the chain probabilities of the characters derived from the calculated N-gram statistics into the recognizing apparatus as an N-gram dictionary in a format of a file or the like, and to read out and use the chain probabilities at the time of the execution of the recognition.
In case of using the Bi-gram statistic process in the N-gram statistic process of the above conventional character recognizing apparatus, a backward-chain probability such that attention is paid to a certain character and at which probability a character subsequent to the target character occurs is applied. In case of applying the Bi-gram statistic process to the character recognition, however, there is a case where an enough backward-processing effect cannot be obtained so long as only the backward-chain probability is used. For example, it is now assumed that recognition results of three characters of “
” are “
”, “
”, and “∘O” in accordance with the order of similarity, respectively. From those candidates, when the Bi-gram statistics are applied, a chain probability of “
” is the largest in case of the combination of the first and second characters. A chain probability of “IO” is the largest in case of the combination of the second and third characters. Since an operation value of “I” upon pattern matching is better than that of “
”, the result of “
IO” is finally outputted. According to this result, the number of times of erroneous recognition is larger than that of the recognition result at the time of the pattern matching. There is a problem such that a recognition rate is deteriorated by the post-processing step as mentioned above.
Similarly, three character patterns of “C∘.” are inputted and each of them is character recognized. Thus, it is now assumed that upper recognition candidate characters of the first pattern are “C” and “c”, upper recognition candidate characters of the second pattern are “l”, “∘”, and “O”, and upper recognition candidate characters of the third pattern are “.” and “∘”, respectively. When the Bi-gram statistics are applied to those candidates, a chain probability of “C∘” is the highest in case of the combination of the first and second patterns and a chain probability of “l∘” is the highest in case of the combination of the second and third patterns. Since a similarity operation value of “l” upon pattern matching is better than that of “∘”, a character train of “Cl∘” is finally outputted as a recognition result. According to this result, the number of time of erroneous recognition is larger than that in case of outputting the first candidate character upon pattern matching without performing a post-processing.
SUMMARY OF THE INVENTION
The invention is made to solve the above problems and it is an object of the invention to provide character recognizing apparatus and method for realizing the improvement of a recognition rate by further applying a forward-chain probability in addition to a backward-chain probability in a Bi-gram statistic process.
To accomplish the above object, according to claim
1
of the invention, there is provided a character recognizing apparatus for recognizing a plurality of characters by applying a chain probability of a character, comprising: backward-chain probability applying means for applying the chain probability from the i-th character among the plurality of characters to the (i+1)th character; forward-chain probability applying means for applying the chain probability from the (i+1)th character among the plurality of characters to the i-th character; unifying means for unifying results which are respectively obtained from the backward-chain probability applying means and the forward-chain probability applying means and setting a unified result as a post-processing result; and output means for outputting the post-processing result unified by the unifying means as a final recognition result.
According to the invention, by applying the forward-chain probability in addition to the backward-chain probability, the erroneous recognition of a character train which cannot be saved so long as only the backward-chain probability is used can be improved and the recognition rate can be improved. The character train which is displayed as a final recognition result displays a natural result as a sentence that is better than the result so far. There is, consequently, an effect that even if an erroneous recognition character exists, an anxious factor for the erroneous recognition of the user is reduced. Since the post-processing system using a strong restriction between the characters is adopted, the invention effectively functions in a special field or in a case where a range of characters as recognition targets is limited or the like.


REFERENCES:
patent: 3188609 (1965-06-01), Harmon et al.
patent: 4058795 (1977-11-01), Balm

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Character recognizing apparatus, method, and storage medium does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Character recognizing apparatus, method, and storage medium, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Character recognizing apparatus, method, and storage medium will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3136294

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.