Image analysis – Pattern recognition – On-line recognition of handwritten characters
Reexamination Certificate
1994-12-13
2003-02-25
Patel, Jayanti K. (Department: 2625)
Image analysis
Pattern recognition
On-line recognition of handwritten characters
C382S177000, C382S229000, C704S256000
Reexamination Certificate
active
06526170
ABSTRACT:
BACKGROUND OF THE INVENTION
The present invention relates to a character recognition system.
In a conventional character recognition system to read out the image of characters written on paper and produce the read-out characters as character codes capable of being processed on a computer, a character area is extracted from the document image, each of the characters is extracted from the extracted character area, and character recognition is executed for each division area.
In such a conventional character recognition system, since the character area is extracted before the character recognition process there may be errors in the extraction of the character due to causes such as blots and blurs of the image. If such errors occur, it is impossible to restore or compensate for the error, reducing the character recognition rate.
SUMMARY OF THE INVENTION
It is therefore an object of the present invention to provide a character recognition system having an improved recognition rate.
According to the present invention, there is provided a character recognition system comprising: feature extraction parameter storage means for storing a transformation matrix for reducing a number of dimensions of feature parameters and a codebook for quantization; HMM storage means for storing a constitution and parameters of a hidden Markov Model (HMM) for character string expression; feature extraction means for scanning a word image given from an image storage means from left to right in a predetermined cycle with a slit having a sufficiently smaller width than the character width and thus outputting a feature symbol at each predetermined timing, and matching means for matching a feature symbol row and a probability maximization HMM state, thereby recognizing the character string.
According to another aspect of the present invention, there is provided a character recognition system comprising: an image scanner; an image information storage means for storing document image data read out from the image scanner and pertaining image area information; a feature extraction parameter storage means for storing a transformation matrix to reduce a number of dimensions of feature parameters and a codebook to obtain feature symbols, the transformation matrix expressing each feature parameter which comprises multivariates as a small number of variates to minimize information loss, and previously calculated from a training sample feature parameter through main component analysis, the codebook being a set of codevectors used for quantization to express the transformed feature parameter with low bits, and previously calculated from the training sample feature parameter; a character string HMM storage means for storing a constitution and parameters of a character string HMM expression, the character string HMM being obtained by preparing one HMM for each character and adding a state transition from the completion state of each character to an initial state of each character HMM; an image extraction means for extracting an area with characters written therein from the document image and extracting rows and words from the extracted character area and storing word image area information thus obtained in the image information storage means; a feature extraction means for scanning the image of each word area from left to right in a predetermined cycle with a slit having a sufficiently smaller width than the character width and producing a feature symbol at each predetermined timing, and a matching means for making correspondence of the feature symbol sequence with a HMM state so as to maximize the probability of recognizing character string by utilizing the resultant optimum state transition sequence.
Other objects and features of the present invention will be clarified from the following description with reference to the attached drawings.
REFERENCES:
patent: 3482210 (1969-12-01), Lozier et al.
patent: 5321773 (1994-06-01), Kopec et al.
patent: 5323486 (1994-06-01), Taniguchi et al.
patent: 5438630 (1995-08-01), Chen et al.
patent: 5710916 (1998-01-01), Barbara et al.
patent: 5745600 (1998-04-01), Chen et al.
patent: A 60-230283 (1985-11-01), None
patent: A 4-96855 (1992-03-01), None
patent: A 4-275690 (1992-10-01), None
Kundu et al., Computer Vision and Pattern Recognition, CVPR, p. 457-462, 1988.*
“An Algorith for Vector Quantizer Design”, by Linde et at., IEEE Transactions on communications, vol. COM-28, No. 1, Jan. 1980 pp. 84-95.
“Speech Recognition with Probability Models”, by Seiichi Nakagawa, Society of Electronics Information Communication Engineers of Japan, pp. 44-61.
35th Information Processing Conference in Japan 5k-8 pp. 2173-2174.
NEC Corporation
Patel Jayanti K.
Sughrue & Mion, PLLC
LandOfFree
Character recognition system does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Character recognition system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Character recognition system will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3159695