Language-independent and segmentation-free optical character rec

Image analysis – Pattern recognition – Unconstrained handwriting

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

382192, 382228, 382185, G06K 948, G06K 980

Patent

active

059335254

ABSTRACT:
A language-independent and segment free OCR system and method comprises a unique feature extraction approach which represents two dimensional data relating to OCR as one independent variable (specifically the position within a line of text in the direction of the line) so that the same CSR technology based on HMMs can be adapted in a straightforward manner to recognize optical characters. After a line finding stage, followed by a simple feature-extraction stage, the system can utilize a commercially available CSR system, with little or no modification, to perform the recognition of text by and training of the system. The whole system, including the feature extraction, training, and recognition components, are designed to be independent of the script or language of the text being recognized. The language-dependent parts of the system are confined to the lexicon and training data. Furthermore, the method of recognition does not require pre-segmentation of the data at the character and/or word levels, neither for training nor for recognition. In addition, a language model can be used to enhance system performance as an integral part of the recognition process and not as a post-process, as is commonly done with spell checking, for example.

REFERENCES:
patent: 4809351 (1989-02-01), Abramovitz et al.
patent: 5062143 (1991-10-01), Schmitt
patent: 5343537 (1994-08-01), Bellegarda et al.
patent: 5438630 (1995-08-01), Chen et al.
patent: 5544257 (1996-08-01), Bellegarda et al.
patent: 5757960 (1998-05-01), Murdock et al.
patent: 5787198 (1998-07-01), Agazzi et al.
patent: 5862259 (1999-01-01), Bokser et al.
Bose et al. "Connected and Degraded Text Recognition Using Hidden Markov Model." Proceedings. 11th IAPR International Conference on Pattern Recognition. vol. II. pp. 116-119, Sep. 1992.
Kaltenmeier et al. "Sophisticated Topology of Hidden Markov Models for Cursive Script Recognition." Proceedings of the Second International Conference on Document Analysis and Recognition, pp. 139-142, Oct. 1993.
Sin et al. "A Statistical Approach with HMMs for On-Line Cursive Hangul (Korean Script) Recognition." Proceedings of the Second International Conference on Document Analysis and Recognition, pp. 147-150, Oct. 1993.
Bippus et al. "Cursive Script Recognition Using Semi Continuous Hidden Markov Models in Combination with Simple Features." IEE European Workshop on Handwriting Analysis and Recognition: A European Perspective, pp. 6/1-6, Jul. 1994.
Gonzalez and Woods, "Digital Image Processing", Addison-Wesley Pub. Co., pp. 416-418, 1992.
E. Levin and R. Pieraccini, "Dynamic Planar Warping for Optical Character Recognition," IEEE Int. Conf. Acoustics, Speech, Signal Processing, San Francisco, CA, pp. 111-149-111-152, Mar. 1992.
J.C. Anigbogu and A. Belaid, "Performance Evaluation of an HMM Based OCR System," Proc. 11th Int. Pattern Recognition, The Hague, The Netherlands, pp. 565-568, Aug. 1992.
G. Kopec and P. Chou, "Document Image Decoding Using Markov Source Models," IEEE Int. Conf. Acoustics, Speech, Signal Processing, Minneapolis, MN, pp. V-85-88, Apr. 1993.
O.E. Agazzi and S. Kuo, "Hidden Markov Model Based Optical Character Recognition in the Presence of Deterministic Transformations," Pattern Recognition, vol. 26, No. 12, pp.
T. Starner, J. Makhoul, R. Schwartz and G. Chou; On-Line Cursive Handwriting Recognition Using Speech Recognition Methods; IEEE International Conference on.
A. Kundu and P. Bahl, "Recognition of Handwritten Script: a Hidden Markov Model Based Approach," IEEE Int. Conf. Acoustics, Speech, Signal Processing, New York, NY, pp.
J.A. Vlontzos and S.Y. Kung, "Hidden Markov Models for Character Recognition," IEEE Trans. Image Processing, pp. 539-543, Oct. 1992.
H.-S. Park and S.-W. Lee, "Off-line Recognition of Large-set Handwritten Characters with Multiple Hidden Markov Models," Pattern Recognition, vol. 29, No. 2, pp. 231-244, 1996.
G.D. Forney, "The Viterbi algorithm," Proc. IEEE, vol. 61, pp. 268-278, 1973.
L. Nguyen, T. Anastasakos, F. Kubala, C. LaPre, J. Makhoul, R. Schwartz, N. Yuan, G. Zavaliagkos, and Y. Zhao, "The 1994 BBN/BYBLOS Speech Recognition System," Proc.
R.M. Schwartz, Y. Chow, S. Roucos, M. Krasner, and J. Makhoul, "Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition," IEEE Int. Conf.
J. Bellegarda and D. Nahamoo, "Tied Mixture Continuous Parameter Models for Large Vocabulary Isolated Speech Recognition," IEEE Int. Conf. Acoustics, Speech, Signal.
J. Makhoul, S. Roucos, H. Gish. "Vector Quantization in Speech Coding," Proc. IEEE, vol. 73, No. I 1, pp. 1551-1588, Nov. 1985.
L. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proc. IEEE, vol. 77, No. 2, pp. 257-286, Feb. 1989.
B. Al-Badr and S. Mahmoud, "Survey and bibliography of Arabic optical text recognition," Signal Processing, vol. 41, No. 1, pp. 49-77, 1995.
I.T. Phillips, S. Chen, and R.M. Haralick, "CD-ROM document database standard," Proc. Int. Conf. Document Analysis and Recognition, Tsukuba City, Japan, pp. 478-483, Oct.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Language-independent and segmentation-free optical character rec does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Language-independent and segmentation-free optical character rec, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Language-independent and segmentation-free optical character rec will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-856966

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.