Optical character recognition system having context analyzer

Image analysis – Applications – Reading bank checks

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S321000, C704S258000

Reexamination Certificate

active

06577755

ABSTRACT:

FIELD OF THE INVENTION
The invention generally relates to the field of optical character recognition (OCR). More particularly, the invention provides a method and system architecture for optical character recognition which employs syntax and semantics information relating to the characters to be recognized, to improve recognition accuracy and confidence.
BACKGROUND OF THE INVENTION
The problem at hand is to recognize the textual information contained in a scanned image. The scanned images can come from a wide variety of source material, such as written answers to printed questions on forms, or mailing addresses on postal envelopes.
OCR systems employ various strategies for isolating small portions of the image (such as groups of numeric digits within telephone numbers or ZIP codes) as connected components, segmenting a connected component into one or several character images and recognizing each such image as representing a specific character.
The process of OCR is prone to errors. Therefore an OCR system may be designed to identify several alternatives for segmenting a connected component, and several character choices for each character inside a segmentation alternative. The results are typically provided from the output of the OCR system to an application program, such as a text processor or a printer driver.
Conventional OCR recognition engines exist, which recognize characters with a reasonable accurracy. However, even a 90% accuracy rate at the character level means less than 50% at the word level, so over half of all words contain at least one error.
It is well known that the use of context information in conjunction with OCR helps to improve the level of accuracy realized. For instance, if a connected component is identified as a ZIP code (which consists only of numeric characters), then any character choice within the connected component which is not a numeric character can be dismissed as an incorrect choice.
In conventional systems, the OCR subsystem simply provides any character choices it recognizes to the application program, and the exploitation of context is performed by the application program. However, there are drawbacks in such an approach.
If an OCR system scans an image and outputs a single choice for each scanned character, then the application program can try alternate character values for well known confusion possibilities (i and l, O and D, 3 and 5, etc.), or use a fuzzy search to find the best matching word in a dictionary. If the engine returns several choices, then the application program may try to find the most likely sequence of characters based on maximizing some total confidence estimate. But this is by no means an easy task; and it is inefficient use of programming resources to include, within application programs, the necessary logic and code for each particular application. Therefore, there is a need to relieve application programs of the responsibility for performing context-based checking of OCR results.
SUMMARY OF THE INVENTION
It therefore is an object of the present invention to provide an OCR system which performs its own context-based checking, to improve recognition accuracy.
To achieve this and other objectives, there is provided in accordance with the invention a system architecture for an OCR system including a general purpose, highly customizable, context analyzer. Once the initial scan of the input image to be recognized has been made, the tentatively identified characters are subjected to a context analysis, in which rules of syntax and semantics are applied in order to verify that the scanned characters are consistent with those rules, thereby improving the confidence level that the recognized characters are indeed correct.
Since the constraints are application-dependent, there is preferably a means for customizing the analyzer. In accordance with the invention, this is done through a language called Document Specification Language (DSL). A program, or “specification,” written in the Document Specification Language specifies both the syntax and the semantics of the information to be recognized. DSL is specialized, concise, and non-procedural. Its inherent search mechanism makes it a high level language. Once a specification for a given type of text to be recognized is written in DSL, the specification can be compiled into an internal representation (the context structure).
While the invention is primarily disclosed as a system, it will be understood by a person of ordinary skill in the art that a method, such as a method executable on a conventional data processor, including a CPU, memory, I/O, program storage, a connecting bus, and other appropriate components, could be programmed or otherwise designed to facilitate the practice of the method of the invention. Such a method would include appropriate method steps for performing the invention. Also, an article of manufacture, such as a pre-recorded disk or other similar computer program product, for use with a data processing system, could include a storage medium and program means recorded thereon for directing the data processing system to facilitate the practice of the method of the invention. It will be understood that such apparatus and articles of manufacture also fall within the spirit and scope of the invention.


REFERENCES:
patent: 3969698 (1976-07-01), Bollinger et al.
patent: 4523330 (1985-06-01), Cain
patent: 4674065 (1987-06-01), Lange et al.
patent: 4979227 (1990-12-01), Mittelbach et al.
patent: 5040227 (1991-08-01), Lyke et al.
patent: 5151948 (1992-09-01), Lyke et al.
patent: 5267327 (1993-11-01), Hirayama
patent: 5535120 (1996-07-01), Chong et al.
A.C. Downton, E. Kabir and D. Guillevic, “Syntactic and Contextual Post-Processing of Handwritten Addresses for Optical Character Recognition”, Pattern Recognition, 9th International Conference, 1988.*
Cohen, Computational Theory for Interpreting Handwritten Text in Constrained Domains, Elsevier, Artificial Intelligence 67, vol. 26, No. 1, Jul. 1994.
J. K. Mullin, InterfactingCriteria for Recognition Logic Used with a Context Post-Processor, Pattern Recognition Society, vol. 15, No. 3, pp. 271-273, 1982.
R. M. K. Sinha et al., Hybrid Contextual Text Recognition with String Matching, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, No. 9, pp. 915-925, Sep. 1993.
From Card to MARC, Vine, No. 48, pp. 26-30, Mar. 1985.
Ong and Lee, Knowledge-based Contextual Processor for Text Recognition, IEEE, Computer Society Technical Committee on Pattern Analysis and Machine Intelligence, (Proceedings) p. 465, Mar. 1993.
Siddiqui et al., Using Contextual postprocessing to Improve Machine Recognition . . . IEEE Int. Symp. on Information Theory, Absts. of Papers, p. 37, Sep. 1983.
Houle et al., Use of a Priori Knowledge for Character Recognition, SPIE vol. 1661, pp. 146-156, 1992.
Downton et al., Syntactic and Contextual Post-Processing of a Handwritten Addresses for Optical Character Recognition, IEEE 9th Internat. Conf. on pattern Recognition, vol. II, pp. 1072-1076, Nov. 1988.
Baird et al., Components of an Omnifont Page Reader, IEEE 8th Internat. Conf. on Pattern Recognition, pp. 344-348, Oct. 1986.
Borgman et al., Getty's Synoname and Its Cousins: A Survey of Applications of Personal Name-Matching Algorithms, Journal of the American Society for Information Science, pp. 459-476, 1992.
Fukunaga et al., Recognition of Words with Semantic Information, Electronics and Communications in Japan, vol. 59-A, No. 1 1976.
T. G. Rose et al., A Large Vocabulary Semantic Analyzer for Handwriting Recognition, AISB Quarterly, No. 80. pp. 34-39.
Recognition Technologies Under Discussion, Electronic Documents, vol. 2, No. 9, pp. 12-16.
Lecolinet, A New Model for Context-Driven Word Recognition, Proc. 2nd Ann. Symp. on Document Analysis and Information Retrieval, pp. 135-145.
G. Farmer, Automated Data Capture & Deposit System Utilizes OCR for Hand-printed Tax Forms, IMC Journal, vol. 28, #2, Mar./Apr., pp. 23-26, 1992.
M. M. Lankhorst et al., Automatic Word Categorization: An Information-theoretic Approach, (paper

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Optical character recognition system having context analyzer does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Optical character recognition system having context analyzer, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Optical character recognition system having context analyzer will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3090352

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.