Communications: electrical – Digital comparator systems
Patent
1975-12-24
1977-01-11
Thesz, Joseph M.
Communications: electrical
Digital comparator systems
3401463WD, G06K 900
active
040030253
ABSTRACT:
The print convention apparatus and method disclosed herein determine whether an alphabetic character field output from an optical character reader (OCR) results from the OCR scan of an upper case or a lower case inscription on the scanned document. The alphabetic character field (e.g., a word) comprises one or more alphabetic characters which represent the OCR's interpretation of characters printed on the scanned document. Each word output by the OCR corresponds to a field (i.e., word) of characters imprinted on the scanned document. The electrical signals output from the OCR, representative of the upper and lower case alphabetic characters and of rejects, including conflicts, are applied to a character occurrence probability storage apparatus which contains precomputed empirical probabilities that: (1) a given character recognition is the result of the scan of an upper case character; and (2) a given character recognition is the result of the scan of a lower case character. In addition, the storage apparatus includes probability values for character conflicts and rejects. As the series of alphabetic character signals from the OCR output is applied character-by-character to the character occurrence probability storage apparatus (e.g., a read-only store), a running sum of the respective probabilities for the upper case and lower case print conventions is developed so that, following the input to the aforesaid apparatus of the final character, reject or conflict within a word, an appropriate upper or lower case determination can be made for all of the characters within the word. This determination corresponds with the print convention of the word inscribed on the scanned document.
A corresponding upper or lower case flag is generated with the print convention determination and associated with the alphabetic character word output from the print convention apparatus for further text processing. In one embodiment of the invention, the probability for each OCR output alphabetic character being an upper or lower case character is stored in respective upper and lower case character occurrence probability storage devices after having been precomputed as the product of two probability factors, i.e., (1) a first probability factor with respect to the likelihood that the OCR recognition resulted from the scan of an upper or lower case character, and (2) a second probability factor with respect to the likelihood of a given character occurring in a specified language (e.g., English) document. In another embodiment of the invention, the character occurrence probability storage devices are functionally replaced by a read-only store having an address position for each upper and lower case alphabetic character output by the OCR, including conflicts and rejects, and a precomputed numerical probability value associated with each address position to represent the quotient of: (1) the probability that a given character is related to an upper case print convention; and (2) the probability that the same character is related to a lower case print convention.
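The word-level decision described in the abstract can be illustrated with a short Python sketch. It is not the patented circuit, and several things in it are assumptions made only for illustration: the table values are invented, the names REJECT, FACTORS, build_store, classify_word, build_ratio_store and classify_word_by_ratio are hypothetical, and the stored values are kept as logarithms so that a running sum corresponds to a product of per-character probabilities (the abstract speaks only of a running sum of stored values). For the second embodiment the abstract does not say how the stored quotients are combined; the sketch assumes their logarithms are summed and compared against zero.

```python
import math

REJECT = "*"  # hypothetical marker for an OCR reject or conflict

# Invented illustrative values, not taken from the patent.
# symbol -> (recognition factor if the scanned character was upper case,
#            recognition factor if the scanned character was lower case,
#            language occurrence factor for the character)
FACTORS = {
    "T": (0.90, 0.05, 0.09),
    "H": (0.88, 0.08, 0.06),
    "E": (0.92, 0.04, 0.12),
    "h": (0.10, 0.85, 0.06),
    "e": (0.05, 0.90, 0.12),
    REJECT: (0.50, 0.50, 0.01),
}


def build_store(factors):
    """First embodiment: one precomputed value per symbol and case,
    the product of the two factors (stored here as a logarithm)."""
    return {
        sym: (math.log(up * freq), math.log(lo * freq))
        for sym, (up, lo, freq) in factors.items()
    }


def classify_word(word, store):
    """Accumulate running sums for both print conventions, then decide
    once for the whole word after its final symbol."""
    upper_sum = 0.0
    lower_sum = 0.0
    for sym in word:
        up, lo = store[sym]
        upper_sum += up
        lower_sum += lo
    # Ties fall to "lower" here; that choice is arbitrary.
    return "upper" if upper_sum > lower_sum else "lower"


def build_ratio_store(factors):
    """Second embodiment: a single value per symbol, the quotient of the
    upper case and lower case probabilities (stored here as a logarithm)."""
    return {sym: math.log(up / lo) for sym, (up, lo, _freq) in factors.items()}


def classify_word_by_ratio(word, ratio_store):
    """Sum the stored log-quotients; a positive total favors upper case."""
    total = sum(ratio_store[sym] for sym in word)
    return "upper" if total > 0 else "lower"


STORE = build_store(FACTORS)
RATIO_STORE = build_ratio_store(FACTORS)

for word in ("THE", "The", "H" + REJECT + "E"):
    print(word, classify_word(word, STORE), classify_word_by_ratio(word, RATIO_STORE))
```

With these invented values the mixed-case OCR output "The" receives a single lower case flag for the whole word, because the decision is made per word rather than per character. The two variants agree in this sketch because the same language factor is used for both cases and therefore cancels in the quotient.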
REFERENCES:
patent: 3634822 (1972-01-01), Chow
patent: 3651459 (1972-03-01), Hahn
Hilliard John Joseph
Mullan Philip Joseph
Rosenbaum Walter Steven
International Business Machines - Corporation
Jancin, Jr. J.
Thesz Joseph M.