Method of recognizing characters

Image analysis – Pattern recognition – Context analysis or word recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S176000, C382S190000, C382S292000, C345S469000

Reexamination Certificate

active

06549662

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a method of recognizing characters of a character string contained in data by detecting the layout of the character string on a document.
2. Description of the Related Art
Characters on documents have various kinds including Kanji characters, numerical characters, and alphabetical characters, and are available in different fonts including type and handwritten characters. In order to recognize these characters accurately, it is necessary to define the positions, kinds, and fonts of characters.
FIG. 31
of the accompanying drawings illustrates a document, and
FIG. 32
of the accompanying drawings illustrates a conventional method of recognizing characters.
In
FIG. 31
, a money transfer request slip is shown as a document. The illustrated money transfer request slip is written by Kanji characters and numeric characters as shown in FIG.
31
. The illustrated money transfer request slip has
29
character strings C
1
-C
29
. The transfer requester is “AIU system” as indicated by the character string C
2
. The designated date of transfer is “September 20, Heisei 7” as indicated by the character strings C
3
, C
4
.
Headers include a transfer destination (C
5
), an item (C
6
), an account number (C
7
), a receiver (C
8
), and a sum of money to be transferred (C
9
). Data corresponding to the header of the transfer destination include the character strings C
10
, C
11
, C
16
, c
17
, C
22
, C
23
. Data corresponding to the header of the item include the character strings C
12
, C
18
, C
24
. Data corresponding to the header of the account number include the character strings C
13
, C
19
, C
25
.
Data corresponding to the header of the receiver include the character strings C
14
, C
20
, C
26
. Data corresponding to the header of the sum of money to be transferred include the character strings C
15
, C
21
, C
27
. The money transfer request slip also has a header “total to be transferred” (C
28
) and its data (C
29
).
For recognizing the characters of the data on the money transfer request slip, it is necessary to define the positions and names of the data. If the kinds of the characters of the data are known, then it is possible to limit the range where the characters of the data are recognized, for character recognition of higher accuracy. To limit the range of character recognition, it is necessary to define a character category of the characters of the data and the king of the character font.
As shown in
FIG. 32
, the position, data name (transfer destination), the character category (Kanji), and the character font (type) are defined with respect to the character string C
10
, for example. Heretofore, it has been customary to generate, in advance, definition information which defines positions where characters are to be read, for each document, register the definition information in a recognition apparatus, read an image on a document according to the registered definition information, and recognize characters from the image.
Since definition information needs to be registered beforehand, however, characters can be recognized only for those documents with respect to which the definition information has been registered in advance. Banking organizations use various formats for money transfer request slips that are generated by corporations for automatically making money transfers. It is tedious and time-consuming to generate definition information for those documents in advance.
Even if definition information for documents is registered, the registered definition information should be changed when a document format is changed.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide a method of recognizing characters without the need for generation, in advance, of definition information of characters on documents.
Another object of the present invention is to provide a method of recognizing characters by automatically detecting the layout of characters on a document from an arrangement of character strings on a document.
Still another object of the present invention is to provide a method of recognizing characters by automatically detecting definition information of characters on a document to recognize characters of data thereon.
According the present invention, a method of recognizing characters of headers and characters of data on a document, comprises the steps of extracting character strings on the document by reading the document, distinguishing between headers and data on the document by determining the positional relationship between the character strings, determining character attributes of the data by recognizing characters of the character strings of the headers using a header recognition dictionary, and recognizing characters of the character strings of the data according to the determined character attributes of the data.
In the method, headers are determined from the positional relationship between character strings, and using the header recognition dictionary which has been registered in advance, the headers are recognized, and character attributes of the data are determined. Finally, character strings of the data are recognized according to the character attributes.
Because headers and data on documents are automatically distinguished from each other to recognize header characters, character attributes of the data can automatically be determined. Since headers are universal in nature and characters used therefor are limited, the header characters can easily be recognized. Furthermore, inasmuch as characters of data are recognized depending on the character attribute that has been determined, the characters of data are recognized with increased accuracy.
Other features and advantages of the present invention will become readily apparent from the following description taken in conjunction with the accompanying drawings.


REFERENCES:
patent: 5136520 (1992-08-01), Cox
patent: 5179650 (1993-01-01), Fukui et al.
patent: 5428720 (1995-06-01), Adams, Jr.
patent: 5504822 (1996-04-01), Holt
patent: 5563957 (1996-10-01), Ueno et al.
patent: 5673337 (1997-09-01), Gallo et al.
patent: 5768451 (1998-06-01), Hisamitsu et al.
patent: 5907631 (1999-05-01), Saitoh
patent: 5982387 (1999-11-01), Hellmann
patent: 6201894 (2001-03-01), Saito
patent: 6208744 (2001-03-01), Ishinge et al.
patent: 6438566 (2002-08-01), Okuno et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method of recognizing characters does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method of recognizing characters, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of recognizing characters will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3065537

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.