Method for recognizing multi-language printed documents...

Image analysis – Pattern recognition – Feature extraction

Reexamination Certificate

Details

C382S176000, C382S202000, C382S229000

Reexamination Certificate

active

06665437

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to an image processing technique in the field of pattern recognition and, more particularly, to a method for recognizing multi-language printed documents.
DESCRIPTION OF THE PRIOR ART
Most general documents are composed of different kinds of characters from multiple languages, such as Korean, English and Chinese, together with special marks and figures. Accordingly, when recognizing the characters included in such documents, it is very important to extract features appropriate to each kind of character.
Feature extraction systems for a single language have been developed, and support for multiple fonts has been introduced in this image processing technique. However, conventional single-language feature extraction systems cannot recognize multi-language text, whose fonts have varied features. Further, a method for recognizing multi-language printed documents that uses both a letter portion and a background portion, in the form of a mesh of a predetermined standard, as one feature for extraction has not yet been introduced.
SUMMARY OF THE INVENTION
It is, therefore, an object of the present invention to provide a method for recognizing multi-language printed documents having different styles of fonts.
It is another object of the present invention to provide a method that improves the recognition rate by extracting geometrical features from both a letter portion and a background portion in mesh form.
In accordance with an aspect of the present invention, there is provided a method for extracting character features for recognizing characters, the method comprising the steps of: a) normalizing the characters to a fixed size; b) converting the size-fixed characters into mesh-type characters; c) extracting stroke features of each of the mesh-type characters; d) extracting non-stroke features of each of the mesh-type characters; and e) extracting the character features using the stroke features and the non-stroke features.
In accordance with another aspect of the present invention, there is provided a method for extracting character features for recognizing characters, the method comprising the steps of: i) inputting the characters into an input means; ii) printing the input characters and scanning the printed characters to make character pictures; iii) constructing a standard input character set using the character pictures; iv) normalizing the character pictures to a fixed size; v) converting the size-fixed characters into mesh-type characters; vi) extracting stroke features of each of the mesh-type characters; vii) extracting non-stroke features of each of the mesh-type characters; and viii) extracting the character features using the stroke features and the non-stroke features.
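The feature-extraction steps a) through e) (equivalently iv) through viii)) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented procedure: the text above does not say how the mesh, the stroke features, or the non-stroke features are computed, so the nearest-neighbour resizing, the "any ink in the block" mesh rule, the per-row and per-column ink counts, and the background depth measured from the left and top edges are all hypothetical choices, as are the function names.

```python
def normalize(img, size=16):
    """Step a): nearest-neighbour resample of a binary image to size x size."""
    h, w = len(img), len(img[0])
    return [[img[r * h // size][c * w // size] for c in range(size)]
            for r in range(size)]


def to_mesh(img, cells=4):
    """Step b): split the image into cells x cells blocks.

    Assumption: a mesh cell is 1 (letter) if any pixel in its block is ink,
    otherwise 0 (background).
    """
    step = len(img) // cells
    return [[int(any(img[r][c]
                     for r in range(i * step, (i + 1) * step)
                     for c in range(j * step, (j + 1) * step)))
             for j in range(cells)]
            for i in range(cells)]


def stroke_features(mesh):
    """Step c), assumed form: ink-cell count per mesh row, then per column."""
    rows = [sum(row) for row in mesh]
    cols = [sum(col) for col in zip(*mesh)]
    return rows + cols


def non_stroke_features(mesh):
    """Step d), assumed form: number of background cells met before the
    first ink cell, scanning each row from the left and each column from
    the top."""
    def depth(line):
        n = 0
        for cell in line:
            if cell:
                break
            n += 1
        return n

    left = [depth(row) for row in mesh]
    top = [depth(col) for col in zip(*mesh)]
    return left + top


def extract_features(img, size=16, cells=4):
    """Step e): combine stroke and non-stroke features into one vector."""
    mesh = to_mesh(normalize(img, size), cells)
    return stroke_features(mesh) + non_stroke_features(mesh)


if __name__ == "__main__":
    # Hypothetical 8x8 glyph shaped like an "L": ink in column 0 and row 7.
    glyph = [[1 if (c == 0 or r == 7) else 0 for c in range(8)]
             for r in range(8)]
    print(extract_features(glyph))
```

The point of the sketch is that the combined vector carries information from both the letter cells and the background cells of the same mesh, which is the idea stated in the summary. Any real system would need to choose the normalized size, mesh resolution, and feature definitions to suit the scripts being recognized.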


REFERENCES:
patent: 4032887 (1977-06-01), Roberts
patent: 4468808 (1984-08-01), Mori et al.
patent: 4561106 (1985-12-01), Yoshida et al.
patent: 4903313 (1990-02-01), Tachikawa
patent: 5271068 (1993-12-01), Ueda et al.
patent: 5325447 (1994-06-01), Vogt, III
patent: 5442715 (1995-08-01), Gaborski et al.
patent: 5715336 (1998-02-01), Tanaka
patent: 5740273 (1998-04-01), Parthasarathy et al.
patent: 6011879 (2000-01-01), Nemoto et al.
patent: 6026177 (2000-02-01), Mong et al.
patent: 6188790 (2001-02-01), Yoshikawa et al.
patent: 6272238 (2001-08-01), Kugai
patent: 6366699 (2002-04-01), Kuwano et al.
Krtolica et al., “Two-stage connectivity algorithm for optical character recognition”, IEEE, pp. 179-182, 1993.*
Smith et al., “Handwritten character classification using nearest neighbor in large databases”, IEEE, pp. 915-919, 199.
