Image analysis – Image segmentation – Distinguishing text from other regions
Patent
1992-04-24
1997-10-21
Couso, Jose L.
Image analysis
Image segmentation
Distinguishing text from other regions
382171, 382174, 382180, 382226, 382178, G06K 934
Patent
active
056804792
ABSTRACT:
In a character recognition system or the like, method and apparatus for selecting blocks of pixels from pixel image data so as to permit identification and grouping of similarly-typed pixels, such as text-type pixels and non-text-type pixels. Pixel image data is inputted and, if the pixel image data is not binary image data then the pixel image data is converted into binary pixel image data. Blocks of pixel image data are selected by outlining contours of connected components in the pixel image data, determining whether the outlined connected components include text unit or non-text units based on the size of the outlined connected components, selectively connecting text units widthwisely to form text lines based on proximity of adjacent text units, and selectively connecting text lines vertically to form text blocks based on proximity of adjacent text lines and on the position of non-text units between text lines. A hierarchical tree is formed based on the outlined connected components.
REFERENCES:
patent: 4933984 (1990-06-01), Nakano et al.
patent: 5048107 (1991-09-01), Tachikawa
patent: 5065442 (1991-11-01), Kugai
patent: 5075895 (1991-12-01), Bessho
patent: 5091964 (1992-02-01), Shimomura
patent: 5093868 (1992-03-01), Tanaka et al.
patent: 5101439 (1992-03-01), Kiang
patent: 5101448 (1992-03-01), Kawachiya et al.
patent: 5129012 (1992-07-01), Abe
patent: 5307422 (1994-04-01), Wang
patent: 5313526 (1994-05-01), Cheong
patent: 5335290 (1994-08-01), Cullen et al.
patent: 5351314 (1994-09-01), Vaezi
Isao Masuda, et al., "Approach to Smart Document Reader System", IEEE 1985, pp. 550-557.
Hiroshi Makino, "Representation and Segmentation of Document Images", IEEE, 1983, pp. 291-296.
W. Doster, "A Step Towards Intelligent Document Input To Computers", et al., IEEE, 1983, pp. 515-516.
Qin Luo, et al., "A Structure Recognition Method For Japanese Newspapers", Symposium on Document Analysis and Information Retrieval, Mar., 1992 pp. 217-234.
L.A. Fletcher, et al., "A Robust Algorithm For Text String Separation From Mixed Text/Graphics Images", IEEE Transactions On Pattern Analysis and Machine Intelligence, vol. 10, No. 6, Nov., 1988, pp. 910-918.
Osamu Iwaki, et al., "A Segmentation Method Based On Office Document Hierarchical Structure", Proceedings of the 1987 IEEE International Conference on Systems, Man, and Cybernetics, vol. 2, pp. 759-763.
K.Y. Wong, et al., "Document Analysis System", IBM J. Res. Develop., vol. 26, No. 6, Nov., 1982, pp. 647-656.
M. Okamoto, et al., "A Hybrid Page Segmeutation Method", Proceedings of the Second International Conference on Document Analysis and Recognition, Oct. 1993, pp. 743-748.
D.J. Ittner, "Automatic Inference of Textline Orientation", Proccedings, Second Annual Symposium on Document Analysis & Information Retrieval, Apr. 1993, pp. 123-133.
Tsujimoto, et al., "Understanding Multi-articled Documents," 10th Int'l Conf. on Pattern Recognition, IEEE, vol. 1, Jun. 16-21, 1990, pp. 551-556.
James L. Fisher, et al., "A Rule-Based System for Document Images", SPIE vol. 1258 Image Communications and Workstations, pp. 78-88.
Teruo Akiyama, et al., "Automated Entry System for Printed Documents", Pattern Recognition, vol. 23, No. 11, 1990, pp. 1141-1154.
K. Y. Wong, et al., "Document Analysis System", IBM J. Res. Develop., vol. 26, No. 6, Nov. 1982, pp. 647-656.
Stuart C. Hinds, et al., "A Document Skew Detection Method Using Run-Length Encoding And The Hough Transform", IEEE, May 1990, pp. 464-468.
James L. Fisher, et al., "A Rule-Based System For Document Image Segmentation", IEEE, May 1990, pp. 567-572.
Philip J. Bones, et al., "Segmentation Of Document Images", SPIE vol. 1258 Image Communications and Workstations (1990), pp. 78-88.
Yamada, et al., "Document Image Processing Based on Enhanced Border Following Algorithm", IEEE Proceedings of the 10th International Conference on Pattern Recognition, vol. 2, Jun. 21, 1990, pp. 231-236.
Mizuno, et al., "Document Recognition System With Layout Structure Generator", NEC Research And development, vol. 32, No. 3, Jul. 1991, pp. 430-437.
Pizano, et al., "A Business Form Recognition System", COMPSAC91 Proceedings, The Fifteenth Annual International Computer Software & Applications Conference, Sep. 13, 1991, pp. 626-632.
"Line Segmentation Method For Documents In European Languages", IBM Technical Disclosure Bulletin, vol. 33, No. 1B, Jun. 1990, pp. 207-210.
Sherrick Christopher Allen
Vaezi Mehrzad R.
Wang Shin-Ywan
Bella Matthew C.
Canon Kabushiki Kaisha
Couso Jose L.
LandOfFree
Method and apparatus for character recognition does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for character recognition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for character recognition will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1013329