Method and apparatus for image based document processing

Image analysis – Pattern recognition – Classification

Patent

Details

382/229, 382/306, 358/403, 707/5, 707/6, G06K 9/62, G06K 9/72, G06K 9/54, G06K 9/60

Patent

active

059434430

ABSTRACT:
The present invention provides a document processing apparatus, a document processing method, and a storage medium storing the method, with the aim of offering document filing in which documents can be registered quickly and at low computation cost, and retrieval can be performed with few omissions. In the document processing apparatus, a similar character classifying element classifies characters in a document image into categories of similar characters in advance and stores the categories together with their representative image features. When a document image is registered, a pseudo character recognizing element does not identify each character in the text region; instead, it classifies each character into a character category using fewer image features than ordinary character recognition uses, and stores the resulting category strings in association with the input image. In retrieval, a retrieval executing element converts each character in the retrieval keyword into its nearest category and returns, as the retrieval result, every document that contains the converted category string as a substring.
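
The registration and retrieval flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the two-dimensional feature vectors, the squared Euclidean distance, the `char_to_category` lookup table, and all names (`nearest_category`, `CategoryIndex`, `register`, `search`) are hypothetical choices made only for this example.

```python
from dataclasses import dataclass, field

# Assumption: each character image has already been reduced to a small
# feature vector. The abstract does not say which features are used.
Feature = tuple[float, ...]


def nearest_category(feature: Feature, representatives: dict[int, Feature]) -> int:
    """Return the id of the category whose representative feature is closest.

    Squared Euclidean distance is an illustrative choice, not the patent's.
    """
    return min(
        representatives,
        key=lambda cid: sum((a - b) ** 2 for a, b in zip(feature, representatives[cid])),
    )


@dataclass
class CategoryIndex:
    # Built in advance by the "similar character classifying" step.
    representatives: dict[int, Feature]
    # Hypothetical table mapping each keyword character to its category.
    char_to_category: dict[str, int]
    # doc_id -> category string (stored as a list of category ids).
    documents: dict[str, list[int]] = field(default_factory=dict)

    def register(self, doc_id: str, char_features: list[Feature]) -> None:
        """Pseudo recognition: map each character image to a category only."""
        self.documents[doc_id] = [
            nearest_category(f, self.representatives) for f in char_features
        ]

    def search(self, keyword: str) -> list[str]:
        """Convert the keyword to a category string and match it as a substring."""
        query = [self.char_to_category[ch] for ch in keyword]
        if not query:
            return []
        n = len(query)
        return [
            doc_id
            for doc_id, cats in self.documents.items()
            if any(cats[i:i + n] == query for i in range(len(cats) - n + 1))
        ]


# Toy data: visually similar characters share one category.
index = CategoryIndex(
    representatives={0: (0.0, 0.0), 1: (1.0, 0.0), 2: (0.0, 1.0)},
    char_to_category={"O": 0, "0": 0, "l": 1, "1": 1, "I": 1, "x": 2},
)
index.register("a", [(0.1, 0.0), (0.9, 0.1), (0.1, 0.9)])
index.register("b", [(0.0, 0.9), (0.05, 0.1)])
```

In this toy setup the keywords "1x" and "lx" convert to the same category string, so both retrieve document "a". Matching on categories rather than on identified characters is what allows registration to skip full character recognition, at the cost of retrieving some extra documents whose text differs only within a category.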

REFERENCES:
patent: 5029084 (1991-07-01), Morohasi et al.
patent: 5075896 (1991-12-01), Wilcox et al.
patent: 5261009 (1993-11-01), Bokser
patent: 5265242 (1993-11-01), Fujisawa et al.
patent: 5325444 (1994-06-01), Cass et al.
patent: 5375176 (1994-12-01), Spitz
patent: 5438630 (1995-08-01), Chen et al.
patent: 5440651 (1995-08-01), Martin
patent: 5487117 (1996-01-01), Burges et al.
patent: 5524065 (1996-06-01), Yagasaki
patent: 5628003 (1997-05-01), Fujisawa et al.
patent: 5745602 (1998-04-01), Chen et al.
patent: 5818952 (1998-10-01), Takenouchi et al.
patent: 5825926 (1998-10-01), Tanaka
Keyword Search for Japanese Image Text, Minoru Yusa and Yuzuru Tanaka, Faculty of Engineering, Hokkaido University, Jan. 1995.
Document Reconstruction: A Thousand Words from One Picture, Jeffrey C. Reynar, A. Lawrence Spitz and Penelope Sibun, University of Pennsylvania, Dept. of Computer and Information Science, 1994.
A Method of Document-image Segmentation Based on Projection Profiles, Stroke Densities and Circumscribed Rectangles, Teruo Akiyama and Isao Masuda, NTT Electrical Communications Laboratories.
A Method for Composing the Extended Dictionary in which the Same Character is Involved in the Different Clusters for a Hierarchical Chinese Characters Recognition System, Akiyoshi Itoh and Takeshi Endoh, College of Science and Technology, Nihon University, and Keitaroh Hori and Tohru Shimamura, Members, Graduate School of Science and Technology, Nihon University, Jun. 1995.
Handprinted Chinese Characters Recognition by Peripheral Direction Contributivity Feature, Norihiro Hagita, Seiichiro Naito and Isao Masuda, Members, Musashino Electrical Communication Laboratory, N.T.T.
Key Search Strategies--Trie and Its Applications, by Junichi Aoe, The University of Tokushima, Faculty of Engineering Department of Information Science and Intelligent Systems, Feb., 1993.
