Patent
1997-01-21
1999-04-06
Boudreau, Leo H.
Image analysis
Image segmentation
Distinguishing text from other regions
382/180, 382/204, 358/462, G06K 9/34
active
058928430
ABSTRACT:
The bitmap image data is analyzed by connected component extraction to identify connected components that represent either individual characters or regions of a nontext image. The connected components are classified as text or nontext based on geometric attributes such as the number of holes, arcs and line ends comprising each component. A nearest-neighbor analysis then identifies which text components represent lines or strings of text, and each line or string is further analyzed to determine its vertical or horizontal orientation. Thereafter, separate vertical and horizontal font height filters are used to identify those text strings that are the most likely title candidates. For the most likely title candidates, a bounding box is defined which can be associated with or overlaid upon the original bitmap data to select the title region for further processing or display. Captions and photographs can also be located.
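The pipeline outlined in the abstract can be illustrated with a minimal sketch. This is not the patented method. It assumes a tiny binary bitmap, 8-connectivity, and a simple bounding-box height threshold standing in for the patent's geometric classification (holes, arcs, line ends) and its font height filters. The function names connected_components, filter_by_height and enclosing_box are hypothetical and chosen only for this illustration.

```python
from collections import deque


def connected_components(bitmap):
    """Return one bounding box (top, left, bottom, right) per
    8-connected component of 1-pixels, in raster-scan order."""
    rows = len(bitmap)
    cols = len(bitmap[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if bitmap[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and bitmap[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def filter_by_height(boxes, min_height):
    """Keep components at least min_height pixels tall (assumed stand-in
    for a font height filter; the threshold is illustrative)."""
    return [b for b in boxes if b[2] - b[0] + 1 >= min_height]


def enclosing_box(boxes):
    """Smallest box that contains every box in the list."""
    return (min(b[0] for b in boxes), min(b[1] for b in boxes),
            max(b[2] for b in boxes), max(b[3] for b in boxes))


# Toy page: two tall "title" glyphs above three one-pixel "body" marks.
page = [
    "##.##.....",
    "##.##.....",
    "##.##.....",
    "..........",
    "#.#.#.....",
]
bitmap = [[1 if ch == "#" else 0 for ch in row] for row in page]
boxes = connected_components(bitmap)
candidates = filter_by_height(boxes, min_height=3)
print(enclosing_box(candidates))  # (0, 0, 2, 4)
```

The sketch omits the nearest-neighbor grouping into text lines and the orientation analysis; a fuller version would group components into strings before applying any height filter.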
REFERENCES:
patent: 4503556 (1985-03-01), Scheri et al.
patent: 4741046 (1988-04-01), Matsunawa et al.
patent: 4750209 (1988-06-01), Shimura et al.
patent: 4893188 (1990-01-01), Murakami et al.
patent: 5001767 (1991-03-01), Yoneda et al.
patent: 5351314 (1994-09-01), Vaezi
patent: 5555362 (1996-09-01), Yamashita et al.
patent: 5588072 (1996-12-01), Wang
patent: 5680479 (1997-10-01), Wang et al.
patent: 5699453 (1997-12-01), Ozaki
patent: 5703962 (1997-12-01), Niki et al.
patent: 5748865 (1998-05-01), Yamamoto et al.
patent: 5751849 (1998-05-01), Ikeda
patent: 5757957 (1998-05-01), Tachikawa
patent: 5767978 (1998-06-01), Revankar et al.
patent: 5774579 (1998-06-01), Wang et al.
patent: 5848184 (1998-12-01), Taylor et al.
patent: 5848191 (1998-12-01), Chen et al.
Lopresti Daniel P.
Zhou Jiangying
Boudreau Leo H.
Matsushita Electric Industrial Co., Ltd.
Mehta Bhavesh
Title, caption and photo extraction from scanned document images