System and method for identifying and labeling fields of...

Image analysis – Image segmentation

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S176000, C715S252000

Reexamination Certificate

active

07965891

ABSTRACT:
A system for electronically distilling information from a business document uses a network scanner to electronically scan a platen area, having a business document thereon, to create a bitmap. A network server carries out a segmentation process to segment the scan generated bitmap into a bitmap object, the bitmap object corresponding to the scanned business document; a bitmap to text conversion process to convert the bitmap object into a block of text; a semantic recognition process to generate a structured representation of semantic entities corresponding to the scanned business document; and a document generation process to convert the structured representation into a structure text file. The semantic recognition process includes the processes of generating, for each line of text having a keyword therein, a terminal symbol corresponding to the keyword therein; generating, for each line of text not having a keyword therein and absent of numeric characters, an alphabetic terminal symbol; generating, for each line of text not having a keyword therein and having a numeric character therein, an alphanumeric terminal symbol; generating a string of terminal symbols from the generated terminal symbols; determining a probable parsing of the generated string of terminal symbols; labeling each text line, according to a determined function, with non-terminal symbols; and parsing the business document information text into fields of business document information text based upon the non-terminal symbol of each text line and the determined probable parsing of the generated string of terminal symbols.

REFERENCES:
patent: 3654611 (1972-04-01), Bluethman et al.
patent: 5321773 (1994-06-01), Kopec et al.
patent: 5375055 (1994-12-01), Togher et al.
patent: 5438512 (1995-08-01), Mantha et al.
patent: 5526444 (1996-06-01), Kopec et al.
patent: 5530794 (1996-06-01), Luebbert
patent: 5625465 (1997-04-01), Lech et al.
patent: 5787198 (1998-07-01), Agazzi et al.
patent: 5841900 (1998-11-01), Rahgozar et al.
patent: 6067555 (2000-05-01), Hayashi
patent: 6115495 (2000-09-01), Tachikawa et al.
patent: 6181820 (2001-01-01), Tachikawa et al.
patent: 6249765 (2001-06-01), Adler et al.
patent: 6342901 (2002-01-01), Adler et al.
patent: 6377703 (2002-04-01), Yeung
patent: 6539379 (2003-03-01), Vora et al.
patent: 6675356 (2004-01-01), Adler et al.
patent: 6704456 (2004-03-01), Venable
patent: 6738154 (2004-05-01), Venable
patent: 6832350 (2004-12-01), Bates et al.
patent: 6836760 (2004-12-01), Bellegarda et al.
patent: 6952281 (2005-10-01), Irons et al.
patent: 7035821 (2006-04-01), Smith et al.
patent: 7370045 (2008-05-01), Vora et al.
patent: 7519903 (2009-04-01), Yahagi
patent: 7689037 (2010-03-01), Handley et al.
patent: 7809156 (2010-10-01), Piersol et al.
patent: 2002/0001393 (2002-01-01), Jones et al.
patent: 2002/0038319 (2002-03-01), Yahagi
patent: 2004/0010757 (2004-01-01), McCoy et al.
patent: 2004/0044958 (2004-03-01), Wolf et al.
patent: 2004/0205449 (2004-10-01), Hayes
patent: 2004/0205668 (2004-10-01), Eastlake, III
patent: 2004/0236741 (2004-11-01), Burstrom et al.
patent: 2006/0088214 (2006-04-01), Handley et al.
patent: 2009/0119574 (2009-05-01), Gitlin et al.
patent: 2010/0150397 (2010-06-01), Handley et al.
Prosecution history between Mar. 10, 2010 and Sep. 24, 2010 of U.S. Appl. No. 12/710,568.
Bayer. T; Walischewski, H; Experiments on Extracting Structural Information from Paper Documents using Syntactic Pattern Analysis; IEEE 1995; pp. 476-479.
Bruckner, T.; Suda, P.; Block, H.; Maderlechner, G; In-house Mail Distribution by Automatic Address and Content Interpretation; SDAIR 1995; pp. 67-76.
Chang, F.; Retrieving Information from Document Images: Problems and Solutions; pp. 1-28, May 25, 2000.
Chiou, Y; Lee, H.; Recognition of Chinese Business Cards; IEEE 1997; pp. 1028-1032.
Dengel, A.; Bleisinger, R.; Fein, F.; Hoch, R.; Hones, F.; Malburg, M; Officemaid—A System for Office Mail Analysis, Interpretation and Delivery; pp. 52-73, Apr. 15-17, 1996.
He, J.; Downton, A.; User-Assisted Archive Document Image Analysis for Digital Library Construction: IEEE 2003.
Ishitani, Y; Document Transformation System from Papers to XML Data Based on Pivot XML Document Method, IEEE 2003.
Kanungo, T.; Mao, S.; Stochastic Language Model for Analyzing Document Pysical Layout; SPIE vol. 4670: 2002:pp. 28-36.
Kieninger, T.; Dengel, Al.; Applying the T-Recs Table Recognition System To The Business Letter Domain: IEE 2001: pp. 518-522.
Klink, S; Dengel, A; Kieninger, T.; Document Structure Analysis Based on Layout and Textual Features, 2000.
Kushmerick, N; Johnston, E.; McGuinness, S.; Information Extraction by Text Classification IJCAI 2001: pp. 1-7.
Liang, J;•Doermann, D.; Content Features for Logical Document Labeling; SPIE vol. 5010; 2003; pp. 189-196.
Lipshutz, M.; Taylor, S. Functional Decomposition of Business Letters; pp. 435-447.
Manning, C.; Schutze, H.; Foundations of Statistical Natural Language Processing; MIT Press:1999: pp. 381-405.
Pan, W; Jin, J.;Shi, G.; Wang, Q; A System for Automatic Chinese Business Card Recognition; IEEE 2001' pp. 577-581.
Shi, G; Pan, W; Jin, J.; Automatic Information Retrieval of Chinese Business Card; Proceedings of SPIE-IS&T Electronic Imaging, SPIE vol. 5010: 2003: pp. 241-248.
Walischewski, H.; Automated Knowledge Acquisition for Spatial Document Interpretation; ICDAR 1997; pp. 243-247.
Watanabe, T.; Huang, X; Automatic Acquisition of Layout Knowledge for Understanding Business Cards; IEEE 1997; pp. 216-220.
Zhenlong, B.; A General Approach to Informative Text Line Extraction for Document Analysis and Retrieval, 2001.
Prosecution history, as of Mar. 8, 2010 for U.S. Appl. No. 10/970,930.
Co-pending U.S. Appl. No. 12/710,568, filed Feb. 23, 2010.
Bayer, T; Walischewski, H; Experiments on Extracting Structural Information from Paper Documents using Syntactic Pattern Analysis; IEEE 1995; pp. 476-479.
Bodnar, A.; Jugoon, A.; Rose, A; Blostein, D.; A Grammatical Approach to Tagging Text on Business Cards; Queens's University, Kingston, Ontario, Canada.
Bruckner, T.; Suda, P.; Block, H.; Maderlechner, G.; In-house Mail Distribution by Automatic Address and Content Interpretation; SDAIR 1995; pp. 67-76.
Chang, F.; Retrieving Information from Document Images: Problems and Solutions; pp. 1-28.
Chiou, Y; Lee, H.; Recognition of Chinese Business cards; IEEE 1997; pp. 1028-1032.
Dengel, A.; Bleisinger, R.; Fein, F.; Hoch, R.; Hones, F.; Malburg, M.; Officemaid-A System for Office Mail Analysis, Interpretation and Delivery; pp. 52-73; Apr. 15-17, 1996.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for identifying and labeling fields of... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for identifying and labeling fields of..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for identifying and labeling fields of... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2711256

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.