Image analysis – Image segmentation
Reexamination Certificate
2010-02-23
2010-12-28
Mariam, Daniel G (Department: 2624)
Image analysis
Image segmentation
C715S252000, C382S180000
Reexamination Certificate
active
07860312
ABSTRACT:
A system for electronically distilling information from a business document uses a network scanner to electronically scan a platen area, having a business document thereon, to create a bitmap. A network server carries out a segmentation process to segment the scan generated bitmap into a bitmap object, the bitmap object corresponding to the scanned business document; a bitmap to text conversion process to convert the bitmap object into a block of text; a semantic recognition process to generate a structured representation of semantic entities corresponding to the scanned business document; and a document generation process to convert the structured representation into a structure text file. The semantic recognition process includes the processes of generating, for each line of text having a keyword therein, a terminal symbol corresponding to the keyword therein; generating, for each line of text not having a keyword therein and absent of numeric characters, an alphabetic terminal symbol; generating, for each line of text not having a keyword therein and having a numeric character therein, an alphanumeric terminal symbol; generating a string of terminal symbols from the generated terminal symbols; determining a probable parsing of the generated string of terminal symbols; labeling each text line, according to a determined function, with non-terminal symbols; and parsing the business document information text into fields of business document information text based upon the non-terminal symbol of each text line and the determined probable parsing of the generated string of terminal symbols.
REFERENCES:
patent: 5321773 (1994-06-01), Kopec et al.
patent: 5375055 (1994-12-01), Togher et al.
patent: 5438512 (1995-08-01), Mantha et al.
patent: 5526444 (1996-06-01), Kopec et al.
patent: 5787198 (1998-07-01), Agazzi et al.
patent: 5841900 (1998-11-01), Rahgozar et al.
patent: 6115495 (2000-09-01), Tachikawa et al.
patent: 6181820 (2001-01-01), Tachikawa et al.
patent: 6249765 (2001-06-01), Adler et al.
patent: 6342901 (2002-01-01), Adler et al.
patent: 6539379 (2003-03-01), Vora et al.
patent: 6675356 (2004-01-01), Adler et al.
patent: 6704456 (2004-03-01), Venable
patent: 6738154 (2004-05-01), Venable
patent: 6836760 (2004-12-01), Silverman et al.
patent: 7035821 (2006-04-01), Smith et al.
patent: 7370045 (2008-05-01), Vora et al.
patent: 7689037 (2010-03-01), Handley et al.
patent: 2004/0236741 (2004-11-01), Burstrom et al.
patent: 2006/0088214 (2006-04-01), Handley et al.
patent: 2010/0149606 (2010-06-01), Handley et al.
Bayer. T; Walischewski, H; Experiments on Extracting Structural Information from Paper Documents using Syntactic Pattern Analysis; IEEE 1995; pp. 476-479.
Bruckner, T.; Suda, P.; Block, H.; Maderlechner, G; In-house Mail Distribution by Automatic Address and Content Interpretation; SDAIR 1995; pp. 67-76.
Chiou, Y; Lee, H.; Recognition of Chinese Business Cards; IEEE 1997; pp. 1028-1032.
He, J.; Downton, A.; User-Assisted Archive Document Image Analysis for Digital Library Construction; IEEE 2003.
Ishitani, Y; Document Transformation System from Papers to XML Data Based on Pivot XML Document Method, IEEE 2003.
Kanungo, T.; Mao, S.; Stochastic Language Model for Analyzing Document Pysical Layout; SPIE vol. 4670; 2002;pp. 28-36.
Kieninger, T.; Dengel, A.; Applying the T-Recs Table Recognition System to the Business Letter Domain; IEEE 2001; pp. 518-522.
Kushmerick, N; Johnston, E.; McGuinness, S.; Information Extraction by Text Classification; IJCAI 2001; pp. 1-7.
Liang, J;•Doermann, D.; Content Features for Logical Document Labeling; SPIE vol. 5010; 2003; pp. 189-196.
Manning, C.; Schutze, H.; Foundations of Statistical Natural Language Processing; MIT Press;1999; pp. 381-405.
Pan, W; Jin, J.;Shi, G.; Wang, Q; A System for Automatic Chinese Business Card Recognition; IEEE 2001' pp. 577-581.
Shi, G; Pan, W; Jin, J.; Automatic Information Retrieval of Chinese Business Card; Proceedings of SPIE-IS&T Electronic Imaging, SPIE vol. 5010; 2003; pp. 241-248.
Walischewski, H.; Automated Knowledge Acquisition for Spatial Document Interpretation; ICDAR 1997; pp. 243-247.
Watanabe, T.; Huang, X; Automatic Acquisition of Layout Knowledge for Understanding Business Cards; IEEE 1997; pp. 216-220.
An unofficial prosecution history, as of Mar. 8, 2010 for U.S. Appl. No. 10/970,930.
An unofficial co-pending U.S. Appl. No. 12/710,573, filed Feb. 23, 2010.
Bodnar, A.; Jugoon, A.; Rose, A.; Blostein, D.; A Grammatical Approach to Tagging Text on Business Cards; Queen's University, Kingston, Ontario, Candada.
Chang, F.; Retrieving Information from Document Images: Problems and Solutions; pp. 1-28.
Dengel, A.; Bleisinger, R.; Fein, F.; Hoch, R.; Hones, F.; Malburg, M; Officemaid-A System for Office Mail Analysis, Interpretation and Delivery; pp. 52-73.
Klink, S; Dengel, A; Kieninger, T.; Document Structure Analysis Based on Layout and Textual Features.
Lipshutz, M.; Taylor, S. Functional Decomposition of Business Letters; pp. 435-447.
Zhenlong, B.; A General Approach to Informative Text Line Extraction for Document Analysis and Retrieval.
Handley John C.
Namboodiri Anoop M.
Rahgozar M. Armon
Spiteri Pamela B.
Venable Dennis L.
Basch & Nickerson LLP
Mariam Daniel G
Nickerson Michael J.
Woldemariam Aklilu K
Xerox Corporation
LandOfFree
System and method for identifying and labeling fields of... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for identifying and labeling fields of..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for identifying and labeling fields of... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4187881