Image analysis – Pattern recognition – Classification
Reexamination Certificate
2005-04-25
2009-10-13
Hutton, Dough (Department: 2176)
C715S247000, C715S244000, C715S245000, C715S227000, C382S175000, C382S180000, C382S170000, C382S172000, C358S453000
active
07602972
ABSTRACT:
One embodiment of the present invention provides a system that facilitates detecting white space tables within a document, wherein a white space table comprises text arranged in rows and columns, and at least some of the rows and columns are separated by bands of white space rather than by lines. The system operates by identifying an area that includes consecutive lines of text objects with an amount of white space between text objects greater than a specified value. A text object is a string of text that contains no run of white space greater than the specified value. The system then determines whether the text objects on consecutive lines have widths that are within a specified tolerance of each other. If so, the system checks the spaces between the consecutive lines of text objects to determine whether they belong to a single white space table or to multiple white space tables. The system also checks whether the consecutive lines of text objects form a true table by determining whether the consecutive lines of text can be organized into a number of rows and columns. Finally, the system creates a new white space table for the area.
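The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes plain monospaced text where positions are character columns, treats only space characters as white space, reads the "widths within a tolerance" check as a comparison of each line's overall extent, and lets any line with fewer than two text objects (including a blank line) end the current table. The names `text_objects`, `column_ranges`, `find_white_space_tables`, `max_inner_gap`, and `width_tolerance` are hypothetical and chosen for illustration only.

```python
import re
from typing import List, Tuple

Span = Tuple[int, int, str]  # (start column, end column exclusive, text)


def text_objects(line: str, max_inner_gap: int = 1) -> List[Span]:
    """Split a line into text objects: strings containing no run of
    spaces longer than max_inner_gap (assumed reading of the abstract)."""
    if max_inner_gap >= 1:
        pattern = r"\S+(?: {1,%d}\S+)*" % max_inner_gap
    else:
        pattern = r"\S+"
    return [(m.start(), m.end(), m.group()) for m in re.finditer(pattern, line)]


def extent(row: List[Span]) -> int:
    """Overall width of a line, from its first object to its last."""
    return row[-1][1] - row[0][0]


def column_ranges(block: List[List[Span]], max_inner_gap: int) -> List[Tuple[int, int]]:
    """Merge the horizontal spans of all objects in the block; the gaps
    left between merged spans are the vertical bands of white space."""
    merged: List[List[int]] = []
    for start, end in sorted((s, e) for row in block for s, e, _ in row):
        if merged and start - merged[-1][1] <= max_inner_gap:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


def find_white_space_tables(lines: List[str], max_inner_gap: int = 1,
                            width_tolerance: int = 4) -> List[List[List[str]]]:
    """Return each detected table as a list of rows of cell strings."""
    tables: List[List[List[str]]] = []
    block: List[List[Span]] = []

    def flush() -> None:
        # A "true table" here needs at least two rows and two columns.
        if len(block) >= 2:
            cols = column_ranges(block, max_inner_gap)
            if len(cols) >= 2:
                tables.append([
                    [" ".join(t for s, _, t in row if c0 <= s < c1) for c0, c1 in cols]
                    for row in block
                ])
        block.clear()

    for line in lines:
        objs = text_objects(line, max_inner_gap)
        candidate = len(objs) >= 2  # line contains at least one wide gap
        if candidate and block and abs(extent(objs) - extent(block[-1])) > width_tolerance:
            flush()  # widths differ too much: start a separate table
        if candidate:
            block.append(objs)
        else:
            flush()  # prose or blank line ends the current area
    flush()
    return tables


if __name__ == "__main__":
    sample = [
        "Quarterly report",
        "Item      Qty   Price",
        "Apple     3     1.20",
        "Pear      12    0.80",
        "End of report",
    ]
    for table in find_white_space_tables(sample):
        for row in table:
            print(row)
```

Merging the horizontal spans of every text object in the area is one simple way to test whether the lines can be organized into columns: any column position left uncovered by the merged spans is white space on every line of the area. A system working on rendered pages would use measured coordinates and vertical line spacing instead of character columns and blank lines.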
REFERENCES:
patent: 4504969 (1985-03-01), Suzuki et al.
patent: 5075895 (1991-12-01), Bessho
patent: 5091964 (1992-02-01), Shimomura
patent: 5119437 (1992-06-01), Kuwamura et al.
patent: 5191612 (1993-03-01), Katsuyama et al.
patent: 5235653 (1993-08-01), Nakano et al.
patent: 5384864 (1995-01-01), Spitz
patent: 5485566 (1996-01-01), Rahgozar
patent: 5572601 (1996-11-01), Bloomberg
patent: 5956422 (1999-09-01), Alam
patent: 6012056 (2000-01-01), Menlove
patent: 6104835 (2000-08-01), Han
patent: 6121963 (2000-09-01), Ange
patent: 6247018 (2001-06-01), Rheaume
patent: 6408093 (2002-06-01), Hu et al.
patent: 6976266 (2005-12-01), Strong et al.
patent: 2001/0044798 (2001-11-01), Nagral et al.
patent: 2002/0087573 (2002-07-01), Reuning et al.
patent: 2003/0078973 (2003-04-01), Przekop et al.
patent: 2003/0097384 (2003-05-01), Hu et al.
patent: 2004/0006742 (2004-01-01), Slocombe
patent: 2004/0205594 (2004-10-01), Arora et al.
patent: 2005/0091251 (2005-04-01), Ramarao
patent: 2005/0273573 (2005-12-01), Liu et al.
patent: 2006/0155700 (2006-07-01), Dejean et al.
patent: 2006/0200751 (2006-09-01), Underwood et al.
Yildiz, Burcu, Information Extraction—Utilizing Table Patterns, Aug. 2004, Informatik, pp. 1-4, 36, 47-65.
Tupaj, Scott et al., Extracting Tabular Information From Text Files, 1996, Tufts University, pp. 1-19.
Pinto, David et al., Table Extraction Using Conditional Random Fields, Aug. 1, 2003, ACM, SIGIR '03, pp. 1-8.
Yildiz, Burcu, Information Extraction—Utilizing Table Patterns, Aug. 2004, Informatik, pp. 1-65.
Hu, et al., “Document image layout comparison and classification”, Lucent Technologies, Bell Labs.
Gaither Shawn A.
Wei Bryan Z.
Adobe Systems Incorporated
Hillery Nathan
Hutton Dough
Kowert Robert C.
Meyertons Hood Kivlin Kowert & Goetzel P.C.