Method and apparatus for identifying white space tables...

Image analysis – Pattern recognition – Classification

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C715S247000, C715S244000, C715S245000, C715S227000, C382S175000, C382S180000, C382S170000, C382S172000, C358S453000

Reexamination Certificate

active

07602972

ABSTRACT:
One embodiment of the present invention provides a system that facilitates detecting white space tables within a document, wherein a white space table is comprised of text arranged in rows and columns, wherein at least some of the rows and columns are separated by bands of white space rather than by lines. The system operates by identifying an area that includes consecutive lines of text objects with an amount of white space between text objects greater than a specified value. Note that a text object is a string of text without an amount of white space greater than the specified value. The system then determines if the text objects on consecutive lines have widths that are within a specified tolerance of each other. If so, the system checks the spaces between the consecutive lines of text objects to determine if they belong to a single white space table or multiple white space tables. The system also checks if the consecutive lines of text objects form a true table by determining if the consecutive the lines of text can be organized in a number of rows and columns. Finally, the system creates a new white space table for the area.

REFERENCES:
patent: 4504969 (1985-03-01), Suzuki et al.
patent: 5075895 (1991-12-01), Bessho
patent: 5091964 (1992-02-01), Shimomura
patent: 5119437 (1992-06-01), Kuwamura et al.
patent: 5191612 (1993-03-01), Katsuyama et al.
patent: 5235653 (1993-08-01), Nakano et al.
patent: 5384864 (1995-01-01), Spitz
patent: 5485566 (1996-01-01), Rahgozar
patent: 5572601 (1996-11-01), Bloomberg
patent: 5956422 (1999-09-01), Alam
patent: 6012056 (2000-01-01), Menlove
patent: 6104835 (2000-08-01), Han
patent: 6121963 (2000-09-01), Ange
patent: 6247018 (2001-06-01), Rheaume
patent: 6408093 (2002-06-01), Hu et al.
patent: 6976266 (2005-12-01), Strong et al.
patent: 2001/0044798 (2001-11-01), Nagral et al.
patent: 2002/0087573 (2002-07-01), Reuning et al.
patent: 2003/0078973 (2003-04-01), Przekop et al.
patent: 2003/0097384 (2003-05-01), Hu et al.
patent: 2004/0006742 (2004-01-01), Slocombe
patent: 2004/0205594 (2004-10-01), Arora et al.
patent: 2005/0091251 (2005-04-01), Ramarao
patent: 2005/0273573 (2005-12-01), Liu et al.
patent: 2006/0155700 (2006-07-01), Dejean et al.
patent: 2006/0200751 (2006-09-01), Underwood et al.
Yildiz, Burcu, Information Extraction—Utilizing Table Patterns, Aug. 2004, Informatik, pp. 1-4, 36, 47-65.
Tupaj, Scott et al., Extracting Tabular Information From Text Files, 1996, Tufts University, pp. 1-19.
Pinto, David et al., Table Extraction Using Conditional Random Fields, Aug. 1, 2003, ACM, SIGIR '03, pp. 1-8.
Yildiz, Burcu, Information Extraction—Utilizing Table Patterns, Aug. 2004, Informatik, pp. 1-65.
Hu, et al., “Document image layout comparison and classification”, Lucent Technologies, Bell Labs.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for identifying white space tables... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for identifying white space tables..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for identifying white space tables... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4123584

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.