System for searching a corpus of document images by user specifi

Image analysis – Image transformation or preprocessing – Image storage or retrieval

Patent

Details

382/180, G06K 9/54

Patent

active

059996641

ABSTRACT:
A document search system provides a user with a programming interface for dynamically specifying features of documents recorded in a corpus of documents. The programming interface operates at a high level that is suitable for interactive user specification of the layout components and structures of documents. In operation, the document search system analyzes a bitmap image of a document to identify layout objects such as text blocks or graphics. The system then computes a set of attributes for each identified layout object. These attributes describe the layout structure of a page image in terms of the spatial relations that layout objects have to frames of reference defined by other layout objects. Once attributes have been computed for each layout object, a user can operate the programming interface to define unique document features. Each document feature is a routine defined by a sequence of selection operations that consume a first set of layout objects and produce a second set of layout objects. The second set of layout objects constitutes the feature in a page image of a document. With the programming interface, a user can flexibly define a genre of document from the user-specified document features.
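
The idea of a feature as a sequence of selection operations can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `LayoutObject`, `of_kind`, `topmost`, `below`, `feature`, and `in_genre` are hypothetical, bounding boxes stand in for the computed attributes, and treating a page as a genre member when every feature selects at least one object is an assumption made only for this example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class LayoutObject:
    """A layout object with a few illustrative attributes (hypothetical)."""
    kind: str      # e.g. "text" or "graphics"
    x: float       # left edge
    y: float       # top edge (y grows downward)
    w: float       # width
    h: float       # height

    @property
    def bottom(self) -> float:
        return self.y + self.h


# A selection operation consumes one set of layout objects and produces another.
Selection = Callable[[List[LayoutObject]], List[LayoutObject]]


def of_kind(kind: str) -> Selection:
    """Keep only objects of the given kind."""
    return lambda objs: [o for o in objs if o.kind == kind]


def topmost(objs: List[LayoutObject]) -> List[LayoutObject]:
    """Keep the single object closest to the top of the page, if any."""
    return [min(objs, key=lambda o: o.y)] if objs else []


def below(frame: LayoutObject) -> Selection:
    """Keep objects that start below the frame of reference set by `frame`."""
    return lambda objs: [o for o in objs if o.y >= frame.bottom]


def feature(*steps: Selection) -> Selection:
    """Compose selection operations, applied in order, into one feature."""
    def run(objs: List[LayoutObject]) -> List[LayoutObject]:
        for step in steps:
            objs = step(objs)
        return objs
    return run


# Feature 1: the topmost graphics object (for instance, a logo).
top_graphic = feature(of_kind("graphics"), topmost)


def text_below_top_graphic(objs: List[LayoutObject]) -> List[LayoutObject]:
    """Feature 2: text blocks positioned relative to another layout object."""
    frames = top_graphic(objs)
    if not frames:
        return []
    return feature(of_kind("text"), below(frames[0]))(objs)


def in_genre(page: List[LayoutObject], features: List[Selection]) -> bool:
    """Assumed rule: a page matches when every feature selects something."""
    return all(len(f(page)) > 0 for f in features)


page = [
    LayoutObject("graphics", 40, 30, 120, 60),
    LayoutObject("text", 40, 120, 300, 40),
    LayoutObject("text", 40, 200, 500, 400),
]
matches = in_genre(page, [top_graphic, text_below_top_graphic])
```

In this sketch each feature returns the second set of layout objects described in the abstract, so a caller can inspect which objects make up the feature on a given page as well as test whether the page matches.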

