Scalable indexing for layout based document retrieval and...

Data processing: artificial intelligence – Neural network – Learning task

Reexamination Certificate

Details

active

07953679

ABSTRACT:
A computer-based method and a system for indexing, querying, and ranking documents based on layout are provided. The method includes providing a plurality of documents to computer memory, extracting layout blocks from the provided documents, clustering the layout blocks into a plurality of layout block clusters, computing a representative block for each of the layout block clusters, generating a document index for each provided document based on the layout blocks of the document and the computed representative blocks, clustering the generated document indexes into a plurality of document index clusters, and generating a representative cluster index for each of the document index clusters. The generated indexes, together with the representative blocks and document index clusters, can be stored and used for retrieval of documents responsive to a layout query.
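
The steps listed in the abstract can be illustrated with a small sketch. This is not the patented implementation: the abstract does not say how blocks are represented, which clustering algorithm is used, or what form a document index takes. The sketch assumes that a layout block is an (x, y, width, height) tuple in page fractions, that clustering is plain k-means with farthest-point seeding, that a representative is a cluster centroid, and that a document index is a histogram of the document's blocks over the representative blocks. All function names and the toy documents are hypothetical.

```python
from math import dist  # Euclidean distance, Python 3.8+

def nearest(p, centroids):
    """Position of the centroid closest to point p."""
    return min(range(len(centroids)), key=lambda c: dist(p, centroids[c]))

def seed(points, k):
    """Deterministic farthest-point seeding (an assumption, not from the patent)."""
    seeds = [points[0]]
    while len(seeds) < k:
        seeds.append(max(points, key=lambda p: min(dist(p, s) for s in seeds)))
    return seeds

def kmeans(points, k, iters=10):
    """Plain k-means; returns (centroids, labels). Centroids act as representatives."""
    centroids = seed(points, k)
    for _ in range(iters):
        labels = [nearest(p, centroids) for p in points]
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels

def document_index(blocks, representatives):
    """Histogram: how many blocks fall nearest to each representative block."""
    index = [0] * len(representatives)
    for b in blocks:
        index[nearest(b, representatives)] += 1
    return tuple(index)

def build(docs, n_block_clusters, n_doc_clusters):
    """Cluster blocks, index each document, then cluster the document indexes."""
    all_blocks = [b for blocks in docs.values() for b in blocks]
    representatives, _ = kmeans(all_blocks, n_block_clusters)
    indexes = {name: document_index(blocks, representatives)
               for name, blocks in docs.items()}
    names = list(indexes)
    cluster_reps, labels = kmeans([indexes[n] for n in names], n_doc_clusters)
    clusters = {c: [n for n, l in zip(names, labels) if l == c]
                for c in range(n_doc_clusters)}
    return representatives, indexes, cluster_reps, clusters

def query(blocks, representatives, indexes, cluster_reps, clusters):
    """Index the query layout, pick the closest index cluster, rank its documents."""
    q = document_index(blocks, representatives)
    c = nearest(q, cluster_reps)
    return sorted(clusters[c], key=lambda n: dist(q, indexes[n]))

# Toy corpus: two letter-like layouts (header + body) and two two-column layouts.
docs = {
    "letter_a": [(0.10, 0.05, 0.80, 0.10), (0.10, 0.20, 0.80, 0.70)],
    "letter_b": [(0.10, 0.06, 0.80, 0.10), (0.10, 0.22, 0.80, 0.68)],
    "twocol_a": [(0.05, 0.10, 0.42, 0.80), (0.53, 0.10, 0.42, 0.80)],
    "twocol_b": [(0.06, 0.10, 0.41, 0.80), (0.54, 0.10, 0.41, 0.80)],
}
representatives, indexes, cluster_reps, clusters = build(docs, 4, 2)

# Layout query: a page with two side-by-side columns.
layout_query = [(0.05, 0.12, 0.43, 0.78), (0.52, 0.12, 0.43, 0.78)]
hits = query(layout_query, representatives, indexes, cluster_reps, clusters)
print(hits)
```

On this toy corpus the query is expected to land in the cluster that holds the two two-column documents. Comparing the query first against the representative cluster indexes, and only then against the documents in the chosen cluster, is what would let such a scheme avoid scanning every document index; that reading of the "scalable" in the title is an interpretation, not a statement from the abstract.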

REFERENCES:
patent: 7099861 (2006-08-01), Houn
patent: 7475061 (2009-01-01), Bargeron
patent: 7698255 (2010-04-01), Goodwin et al.
patent: 7698303 (2010-04-01), Goodwin et al.
patent: 7707157 (2010-04-01), Shen
patent: 7711668 (2010-05-01), Brinker et al.
patent: 7734627 (2010-06-01), Tong
patent: 7747593 (2010-06-01), Patterson et al.
patent: 7805389 (2010-09-01), Saito et al.
patent: 7809718 (2010-10-01), Brinker et al.
patent: 7836108 (2010-11-01), Kupke et al.
patent: 7836406 (2010-11-01), Kirsten et al.
patent: 7844566 (2010-11-01), Wnek
patent: 7856411 (2010-12-01), Darr
Efficient phrase-based document indexing for Web document clustering, Hammouda, K.M.; Kamel, M.S.; Knowledge and Data Engineering, IEEE Transactions on, vol. 16, Issue: 10, Digital Object Identifier: 10.1109/TKDE.2004.58, Publication Year: 2004, pp. 1279-1296.
ModelMaker: a tool for rapid modeling from device descriptions, Cyre, W.R.; Gunawan, A.; Verilog HDL Conference and VHDL International Users Forum, 1998. IVC/VIUF. Proceedings., 1998 International, Digital Object Identifier: 10.1109/IVC.1998.660692, Publication Year: 1998, pp. 138-142.
Phrase-based document similarity based on an index graph model, Hammouda, K.M.; Kamel, M.S.; Data Mining, 2002. ICDM 2002. Proceedings. 2002 IEEE International Conference on, Digital Object Identifier: 10.1109/ICDM.2002.1183904, Publication Year: 2002, pp. 203-210.
Web Document Clustering Using Document Index Graph, Momin, B.F.; Kulkarni, P.J.; Chaudhari, A.; Advanced Computing and Communications, 2006. ADCOM 2006. International Conference on, Digital Object Identifier: 10.1109/ADCOM.2006.4289851, Publication Year: 2006, pp. 32-37.
Chidlovskii, et al., Stacked Dependency Networks for Layout Document Structuring, Journal of Universal Computer Science, vol. 14, No. 18, pp. 2998-3010 (2008).
Datta, et al., Image Retrieval: Ideas, Influences, and Trends of the New Age, ACM Comput. Surv., 40(2):1-60, 2008.
Janssen, et al., Uplib: A Universal Personal Digital Library System, In DocEng '03: Proc. of the 2003 ACM Symposium on Document Engineering, pp. 234-242, New York, NY, USA, 2003. ACM.
Nakai, et al., Camera-Based Document Image Retrieval as Voting for Partial Signatures of Projective Invariants, In ICDAR '05: Proc. of the 8th Intl. Conf. on Document Analysis and Recognition, pp. 379-383, Washington, DC, USA, 2005. IEEE Computer Society.
Ramos, Using TF-IDF to Determine Word Relevance in Document Queries, Department of Computer Science, Rutgers University, 2003.
TF-IDF, Wikipedia, available at http://en.wikipedia.org/wiki/Tf-idf, downloaded May 22, 2009.
Van Beusekom, et al., Distance Measures for Layout-Based Document Image Retrieval, In DIAL '06: Proc. of the 2nd Intl. Conf. on Document Image Analysis for Libraries, pp. 232-242, Washington, DC, USA, 2006. IEEE Computer Society.
Vilkner, et al., Micro Total Analysis Systems, Recent Developments, Anal. Chem., 76, 3373-3386 (2004).

Profile ID: LFUS-PAI-O-2647285
