Data processing: artificial intelligence – Neural network – Learning task
Reexamination Certificate
2011-05-31
Holmes, Michael (Department: 2129)
active
07953679
ABSTRACT:
A computer-based method and a system for indexing, querying, and ranking documents based on layout are provided. The method includes providing a plurality of documents to computer memory, extracting layout blocks from the provided documents, clustering the layout blocks into a plurality of layout block clusters, computing a representative block for each of the layout block clusters, generating a document index for each provided document based on the layout blocks of the document and the computed representative blocks, clustering the generated document indexes into a plurality of document index clusters, and generating a representative cluster index for each of the document index clusters. The indexes generated, together with the representative blocks and document index clusters, can be stored and used for retrieval of documents responsive to a layout query.
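The steps listed in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not name a clustering algorithm, a block representation, or a distance measure, so the sketch assumes k-means with farthest-first seeding, blocks given as normalized (x, y, width, height) tuples, cluster centroids as representative blocks, a normalized histogram over representatives as the document index, and Euclidean distance for ranking. All function names (`kmeans`, `document_index`, `build`, `query`) are hypothetical.

```python
from math import dist


def nearest(p, centers):
    """Index of the center closest to p (Euclidean distance)."""
    return min(range(len(centers)), key=lambda i: dist(p, centers[i]))


def kmeans(points, k, iters=20):
    """Plain k-means; seeds are chosen deterministically, farthest-first."""
    centers = [tuple(points[0])]
    while len(centers) < k:
        # Next seed: the point farthest from all seeds chosen so far.
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    for _ in range(iters):
        groups = [[] for _ in centers]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Mean of each group; an empty group keeps its previous center.
        centers = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else old
            for g, old in zip(groups, centers)
        ]
    return centers


def document_index(blocks, representatives):
    """Assumed index form: share of a document's blocks per representative."""
    counts = [0] * len(representatives)
    for b in blocks:
        counts[nearest(b, representatives)] += 1
    total = len(blocks) or 1
    return tuple(c / total for c in counts)


def build(docs, n_block_clusters, n_doc_clusters):
    """Cluster blocks, index each document, then cluster the indexes."""
    all_blocks = [b for blocks in docs.values() for b in blocks]
    reps = kmeans(all_blocks, n_block_clusters)          # representative blocks
    indexes = {name: document_index(blocks, reps) for name, blocks in docs.items()}
    cluster_reps = kmeans(list(indexes.values()), n_doc_clusters)
    return reps, indexes, cluster_reps


def query(blocks, reps, indexes, cluster_reps):
    """Rank the documents of the index cluster closest to the layout query."""
    q = document_index(blocks, reps)
    best = nearest(q, cluster_reps)
    candidates = [n for n, idx in indexes.items() if nearest(idx, cluster_reps) == best]
    return sorted(candidates, key=lambda n: dist(q, indexes[n]))


# Toy data: two letter-like layouts and one two-column layout.
docs = {
    "letter_a": [(0.1, 0.1, 0.8, 0.1), (0.1, 0.3, 0.8, 0.6)],
    "letter_b": [(0.1, 0.1, 0.8, 0.1), (0.1, 0.3, 0.8, 0.6)],
    "two_col": [(0.05, 0.1, 0.4, 0.8), (0.55, 0.1, 0.4, 0.8)],
}
reps, indexes, cluster_reps = build(docs, n_block_clusters=4, n_doc_clusters=2)
print(query([(0.06, 0.1, 0.4, 0.8), (0.54, 0.12, 0.4, 0.78)], reps, indexes, cluster_reps))
```

Comparing the query only against documents in the closest index cluster is one way the two-level clustering could keep retrieval cost below a full scan; whether the patented method prunes in this manner is not stated in the abstract.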
REFERENCES:
patent: 7099861 (2006-08-01), Houn
patent: 7475061 (2009-01-01), Bargeron
patent: 7698255 (2010-04-01), Goodwin et al.
patent: 7698303 (2010-04-01), Goodwin et al.
patent: 7707157 (2010-04-01), Shen
patent: 7711668 (2010-05-01), Brinker et al.
patent: 7734627 (2010-06-01), Tong
patent: 7747593 (2010-06-01), Patterson et al.
patent: 7805389 (2010-09-01), Saito et al.
patent: 7809718 (2010-10-01), Brinker et al.
patent: 7836108 (2010-11-01), Kupke et al.
patent: 7836406 (2010-11-01), Kirsten et al.
patent: 7844566 (2010-11-01), Wnek
patent: 7856411 (2010-12-01), Darr
Efficient phrase-based document indexing for Web document clustering, Hammouda, K.M.; Kamel, M.S.; Knowledge and Data Engineering, IEEE Transactions on vol. 16, Issue: 10 Digital Object Identifier: 10.1109/TKDE.2004.58 Publication Year: 2004, pp. 1279-1296.
ModelMaker: a tool for rapid modeling from device descriptions, Cyre, W.R.; Gunawan, A.; Verilog HDL Conference and VHDL International Users Forum, 1998. IVC/VIUF. Proceedings., 1998 International Digital Object Identifier: 10.1109/IVC.1998.660692 Publication Year: 1998, pp. 138-142.
Phrase-based document similarity based on an index graph model, Hammouda, K.M.; Kamel, M.S.; Data Mining, 2002. ICDM 2002. Proceedings. 2002 IEEE International Conference on Digital Object Identifier: 10.1109/ICDM.2002.1183904 Publication Year: 2002, pp. 203-210.
Web Document Clustering Using Document Index Graph, Momin, B.F.; Kulkarni, P.J.; Chaudhari, A.; Advanced Computing and Communications, 2006. ADCOM 2006. International Conference on Digital Object Identifier: 10.1109/ADCOM.2006.4289851 Publication Year: 2006, pp. 32-37.
Chidlovskii, et al., Stacked Dependency Networks for Layout Document Structuring, Journal of Universal Computer Science, vol. 14, No. 18, pp. 2998-3010 (2008).
Datta, et al., Image Retrieval: Ideas, Influences, and Trends of the New Age, ACM Comput. Surv., 40(2):1-60, 2008.
Janssen, et al., Uplib: A Universal Personal Digital Library System, In DocEng '03: Proc. of the 2003 ACM Symposium on Document Engineering, pp. 234-242, New York, NY, USA, 2003. ACM.
Nakai, et al., Camera-Based Document Image Retrieval as Voting for Partial Signatures of Projective Invariants, In ICDAR '05: Proc. of the 8th Intl. Conf. on Document Analysis and Recognition, pp. 379-383, Washington, DC, USA, 2005. IEEE Computer Society.
Ramos, Using TF-IDF to Determine Word Relevance in Document Queries, Department of Computer Science, Rutgers University, 2003.
TF-IDF, Wikipedia, available at http://en.wikipedia.org/wiki/Tf-idf, downloaded May 22, 2009.
Van Beusekom, et al., Distance Measures for Layout-Based Document Image Retrieval, In DIAL '06: Proc. of the 2nd Intl. Conf. on Document Image Analysis for Libraries, pp. 232-242, Washington, DC, USA, 2006. IEEE Computer Society.
Vilkner, et al., Micro Total Analysis Systems, Recent Developments, Anal. Chem., 76, 3373-3386 (2004).
Chidlovskii Boris
Lecerf Loïc M.
Fay Sharpe LLP
Holmes Michael
Xerox Corporation
Scalable indexing for layout based document retrieval and...