Image analysis – Image segmentation – Distinguishing text from other regions
Reexamination Certificate
2011-03-01
2011-03-01
Ishrat, Sherali (Department: 2624)
active
07899249
ABSTRACT:
The present invention relates to systems and methods for analyzing media material having articles continuing across multiple pages. A media material analyzer includes a segmenter and an article composer. The segmenter identifies block segments associated with columnar body text in the media material. The article composer determines which of the identified block segments belong to a continuing article extending across multiple pages in the media material based on language statistics information and continuation transition information.
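The abstract does not say how language statistics and continuation transition information are combined, so the following is only an illustrative sketch and not the patented method. It assumes that "language statistics" can be approximated by bag-of-words cosine similarity and that "continuation transition information" can be approximated by a "continued on page N" cue. The names `JUMP_CUE`, `continuation_score`, `best_continuation`, and the `cue_bonus` weight are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical jump-line cue such as "continued on page 7" (an assumption,
# standing in for the abstract's "continuation transition information").
JUMP_CUE = re.compile(r"continued (?:on|from) page (\d+)", re.IGNORECASE)


def word_counts(text):
    """Bag-of-words counts, standing in for 'language statistics'."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def continuation_score(source, candidate, cue_bonus=0.5):
    """Score how plausibly `candidate` continues `source`.

    Each block segment is a dict with 'page' (int) and 'text' (str).
    """
    score = cosine(word_counts(source["text"]), word_counts(candidate["text"]))
    match = JUMP_CUE.search(source["text"])
    # Add the bonus when the source block points at the candidate's page.
    if match and int(match.group(1)) == candidate["page"]:
        score += cue_bonus
    return score


def best_continuation(source, candidates):
    """Pick the candidate block segment with the highest score."""
    return max(candidates, key=lambda block: continuation_score(source, block))
```

Given block segments already produced by a segmenter, `best_continuation(source, candidates)` returns the candidate block whose vocabulary and page number best match the source block under these assumptions.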
REFERENCES:
patent: 5335290 (1994-08-01), Cullen et al.
patent: 5805731 (1998-09-01), Yaeger et al.
patent: 5848184 (1998-12-01), Taylor et al.
patent: 5848186 (1998-12-01), Wang et al.
patent: 5907631 (1999-05-01), Saitoh
patent: 6173073 (2001-01-01), Wang
patent: 6577763 (2003-06-01), Fujimoto et al.
patent: 7382909 (2008-06-01), Nattkemper et al.
patent: 2001/0018685 (2001-08-01), Saito et al.
patent: 2003/0229854 (2003-12-01), Lemay
patent: 2004/0117725 (2004-06-01), Chen et al.
patent: 2004/0122811 (2004-06-01), Page
patent: 2004/0208371 (2004-10-01), Liu et al.
patent: 2006/0080309 (2006-04-01), Yacoub et al.
patent: 2006/0184525 (2006-08-01), Jones et al.
patent: 2007/0050406 (2007-03-01), Byers
patent: 2007/0174343 (2007-07-01), Fortuna
patent: 2007/0291288 (2007-12-01), Campbell et al.
patent: 2008/0103996 (2008-05-01), Forman et al.
patent: 2008/0107337 (2008-05-01), Furmaniak et al.
patent: 2008/0109425 (2008-05-01), Yih et al.
Breuel, Thomas, “Google Library Project”, 2006 IUPR Research Group, last viewed Oct. 20, 2006, http://www.iupr.org/current/google_library_project_2, 3 pgs.
Mantzaris, S. L. et al., “Linking Article Parts for the Creation of a Newspaper Digital Library”, Lambrakis Press S.A., 2000, 14 pgs.
Gatos, B. et al., “Automatic page analysis for the creation of a digital library from newspaper archives”, 2000 Springer-Verlag, pp. 77-84.
Mitchell, Phillip E. et al., “Newspaper layout analysis incorporating connected component separation”, Image and Vision Computing 22, 2004, pp. 307-317.
Cattoni, R. et al., “Geometric Layout Analysis Techniques for Document Image Understanding: a Review”, ITC-IRST, Jan. 1998, 68 pgs.
Alam, Hassan et al., “Web Document Analysis: How can Natural Language Processing Help in Determining Correct Content Flow?”, BCL Technologies Inc., 2003, pp. 29-32.
Koivusaari, Maija et al., “Automated document content characterization for a multimedia document retrieval system”, Proc. SPIE 1997, vol. 3229, Oct. 1997, pp. 148-159.
Journet, Nicholas et al., “Ancient Printed Documents indexation: a new approach”, Springer Berlin / Heidelberg, 2005, vol. 3686, pp. 580-589.
Malerba, Donato et al., “Adaptive Layout Analysis of Document Images”, Dipartimento di Informatica, Universita degli Studi di Bari, 2002, 9 pgs.
Breuel, Thomas M., “High Performance Document Layout Analysis”, 2003 Symposium on Document Image Understanding (SDIUT '03), Apr. 9-11, 2003, 10 pgs.
Klink, Stefan et al., “Document Structure Analysis Based on Layout and Textual Features”, in Proc. of Fourth IAPR International Workshop on Document Analysis Systems, DAS2000, pp. 99-111.
Mao, Song et al., “Document Structure Analysis Algorithms: A Literature Survey”, Center for Automation Research and IBM Almaden Research Center, 2003, 11 pgs.
Tsujimoto, Shuichi et al., “Understanding Multi-articled Documents”, IEEE, May 1990, pp. 551-556.
Andersen, Tim et al., “Features for Neural Net Based Region Identification of Newspaper Documents”, IEEE, Jan. 2003, 5 pgs.
Mühlberger, Günter, “Digitisation of Newspaper Clippings: The Laurin Project”, RLG DigiNews, Dec. 15, 1999, vol. 3, No. 6, 21 pgs.
Shafait, Faisal et al., “Performance Comparison of Six Algorithms for Page Segmentation”, Springer-Verlag, 2006, vol. 3872, pp. 368-379.
Mitchell, Phillip et al., “Newspaper Document Analysis featuring Connected Line Segmentation”, Australian Computer Society, Inc, 2002, 5 pgs.
Brants, Thorsten et al., “Topic-Based Document Segmentation with Probabilistic Latent Semantic Analysis”, CIKM'02, ACM, Nov. 4-9, 2002, pp. 211-218.
Nagy, George, et al., “A Prototype Document Image Analysis System for Technical Journals”, IEEE, Jul. 1992, pp. 10-22.
O'Gorman, Lawrence, “The Document Spectrum for Page Layout Analysis”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, No. 11, Nov. 1993, pp. 1162-1173.
Wong, K. Y., “Document Analysis System”, International Business Machines Corporation, vol. 26, No. 6, Nov. 1982, pp. 647-656.
Breuel, Thomas M., “Two Geometric Algorithms for Layout Analysis”, Document Analysis Systems, Xerox Palo Alto Research Center, 2002, 12 pgs.
Breuel, Thomas M., “Robust Least Square Baseline Finding using a Branch and Bound Algorithm”, Document Recognition & Retrieval, SPIE, 2002, pp. 20-27.
Kise, Koichi, et al., “Segmentation of Page Images Using the Area Voronoi Diagram”, Computer Vision and Image Understanding, vol. 70, No. 3, Jun. 1998, pp. 370-382.
Baird, Henry, “Background Structure in Document Images”, International Journal of Pattern Recognition and Artificial Intelligence, vol. 8, No. 5, Oct. 1994, pp. 1013-1030.
International Search Report, dated May 14, 2008, for PCT Patent Application No. PCT/US/23233, 1 page.
Bloomberg Dan
Furmaniak Ralph
Lee Dar-Shyang
Smith Ray
Vincent Luc
Google Inc.
Ishrat Sherali
Sterne Kessler Goldstein & Fox P.L.L.C.
Media material analysis of continuing article portions
Profile ID: LFUS-PAI-O-2623856