Image analysis – Pattern recognition – Context analysis or word recognition
Patent
1995-04-28
1997-11-18
Mancuso, Joseph
Image analysis
Pattern recognition
Context analysis or word recognition
395761, G06K 972
Patent
active
056895852
ABSTRACT:
A method for establishing a relationship between a text image and a transcription associated with the text image uses conventional image processing techniques to identify one or more geometric attributes, or image parameters, of each of a sequence of regions of the text image. The transcription labels in the transcription are analyzed to determine a comparable set of parameters in transcription label sequence. A matching operation then matches the respective parameters of the two sequences to identify image regions that match with transcription regions. The result is an output data structure that minimally identifies image locations of interest to a subsequent operation that processes the text image. The output data structure may also pair each of the image locations of interest to a transcription location, in effect producing a set of labeled image locations. In one embodiment, the sequence of locations of words and their observed lengths in the text image are determined. The transcription is analyzed to identify words, and transcription word lengths are computed using an estimated image character width of glyphs in the text image. The sequence of observed image word lengths is then matched to the sequence of computed transcription word lengths using a dynamic programming algorithm that finds a best path through a two-dimensional lattice of nodes and transitions between nodes, where the transitions represent pairs of sequences of zero or more word lengths. An output data structure contains entries, each of which pairs a transcription word with a matching image word location.
REFERENCES:
patent: 4905287 (1990-02-01), Segawa
patent: 5321770 (1994-06-01), Huttenlocher et al.
patent: 5333275 (1994-07-01), Wheatley et al.
patent: 5438512 (1995-08-01), Mantha et al.
patent: 5438628 (1995-08-01), Spitz et al.
patent: 5438630 (1995-08-01), Chen et al.
patent: 5455871 (1995-10-01), Bloomberg et al.
patent: 5473705 (1995-12-01), Abe et al.
patent: 5513304 (1996-04-01), Spitz et al.
patent: 5524066 (1996-06-01), Kaplan et al.
patent: 5526444 (1996-06-01), Kopec et al.
patent: 5544050 (1996-08-01), Abe et al.
Hull, "A Hidden Markov Model for Language Syntax in Text Recognition", Pattern Recognition, '92 11th Int'l. vol. 11, pp. 124-127.
G. Nagy, et al in "A prototype document image analysis system for technical journals", IEEE Computer, Jul., 1992, pp. 10-22.
A. Dengel, et al. in "From Paper to Office Document Standard Representation" Computer, vol. 25, No. 7, Jul. 1992, pp. 63-67.
T. Butler, "Retaining Document Format in OCR," in SPIE vol. 2181 Document Recognition, Proceedings of the IS&T/SPIE Electronic Imaging Conference, San Jose, CA, Feb. 1994, pp. 78-86.
Huang, Ariki and Jack, Hidden Markov Models for Speech Recognition, Edinburgh University Press, 1990, Chapter 3, Section 3.2, at pp. 70-81.
Rabiner, Lawrence and Juang, Biing-Hwang, Fundamentals of Speech Recognition, Prentice Hall, 1993, Chapter 4, Sec. 4.7, at pp. 200-241, Chapter 6, Sec. 6.4.1 at pp. 334-340.
Bloomberg Dan S.
Chou Philip Andrew
Kopec Gary E.
Niles Leslie T.
Bares Judith C.
Kahng Anthony H.
Mancuso Joseph
Xerox Corporation
LandOfFree
Method for aligning a text image to a transcription of the image does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for aligning a text image to a transcription of the image, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for aligning a text image to a transcription of the image will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1571948