Method for aligning a text image to a transcription of the image

Image analysis – Pattern recognition – Context analysis or word recognition

Patent

Details

395/761, G06K 9/72

active

056895852

ABSTRACT:
A method for establishing a relationship between a text image and a transcription associated with the text image uses conventional image processing techniques to identify one or more geometric attributes, or image parameters, of each of a sequence of regions of the text image. The transcription labels in the transcription are analyzed to determine a comparable set of parameters for the transcription label sequence. A matching operation then matches the respective parameters of the two sequences to identify image regions that correspond to transcription regions. The result is an output data structure that, at a minimum, identifies image locations of interest to a subsequent operation that processes the text image. The output data structure may also pair each of the image locations of interest with a transcription location, in effect producing a set of labeled image locations. In one embodiment, the sequence of word locations and their observed lengths in the text image is determined. The transcription is analyzed to identify words, and transcription word lengths are computed using an estimated image character width of glyphs in the text image. The sequence of observed image word lengths is then matched to the sequence of computed transcription word lengths using a dynamic programming algorithm that finds a best path through a two-dimensional lattice of nodes and transitions between nodes, where each transition represents a pair of sequences of zero or more word lengths. An output data structure contains entries, each of which pairs a transcription word with a matching image word location.
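The word-length embodiment described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the set of allowed transitions, and the cost (absolute difference between summed word lengths) are assumptions chosen for illustration; the abstract states only that a best path is found through a two-dimensional lattice whose transitions pair sequences of zero or more word lengths. Inter-word spacing is ignored here.

```python
from typing import List, Tuple

# Hypothetical transition set: (image words consumed, transcription words consumed).
# (1, 0) and (0, 1) skip a word on one side; (2, 1) and (1, 2) model merged/split words.
TRANSITIONS = [(1, 1), (1, 0), (0, 1), (2, 1), (1, 2)]

Step = Tuple[List[int], List[int]]


def transcription_word_lengths(transcription: str, char_width: float) -> List[float]:
    """Estimate each transcription word's image length as
    character count times an estimated glyph width (an assumed simplification)."""
    return [len(word) * char_width for word in transcription.split()]


def align_word_lengths(
    image_lengths: List[float], trans_lengths: List[float]
) -> Tuple[float, List[Step]]:
    """Find a minimum-cost path through the (n+1) x (m+1) lattice.

    Returns the total cost and a list of steps; each step pairs the indices of
    the image words with the indices of the transcription words it consumes.
    """
    n, m = len(image_lengths), len(trans_lengths)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    # Row-major order guarantees every predecessor of a node is final
    # before the node itself is expanded.
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for di, dj in TRANSITIONS:
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                # Assumed cost: mismatch between the summed lengths on each side.
                step = abs(sum(image_lengths[i:ni]) - sum(trans_lengths[j:nj]))
                if cost[i][j] + step < cost[ni][nj]:
                    cost[ni][nj] = cost[i][j] + step
                    back[ni][nj] = (i, j)

    # Trace the best path back from the final node to the origin.
    path: List[Step] = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        path.append((list(range(pi, i)), list(range(pj, j))))
        i, j = pi, pj
    path.reverse()
    return cost[n][m], path


# Example: observed word lengths in pixels versus a three-word transcription.
observed = [30.0, 50.0, 20.0]
computed = transcription_word_lengths("the quick ox", char_width=10.0)
total_cost, steps = align_word_lengths(observed, computed)
for image_idx, trans_idx in steps:
    print(image_idx, "<->", trans_idx)
```

Each step in the returned path plays the role of an entry in the output data structure: a step such as ([1], [1]) pairs image word 1 with transcription word 1, while a step with an empty list on one side marks a word that has no counterpart in the other sequence.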

REFERENCES:
patent: 4905287 (1990-02-01), Segawa
patent: 5321770 (1994-06-01), Huttenlocher et al.
patent: 5333275 (1994-07-01), Wheatley et al.
patent: 5438512 (1995-08-01), Mantha et al.
patent: 5438628 (1995-08-01), Spitz et al.
patent: 5438630 (1995-08-01), Chen et al.
patent: 5455871 (1995-10-01), Bloomberg et al.
patent: 5473705 (1995-12-01), Abe et al.
patent: 5513304 (1996-04-01), Spitz et al.
patent: 5524066 (1996-06-01), Kaplan et al.
patent: 5526444 (1996-06-01), Kopec et al.
patent: 5544050 (1996-08-01), Abe et al.
Hull, "A Hidden Markov Model for Language Syntax in Text Recognition", Pattern Recognition, '92 11th Int'l. vol. 11, pp. 124-127.
G. Nagy, et al., "A prototype document image analysis system for technical journals", IEEE Computer, Jul. 1992, pp. 10-22.
A. Dengel, et al., "From Paper to Office Document Standard Representation", Computer, vol. 25, No. 7, Jul. 1992, pp. 63-67.
T. Butler, "Retaining Document Format in OCR," in SPIE vol. 2181 Document Recognition, Proceedings of the IS&T/SPIE Electronic Imaging Conference, San Jose, CA, Feb. 1994, pp. 78-86.
Huang, Ariki and Jack, Hidden Markov Models for Speech Recognition, Edinburgh University Press, 1990, Chapter 3, Section 3.2, at pp. 70-81.
Rabiner, Lawrence and Juang, Biing-Hwang, Fundamentals of Speech Recognition, Prentice Hall, 1993, Chapter 4, Sec. 4.7, at pp. 200-241, Chapter 6, Sec. 6.4.1 at pp. 334-340.
