Fontless structured document image representations for efficient

Patent

Details

395102, G06K 1502

Patent

active

058840141

ABSTRACT:
A processor is provided with a first set of digital information that includes a first, resolution-independent structured representation of a document. This first representation is one from which various image collections (e.g., sets of page images) can be obtained, each such image in each such collection having a characteristic resolution. From the first set of digital information, the processor produces a second set of digital information that includes a second, resolution-dependent structured representation of the document. The second structured representation is a lossless representation of a particular one of the image collections obtainable from the first structured representation, and it includes a set of tokens and a set of positions. The second set of digital information is produced by extracting the tokens from the first structured representation, and by determining the positions from the first structured representation. Each extracted token includes pixel data representing a subimage of the particular image collection. Each position is a position of a token subimage in the particular image collection. At least one of the token subimages contains multiple pixels and occurs at more than one position in the image collection. The second set of digital information thus produced can be made available for further use (e.g., distribution, transmission, storage, subsequent reconversion into page images). Applications of the invention include high-speed printing and Internet (World Wide Web) document display.
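The token-and-position idea in the abstract can be illustrated with a minimal sketch. It is not the patented method itself. It assumes the resolution-independent document has already been rasterized at one resolution into a list of subimage placements. The names `build_token_representation`, `render_page`, `GLYPH_A`, and `GLYPH_B` are hypothetical and chosen only for illustration. Identical subimages are stored once as tokens, and each occurrence is recorded as a position that refers to a token.

```python
from typing import Dict, List, Tuple

Bitmap = Tuple[Tuple[int, ...], ...]       # rows of 0/1 pixels
Placement = Tuple[Bitmap, int, int, int]   # (subimage, page, x, y)
Position = Tuple[int, int, int, int]       # (token_id, page, x, y)


def build_token_representation(
    placements: List[Placement],
) -> Tuple[List[Bitmap], List[Position]]:
    """Store each distinct subimage once as a token; record where it occurs."""
    token_ids: Dict[Bitmap, int] = {}
    tokens: List[Bitmap] = []
    positions: List[Position] = []
    for bitmap, page, x, y in placements:
        if bitmap not in token_ids:
            token_ids[bitmap] = len(tokens)
            tokens.append(bitmap)
        positions.append((token_ids[bitmap], page, x, y))
    return tokens, positions


def render_page(
    tokens: List[Bitmap],
    positions: List[Position],
    page: int,
    width: int,
    height: int,
) -> List[List[int]]:
    """Rebuild one page image by painting each token at its recorded position."""
    image = [[0] * width for _ in range(height)]
    for token_id, p, x, y in positions:
        if p != page:
            continue
        for dy, row in enumerate(tokens[token_id]):
            for dx, pixel in enumerate(row):
                if pixel:
                    image[y + dy][x + dx] = 1
    return image


# Hypothetical 3x3 subimages standing in for rasterized glyphs (assumption).
GLYPH_A: Bitmap = ((0, 1, 0), (1, 1, 1), (1, 0, 1))
GLYPH_B: Bitmap = ((1, 1, 0), (1, 1, 1), (1, 1, 0))

placements: List[Placement] = [
    (GLYPH_A, 0, 0, 0),
    (GLYPH_B, 0, 4, 0),
    (GLYPH_A, 0, 8, 0),  # same subimage at a second position
]

tokens, positions = build_token_representation(placements)
page_image = render_page(tokens, positions, page=0, width=11, height=3)
```

In this sketch `GLYPH_A` is a multi-pixel token that occurs at two positions, matching the condition stated in the abstract, and painting the tokens back at their positions reproduces the page image exactly.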

REFERENCES:
patent: 4410916 (1983-10-01), Pratt et al.
patent: 4499499 (1985-02-01), Brickman et al.
patent: 4566128 (1986-01-01), Araki
patent: 4703516 (1987-10-01), Fukuda
patent: 4769716 (1988-09-01), Casey et al.
patent: 5058187 (1991-10-01), Kim
patent: 5303313 (1994-04-01), Mark et al.
patent: 5305433 (1994-04-01), Ohno
patent: 5504843 (1996-04-01), Catapano et al.
Ian H. Witten, Alistair Moffat and Timothy C. Bell, "Textual Images", Managing Gigabytes: Compressing and Indexing Documents and Images, Chapter 7, New York:Van Nostrand Reinhold, 1994, pp. 254-293.
Holt, M. J. J. and C. S. Xydeas, "Recent Developments in Image Data Compression for Digital Facsimile", ICL Technical Journal, May 1986, pp. 123-146.
K. Mohiuddin, J. Rissanen and R. Arps, "Lossless Binary Image Compression Based on Pattern Matching", International Conference on Computers, Systems and Signal Processing, Bangalore, India, Dec. 9-12, 1984, pp. 447-451.
Gary E. Kopec and Mauricio Lomelin, "Document-Specific Character Template Estimation", International Symposium on Electronic Imaging: Science & Technology (IS&T/SPIE), Jan. 27-Feb. 2, 1996.
Witten, I. H., T. C. Bell, M. E. Harrison, M. L. James and A. Moffat, "Textual Image Compression", Proceedings IEEE Data Compression Conference, 1992, pp. 42-51.
Ascher, R. N. and G. Nagy, "A Means for Achieving a High Degree of Compaction on Scan-Digitized Printed Text", IEEE Transactions on Computers, 1974, C-23(11), pp. 1174-1179.
Pratt, W. K., P. J. Capitant, W. H. Chen, E. R. Hamilton, and R. H. Wallis, "Combined Symbol Matching Facsimile Data Compression System", Proceedings IEEE, 1980, 68(7), pp. 786-796.
Johnsen, O., J. Segen and G. L. Cash, "Coding of Two-Level Pictures by Pattern Matching and Substitution", Bell System Technical Journal, 1983, 62(8), pp. 2513-2545.
Mohiuddin, K. M., Pattern Matching with Application to Binary Image Compression, Ph. D. thesis, Stanford University, Stanford, California, 1982.
Adobe Systems, Inc., PostScript Language Reference Manual, (2nd ed.), (Reading, Mass.: Addison-Wesley, 1990) pp. 266-267, 398, 435, 456, 483, 520 and 591-606.
Tao Hong and Jonathan J. Hull, "Improving OCR Performance with Word Image Equivalence", Fourth Annual Symposium on Document Analysis and Information Retrieval, Apr. 1995, pp. 177-189.
Emberson, H. Textual Image Compression, Honours Project Report, Department of Computer Science, University of Canterbury, New Zealand, 1992.
Wong, K. Y., R. G. Casey and F. M. Wahl, "Document Analysis System", IBM Journal of Research and Development, 1982, 26(6), pp. 647-656.
Holt, M.J.J. and Xydeas, C.S., "Compression of Document Image Data by Symbol Matching," in Capellini, V. and Marconi, R., eds., Advances in Image Processing and Pattern Recognition, Elsevier Science Publishers, 1986, pp. 184-190.
A. Broder and M. Mitzenmacher, "Pattern-Based Compression of Text Images," Proceedings DCC'96 Data Compression Conference (IEEE), Snowbird, Utah, Mar. 31-Apr. 3, 1996, pp. 300-309.
M. Atallah, Y. Genin, and W. Szpankowski, "Pattern Matching Image Compression," Proceedings DCC'96 Data Compression Conference (IEEE), Snowbird, Utah, Mar. 31-Apr. 3, 1996, p. 421.
