Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2004-11-10
2008-10-21
Chace, Christian P. (Department: 2165)
Data processing: database and file management or data structures
Database design
Data structure types
Reexamination Certificate
active
07440967
ABSTRACT:
A method for converting a legacy document into an XML document, includes decomposing the conversion process into a plurality of individual conversion tasks. A legacy document is decomposed into a plurality of document portions. A target XML schema including a plurality of schema components is provided. Local schema are generated from the target XML schema, wherein each local schema includes at least one of the schema components in the target XML schema. A plurality of conversion tasks is generated by associating a local schema and an applicable document portion, wherein each conversion task associates data from the applicable document portion with the applicable schema component in the local schema. For each conversion task, a conversion method is selected and the conversion method is performed on the applicable document portion and local schema. Finally, the results of all the individual conversion tasks are assembled into a target XML document.
REFERENCES:
patent: 5560006 (1996-09-01), Layden et al.
patent: 6675353 (2004-01-01), Friedman
patent: 6694325 (2004-02-01), Jas
patent: 2003/0126556 (2003-07-01), Soetarman et al.
patent: 2003/0167445 (2003-09-01), Su et al.
patent: 2003/0212698 (2003-11-01), Mani et al.
patent: 2004/0010754 (2004-01-01), Jones
patent: 2004/0093331 (2004-05-01), Garner et al.
patent: 2006/0167909 (2006-07-01), Mendis et al.
U.S. Appl. No. 10/756,393, filed Jan. 14, 2004, Boris Chidlovskii, et al.
C. Baru, A. Gupta, B. Ludäescher, R. Marciano, Y. Papakonstantinou, and P. Velikhov, “XML-Based Information Mediation with MIX”, ACM SIGMOD Proceedings, Philadelphia, PA, 1999, p. 1-5.
H. Do, S. Melnik, and E. Rahm, “Comparison of Schema Matching Evaluations”, In Proceedings of the 2ndInt. Workshop on Web Databases (German Informatics Society), 2002, p. 1-15.
E. Rahm, P.A. Bernstein, “A survey of approaches to automatic schema matching”, VLDB Journal 10:4 (2001), p. 334:350.
Stephen Soderland, “Learning Information Extraction Rules for Semi-structured and Free Text”, Machine Learning, vol. 34, 1-3, 1999, p. 1-44.
Altamura, et al., “Transforming Paper Documents into XML Format with Wisdom++”, Internet Citation [online] 2000, XP002317735, Retrieved from the Internet: URL:http://citeseer.ist.psu.edu/cac.PDF.
Chanod, et al., “From Legacy Documents to XML: A Conversion Framework”, Research and Advanced Technology for Digital Libraries Lecture Notes in Computer Science; LNCS, Springer-Verlag, BE, vol. 3652, 2005, pp. 92-103, XP019018385, ISBN: 3-540-28767-1.
Chung, et al., “Reverse Engineering for Web Data: From Visual to Semantic Structures”, Proceeding 18th, International Conference on Data Engineering. (ICDE '2002), San Jose, CA, Feb. 26-Mar. 1, 2002, ICDE, Los Alamitos, CA, IEEE Comp. Soc, US, vol. Conf. 18, Feb. 26, 2002, pp. 53-63, XP010588199, ISBN: 0-7695-1531-2.
Ishitani, Yasuto, “Document Transformation System from Papers to XML Data Based on Pivot XML Document Method”, Document Analysis and recognition, 2003. Proceedings, Seventh International Conference on Aug. 3-6, 2003, Piscataway, NJ, USA, IEEE, Aug. 3, 2003, pp. 250-255, XP010656617, ISBN: 0-7695-1960-1.
Chidlovskii, Boris, “Schema Extraction for XML Data: A Grammatical Inference Approach”, KRDB '01 Workshop (Knowledge Representation and Databases), [Online] Sep. 15, 2001, XP002442301, Rome, Italy, Retrieved from the Internet: URL:http://www.xrce.com/Publications/Attachments/2001-200/schemaExtr.pdf> [retrieved on Jul. 12, 2007].
Chidlovskii, et al., “Supervised Learning for the Legacy Document Conversion”, ACM Symposium on Document Engineering, [Online] Oct. 28, 2004, Oct. 30, 2004 XP002442300, Milwaukee, Wisconcin, USA, Retrieved from the Internet: URL:http://doi.acm.org/10.1145/1030397.1030439> [retrieved on Jul. 12, 2007].
Anderson Kellye
Chace Christian P.
Walder Jeannette
Xerox Corporation
LandOfFree
System and method for transforming legacy documents into XML... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for transforming legacy documents into XML..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for transforming legacy documents into XML... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4012982