Data file processing

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06532473

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to a method and apparatus for processing data files, the data files being generated in accordance with different protocols. In particular, the present invention relates to the generation of and reading of XML (eXtensible Mark-up Language) files generated in accordance with different document structures (e.g. DTD's, schemas).
DESCRIPTION OF THE PRIOR ART
XML is a mark-up language which is used for transferring structured data. The XML files include a number of mark-up tags (e.g. “elements”, “attributes” and “entities” etc.) which are associated with respective data. Each respective mark-up tag has a specific meaning within the context of the particular XML file and this allows third parties to determine the nature of the data associated with the respective mark-up tags.
The mark-up tags associated with an XML file are defined by the XML document type definition or schema. This leads to a large number of different file formats being available, which in turn leads to many problems with the extraction and storage of data from the files.
Currently, data is extracted from XML files by parsing the file to locate specific mark-up tags and then extracting and storing the corresponding data accordingly. However, a respective data loader must be provided for each specific type of XML file. As a result a large number of different data loaders can be required for extracting and storing data from a number of different XML files.
As the use of electronic information interchange increases rapidly, users are now desiring that data is transferred to and from the systems in a variety of XML formats. Accordingly, it is desirable to be able to process different XML file formats using a single piece of software.
SUMMARY OF THE INVENTION
In accordance with a first aspect of the present invention, we provide a method of processing data files, the data files being generated in accordance with different protocols, each protocol defining a number of mark-up tags and each data file including a number of respective mark-up tags, each mark-up tag having respective data associated therewith, at least some of the data files also including a protocol definition indicating the protocol used to generate the data file, wherein the method comprises storing the data contained in a data file by:
a. receiving the data file;
b. determining the protocol definition;
c. using the protocol definition to determine storage location(s) or processing action(s) for each of the mark-up tags of the data file;
d. using the protocol definition to determine contextual information for each of the mark-up tags of the data file;
e. extracting the marked up data contained within the data file; and,
f. storing or processing the data in accordance with the determined contextual information and at least one of the storage location(s) or the processing action(s).
In accordance with a second aspect of the present invention, we provide apparatus for processing data files, the data files being generated in accordance with different protocols, each protocol defining a number of mark-up tags and each data file including a number of respective mark-up tags, each mark-up tag having respective data associated therewith, at least some of the data files also including a protocol definition indicating the protocol used to generate the data file, the apparatus comprising:
a. a processor; and,
b. a store, the processor being adapted to storing data contained in a data file by:
i. receiving the data file;
ii. determining the protocol definition;
iii. using the protocol definition to determine storage location(s) or processing action(s) for each of the mark-up tags of the data file;
iv. using the protocol definition to determine contextual information for each of the mark-up tags of the data file;
v. extracting the data contained within the mark-up tags of the data file; and,
vi. storing or processing the data in accordance with the determined contextual information and at least one of the storage location(s) or the processing action(s).
Accordingly, the present invention provides a method and apparatus for storing data contained in a data file, and in particular a structured data file. The system operates by examining the data file to determine a protocol definition which is then used to determine contextual information and storage location(s) (also referred to as locational information) or processing action(s) for each of the marked-up tags in the data file. The data associated with the marked-up tags is then stored or processed in accordance with this contextual information and the storage location(s) or processing action(s), as appropriate. As the protocol definition is different for each different type of data file, different mark-up tags contained in different types of data file can resolve to the same contextual information and storage location(s) or processing action(s) allowing the data to be stored or processed in the same way irrespective of the type of data file.
If the data file does not include a protocol definition, then the method of determining contextual and locational information for each of the mark-up tags of the data file typically comprises parsing the data file to locate the mark-up tags and thus generating a protocol definition for use with this file, considering each mark-up tag and the data associated therewith to determine contextual and locational information for each of the mark-up tags. Accordingly, if no protocol definition can be generated to determine contextual information, it is then necessary to look at each of the mark-up tags and the data contained therein to determine the contextual and locational information directly.
Typically, when no protocol definition is available or can be generated the contextual and locational information is determined by requesting the input of contextual and locational information from an external source, such as the user. Thus, the user of the apparatus and/or method would examine the data and the mark-up tags and use their own knowledge of the database receiving system and the way in which data is stored therein to determine the contextual and locational information appropriate to the given data.
Once this has been completed, a protocol definition can then be defined for the mark-up tags for which contextual and locational information has been derived. This can then be used in subsequent processing of data files.
In accordance with a third aspect of the present invention, we provide a method of processing data files, the data files being generated in accordance with different protocols, each protocol defining a number of mark-up tags and each data file including a number of respective mark-up tags, each mark-up tag having respective data associated therewith, at least some of the data files also including a protocol definition indicating the protocol used to generate the data file, wherein the method comprises generating a data file by:
a. determining the protocol definition of the protocol to be used;
b. locating the data to be incorporated into the file, the data being stored in accordance with contextual and locational information;
c. using the protocol definition and the contextual information to determine the mark-ups with which the data should be associated;
d. generating a data file by associating the data with respective mark-up tags in accordance with the contextual and locational information.
In accordance with a fourth aspect of the present invention, we provide apparatus for processing data files, the data files being generated in accordance with different protocols, each protocol defining a number of mark-up tags and each data file including a number of respective mark-up tags, each mark-up tag having respective data associated therewith, at least some of the data files also including a protocol definition indicating the protocol used to generate the data file, the apparatus comprising:
a. a processor; and,
b. a data dictionary, the processor being adapted to generate data files by:
i. determining the protocol def

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Data file processing does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Data file processing, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Data file processing will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3075392

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.