System and methods for easy-to-use periodic network data...

Electrical computers and digital processing systems: multicomput – Remote data accessing – Accessing a remote server

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C709S224000, C719S329000

Reexamination Certificate

active

06810414

ABSTRACT:

TECHNICAL FIELD
This invention relates to capture and storage of information retrieved from a network.
BACKGROUND
The World Wide Web (WWW) is a collection of Hypertext Mark-Up Language (HTML) documents resident on computers that are distributed over networks such as the Internet. The WWW has become a vast repository for knowledge. Web pages provide information spanning the realm of human knowledge from information on foreign countries to information about the community in which one lives. The number of Web pages providing information over the Internet has increased exponentially since the World Wide Web's inception in 1990. Multiple Web pages are sometimes linked together to form a Web site, which is a collection of Web pages devoted to a particular topic or theme.
Accordingly, the collection of existing and future World Wide Web pages represents one of the largest databases in the world. However, access to the data residing on individual Web pages is hindered by the fact that World Wide Web pages are not a structured source of data. That is, there is no defined “structure” for organizing information provided by the Web page, as there is in traditional, relational databases. For example, different Web pages may provide the same geographic information about a particular country, but the information may appear in various locations of each page and may be organized differently from page to page. One particular example of this is that one Web site may provide relevant information on one Web page, i.e. in one HTML document, while another Web site may provide the same information distributed over multiple, interrelated Web pages.
These problems are not limited to retrieving data from HTML documents distributed over the Internet. Larger organizations have begun building “intranets”, which are collections of linked HTML documents internal to the organization. While “intranets” are intended to provide a member of an organization with easy access to information about the organization, the problems discussed above with respect the WWW apply to “intranets”. Requiring members of the organization to learn the data context of each Web page, or requiring them to learn a specialized query language for accessing Web pages, would defeat the purpose of the “intranet” and would be virtually impossible on the Internet.
The periodic retrieval of Web pages and extraction of useful information are hindered by several difficulties that have not been solved by prior art. In particular, a large percentage of Web pages are dynamically created. Those Web pages contain data that depends upon input parameters sent to the Web server. Thus, a single uniform resource locator (URL) may, with appropriate parameters, return many data sets. Further, the pages returned may vary in format. For example, some pages may have additional elements, while other pages have had elements deleted. In addition, valuable information may be contained in graphical elements, such as JPEG or BMP images. This information often does not exist in text form in the page data.
SUMMARY
A method for capturing and storing data from a network includes specifying at least one target data accessible from a network location addressable by a network address. The method also includes capturing the target data from data received from the network location at specified dates and times.
In some embodiments, the method further includes easy-to-use graphical user interfaces; integration with Web browsers; point-and-click selection of data targets; automatic input element parameter substitution to retrieve multiple pages from a single network address; periodic Web page retrieval from Internet servers at pre-specified intervals; target data matching; intelligent character recognition of graphical HTML or XML elements; graphical database, database table and table record creation; and automatic creation of formatted data files or direct storage to database.
The present invention also includes a data capture and storage system. The system includes a graphical interface element configured to display at least one target page. The system also includes a selection device and a processor. The selection device operates to enable selection of target data on the target page for capture and storage. The processor is coupled to the graphical interface element, and is capable of being programmed with a plurality of configurations to locate, extract, and store the target data according to the plurality of configurations.


REFERENCES:
patent: 5713019 (1998-01-01), Keaten
patent: 5764906 (1998-06-01), Edelstein
patent: 5809250 (1998-09-01), Kisor
patent: 5893091 (1999-04-01), Hunt et al.
patent: 5895461 (1999-04-01), De La Huerga et al.
patent: 5905492 (1999-05-01), Straub et al.
patent: 5905866 (1999-05-01), Nakabayashi et al.
patent: 5913214 (1999-06-01), Madnick et al.
patent: 5933531 (1999-08-01), Lorie
patent: 5978807 (1999-11-01), Mano et al.
patent: 5983247 (1999-11-01), Yamanaka et al.
patent: 6041331 (2000-03-01), Weiner et al.
patent: 6081788 (2000-06-01), Appleman et al.
patent: 6128624 (2000-10-01), Papierniak et al.
patent: 6144990 (2000-11-01), Brandt et al.
patent: 6163779 (2000-12-01), Mantha et al.
patent: 6185585 (2001-02-01), Sequeira
patent: 6272484 (2001-08-01), Martin et al.
patent: 6286046 (2001-09-01), Bryant
patent: 6304864 (2001-10-01), Liddy et al.
patent: 6311194 (2001-10-01), Sheth et al.
patent: 6487566 (2002-11-01), Sundaresan
patent: 6510406 (2003-01-01), Marchisio
patent: 6538673 (2003-03-01), Maslov
patent: 6549896 (2003-04-01), Candan et al.
patent: 6549941 (2003-04-01), Jaquith et al.
patent: 8571243 (2003-05-01), Gupta et al.
patent: 6651059 (2003-11-01), Sundaresan et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and methods for easy-to-use periodic network data... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and methods for easy-to-use periodic network data..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and methods for easy-to-use periodic network data... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3265472

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.