Electrical computers and digital processing systems: multicomput – Remote data accessing – Accessing a remote server
Reexamination Certificate
1999-12-09
2003-05-13
Vu, Viet D. (Department: 2154)
Electrical computers and digital processing systems: multicomput
Remote data accessing
Accessing a remote server
C709S225000, C709S241000, C707S793000
Reexamination Certificate
active
06564257
ABSTRACT:
TECHNICAL FIELD
This invention relates to repository security, and more particularly to the protection of searchable repositories such as those found on the Internet.
BACKGROUND OF THE INVENTION
Internet search engines (e.g. Hotbot, Yahoo) spend a great deal of time and effort developing and maintaining repositories of information stored on their own servers. These repositories contain summary data about network resources such as documents or web pages found on the Internet. The data includes links, also called hyperlinks or uniform resource locators (URLs) that are essentially addresses where the documents can be found, as the documents are not stored in the repositories but on other servers.
The repositories are created and maintained by using a web crawler or gatherer to access documents on a large scale from a large number of servers on the Internet. A crawler or gatherer typically will perform an initial query to obtain an initial resultant set of documents, download and analyze the results to generate the summary data, extract and store the URLs contained within, query the URLs for more results, and proceed in a recursive process to gather as many URLs as possible.
The key to obtaining the URLs is in a document's page specification, which is how the page will be assembled when viewed. A common page specification language is HyperText Markup Language, or HTML. In HTML, the URL is coded as an HREF tag. Other page specification languages use similar tags to indicate the presence of a URL.
Crawling is an expensive and time-consuming process, and thus the search engine repositories (as well as other Internet or intranet repositories or databases) are very valuable, as millions of end users access them every day. End users access the search engine's repository by means of a query. The search engine presents results in the form of a list of summary data, and the user chooses the appropriate item from among the results. Users thus typically access documents in a limited manner, sequentially searching and examining documents until the desired item is found, in contrast with the web crawlers or gatherers, which access documents in a wholesale fashion.
Unfortunately, in addition to building a repository or database, a web crawler or gather can be used to systematically extract and replicate all the information from someone else's repository or database, by the same querying/parsing/extracting process described above. Thus it is desirable to provide a means to protect an Internet or intranet repository or database from wholesale access yet still provide limited access for the typical end user.
SUMMARY OF THE INVENTION
A method and system for protecting a searchable repository containing a document locator when a user searches the repository for the document locator, by replacing the document locator with a unique time-sensitive key are described. The document locator may be a uniform resource locator, or URL. A user search request is intercepted, each URL in the original search result is extracted and replaced with a key, and the altered result returned to the user. When the user selects the key from the search result within the expiration interval, the associated URL and document are able to be retrieved.
REFERENCES:
patent: 5761436 (1998-06-01), Nielsen
patent: 5793964 (1998-08-01), Rogers et al.
patent: 5812776 (1998-09-01), Gifford
patent: 5855020 (1998-12-01), Kirsch
patent: 5864852 (1999-01-01), Luotonen
patent: 5870546 (1999-02-01), Kirsch
patent: 5870559 (1999-02-01), Leshem et al.
patent: 5878219 (1999-03-01), Vance, Jr. et al.
patent: 6078866 (2000-06-01), Buck et al.
patent: 6157930 (2000-12-01), Ballard et al.
patent: 6360254 (2002-03-01), Linden et al.
patent: WO97/29414 (1997-08-01), None
“Persistent Context for World Wide Web Browsers”, IBM Technical Disclosure Bulletin, vol. 40, No. 02, Feb. 1997, pp. 215-216.
“Virtual URL's for Browsing and Searching Large Information Spaces”, Research Disclosure, Sep. 1998, pp. 1238-1239.
Emens Michael Lawrence
Kraft Reiner
Mortinger Alison D.
Vu Viet D.
LandOfFree
Repository protection by URL expiration does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Repository protection by URL expiration, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Repository protection by URL expiration will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3090558