Method, system and program for providing indexed web page...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C709S203000, C709S229000, C711S118000, C382S219000

Reexamination Certificate

active

06823341

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Technical Field
The present invention relates to an improved method, system and program for indexing web page contents, and in particular to an improved method, system and program for providing indexed web page contents to a search engine database. Still more particularly, the present invention relates to a method, system and program for indexing web page contents of each web page requested from the Internet by a user and providing the indexed web page contents to a search engine database.
2. Description of the Related Art
In the prior art, it has been well known that computer systems can be utilized to manage indices of records of databases. Many techniques are known to parse, index, and search databases. However, managing extremely large databases presents special problems.
In recent years, a unique distributed database has emerged in the form of the World-Wide-Web(Web). The database records of the Web are in the form of web pages accessible via the Internet. Here, tens of millions of pages are accessible by anyone having a communications link to the Internet.
The pages are dispersed over millions of different computer systems all over the world. Users of the Internet constantly desire to locate specific pages containing information of interest. However, a current problem with the Web is the lack of ability to search and browse (collectively these activities are referred to as “navigate”) the information in the Web. Searching can be described as looking for the resources that contain particular information of interest, such as a specific set of keywords, while browsing is a less focused “looking around.”
Currently, it is impossible to efficiently navigate all of the Web. The amount of information available through web pages and other data available through the Internet grows by vast amounts each day. In an effort to provide a directory to data on the Internet, many search engines have been created whereby a user can search web pages by a keyword, phrase, topic, etc. However, each search engine typically only accesses a directory of web pages that have been previously “crawled” or indexed for that search engine or manually created. The indexing data is typically stored in a database which can be searched in various ways to provide users with locations of web pages which may be relevant to a user's particular interest.
With the amount of information on the Internet growing exponentially, there is little chance that the vast majority of the information will be effectively indexed with the techniques utilized today by the various search engine sites that rely on centralized computers accessing and indexing the distributed content of the Internet in a limited manner. This, in turn, leads to the current statistics which estimate that only 15-20% of the information available on the Internet is readily accessible via current search engine indexing methods.
However, users access a multitude of web pages that have not been indexed by any search engine. Therefore, it would be desirable to retrieve index data from each web page that a user accesses over the Internet in order to update search engine databases with pages that have not yet been indexed. Further, it would be desirable to free bandwidth typically utilized to crawl for non-indexed pages and shift to utilizing data retrieved from user accesses. In particular, indexing pages retrieved from user accesses would be both more efficient and potentially allow a derivation of the value of particular web pages based on the number of times indexes of a particular page are returned to a search engine from different users. Effectively, by receiving index data created during user accesses, the creation of a usefulness value for web pages within search engines may be determined.
SUMMARY OF THE INVENTION
In view of the foregoing, it is therefore an object of the present invention to provide an improved method, system and program for indexing web page contents.
It is another object of the present invention to provide an improved method, system and program for providing web page contents to a search engine database.
It is yet another object of the present invention to provide a method, system and program for indexing web page contents of each web page requested from the Internet by a user and providing the indexed web page contents to a search engine database.
In accordance with the method, system and program of the present invention, in response to each user request for a web page, user access to the web page is provided from a temporary copy of the web page which is stored on a device which accesses the web page and which is accessible to the user. Indexing data is then automatically recorded at that device from the temporarily stored copy of the accessed web page, wherein the indexing data corresponds to contents of the accessed web page. The indexing data is thereafter transmitted from the device to a remote data storage device which provides a search engine database. According to one object of the invention, the indexed data is incorporated into the search engine database, such that previously unknown indexed web page contents are provided to a search engine database in response to a user access of that web page. According to another object of the invention, a statistical count of a number of times that indexing data for a particular web page is provided to a search engine database is maintained.
All objects, features, and advantages of the present invention will become apparent in the following detailed written description.


REFERENCES:
patent: 5692073 (1997-11-01), Cass
patent: 5745899 (1998-04-01), Burrows
patent: 5761418 (1998-06-01), Francis et al.
patent: 5765158 (1998-06-01), Burrows
patent: 5806065 (1998-09-01), Lomet
patent: 5845273 (1998-12-01), Jindal
patent: 5864863 (1999-01-01), Burrows
patent: 5941944 (1999-08-01), Messerly
patent: 5991810 (1999-11-01), Shapiro et al.
patent: 6141333 (2000-10-01), Chavez, Jr.
patent: 6286006 (2001-09-01), Bharat et al.
patent: 2001/0023476 (2001-09-01), Rosenzweig
patent: 2001/0026591 (2001-10-01), Keren et al.
Ardo et al; “Regional Ditributed WWW Search and Indexing Service—the DESIRE Way”; Computer Networks and ISDN Systems; vol. 30, No. 1-7, pp. 173-183; 1998.
Rodriguez et al; “AlephWeb: a Search Engine Based on the Federatedd Structure”; Proceedings of JENC7. 7th Joint European Conference, Networking in the Information Society, pp. 112/1-11210; 1996.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method, system and program for providing indexed web page... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method, system and program for providing indexed web page..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method, system and program for providing indexed web page... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3350131

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.