Dynamic corpus generation

Data processing: database and file management or data structures – Database and file access – Search engines

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S739000

Reexamination Certificate

active

07941418

ABSTRACT:
A computer-implemented method of generating a dynamic corpus includes generating web threads, based upon corresponding sets of words dequeued from a word queue, to obtain web thread resulting URLs. The web thread resulting URLs are enqueued in a URL queue. Multiple text extraction threads are generated, based upon documents downloaded using URLs dequeued from the URL queue, to obtain text files. New words are randomly obtained from the text files, and the randomly obtained words from the text files are enqueued in the word queue. This process is iteratively performed, resulting in a dynamic corpus.

REFERENCES:
patent: 6263364 (2001-07-01), Najork et al.
patent: 6272456 (2001-08-01), de Campos
patent: 6321265 (2001-11-01), Najork et al.
patent: 6516312 (2003-02-01), Kraft et al.
patent: 7158930 (2007-01-01), Pentheroudakis et al.
patent: 2003/0033288 (2003-02-01), Shanahan et al.
patent: 2004/0161150 (2004-08-01), Cukierman et al.
patent: 2004/0210434 (2004-10-01), Wang et al.
patent: 2005/0197829 (2005-09-01), Okumura
patent: 2005/0251384 (2005-11-01), Yang

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Dynamic corpus generation does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Dynamic corpus generation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Dynamic corpus generation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2674770

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.