Two-pass hash extraction of text strings

Data processing: speech signal processing – linguistics – language – Linguistics – Dictionary building – modification – or prioritization

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S747000

Reexamination Certificate

active

08078454

ABSTRACT:
Data compression and key word recognition may be provided. A first pass may walk a text string, generate terms, and calculate a hash value for each generated term. For each hash value, a hash bucket may be created where an associated occurrence count may be maintained. The hash buckets may be sorted by occurrence count and a few top buckets may be kept. Once those top buckets are known, a second pass may walk the text string, generate terms, and calculate a hash value for each term. If the hash values of terms match hash values of one of the kept buckets, then the term may be considered a frequent term. Consequently, the term may be added to a dictionary along with a corresponding frequency count. Then, the dictionary may be examined to remove terms that may not be frequent, but appeared due to hash collisions.

REFERENCES:
patent: 4843389 (1989-06-01), Lisle et al.
patent: 5287499 (1994-02-01), Nemes
patent: 5333313 (1994-07-01), Heising
patent: 5561421 (1996-10-01), Smith et al.
patent: 5951623 (1999-09-01), Reynar et al.
patent: 6047298 (2000-04-01), Morishita
patent: 6121901 (2000-09-01), Welch et al.
patent: 6879271 (2005-04-01), Abdat
patent: 7003522 (2006-02-01), Reynar et al.
patent: 7031910 (2006-04-01), Eisele
patent: 7032174 (2006-04-01), Montero et al.
patent: 7181388 (2007-02-01), Tian
patent: 7403137 (2008-07-01), Huang
patent: 7451075 (2008-11-01), Mohammed
patent: 7584184 (2009-09-01), Takuma et al.
patent: 2003/0101413 (2003-05-01), Klein et al.
patent: 2003/0125929 (2003-07-01), Bergstraesser et al.
patent: 2004/0006547 (2004-01-01), Dehlinger et al.
patent: 2005/0027731 (2005-02-01), Revel
patent: 2008/0065639 (2008-03-01), Choudhary et al.
patent: 2007-094838 (2007-04-01), None
patent: 10-2004-00117769 (2004-02-01), None
International Search Report dated Mar. 24, 2009 cited in Application No. PCT/US2008/074586.
J. L. Martinez-Fernández et al., “Automatic Keyword Extraction for News Finder.” 20 pgs., http://canada.esat.kuleuven.be/omnipaper/downloads/WP7—AKE—AMR—1.pdf.
Suzanne Bunton et al., “Practical Dictionary Management for Hardware Data Compression.” Communications of the ACM. Jan. 1992, vol. 35, No. 1, pp. 95-104, http://delivery.acm.org/10.1145/130000/129622/p95-bunton.pdf?key1=129622&key2=6460113811&coli=GUIDE&d1=GUIDE&CFID=72243252&CFTOKEN=63873519.
“Smart Tags—Buttons with a Brain! Improve Productivity Within Your Office XP 2002/3 Documents!”, 3 pgs. http://www.download3k.com/Press-Smart-Tags-Buttons-with-a-Brain-IMPROVE.html.
Chinese First Office Action dated Jun. 24, 2011 cited in Application No. 200880109407.0.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Two-pass hash extraction of text strings does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Two-pass hash extraction of text strings, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Two-pass hash extraction of text strings will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4258576

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.