Data processing: speech signal processing – linguistics – language – Linguistics – Dictionary building – modification – or prioritization
Reexamination Certificate
2007-09-28
2011-12-13
Sked, Matthew (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Dictionary building, modification, or prioritization
C707S747000
Reexamination Certificate
active
08078454
ABSTRACT:
Data compression and key word recognition may be provided. A first pass may walk a text string, generate terms, and calculate a hash value for each generated term. For each hash value, a hash bucket may be created where an associated occurrence count may be maintained. The hash buckets may be sorted by occurrence count and a few top buckets may be kept. Once those top buckets are known, a second pass may walk the text string, generate terms, and calculate a hash value for each term. If the hash values of terms match hash values of one of the kept buckets, then the term may be considered a frequent term. Consequently, the term may be added to a dictionary along with a corresponding frequency count. Then, the dictionary may be examined to remove terms that may not be frequent, but appeared due to hash collisions.
REFERENCES:
patent: 4843389 (1989-06-01), Lisle et al.
patent: 5287499 (1994-02-01), Nemes
patent: 5333313 (1994-07-01), Heising
patent: 5561421 (1996-10-01), Smith et al.
patent: 5951623 (1999-09-01), Reynar et al.
patent: 6047298 (2000-04-01), Morishita
patent: 6121901 (2000-09-01), Welch et al.
patent: 6879271 (2005-04-01), Abdat
patent: 7003522 (2006-02-01), Reynar et al.
patent: 7031910 (2006-04-01), Eisele
patent: 7032174 (2006-04-01), Montero et al.
patent: 7181388 (2007-02-01), Tian
patent: 7403137 (2008-07-01), Huang
patent: 7451075 (2008-11-01), Mohammed
patent: 7584184 (2009-09-01), Takuma et al.
patent: 2003/0101413 (2003-05-01), Klein et al.
patent: 2003/0125929 (2003-07-01), Bergstraesser et al.
patent: 2004/0006547 (2004-01-01), Dehlinger et al.
patent: 2005/0027731 (2005-02-01), Revel
patent: 2008/0065639 (2008-03-01), Choudhary et al.
patent: 2007-094838 (2007-04-01), None
patent: 10-2004-00117769 (2004-02-01), None
International Search Report dated Mar. 24, 2009 cited in Application No. PCT/US2008/074586.
J. L. Martinez-Fernández et al., “Automatic Keyword Extraction for News Finder.” 20 pgs., http://canada.esat.kuleuven.be/omnipaper/downloads/WP7—AKE—AMR—1.pdf.
Suzanne Bunton et al., “Practical Dictionary Management for Hardware Data Compression.” Communications of the ACM. Jan. 1992, vol. 35, No. 1, pp. 95-104, http://delivery.acm.org/10.1145/130000/129622/p95-bunton.pdf?key1=129622&key2=6460113811&coli=GUIDE&d1=GUIDE&CFID=72243252&CFTOKEN=63873519.
“Smart Tags—Buttons with a Brain! Improve Productivity Within Your Office XP 2002/3 Documents!”, 3 pgs. http://www.download3k.com/Press-Smart-Tags-Buttons-with-a-Brain-IMPROVE.html.
Chinese First Office Action dated Jun. 24, 2011 cited in Application No. 200880109407.0.
Merchant & Gould
Microsoft Corporation
Sked Matthew
LandOfFree
Two-pass hash extraction of text strings does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Two-pass hash extraction of text strings, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Two-pass hash extraction of text strings will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4258576