Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Reexamination Certificate
2008-03-18
2008-03-18
Edouard, Patrick N. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
C715S252000
Reexamination Certificate
active
10250746
ABSTRACT:
In one aspect, the present invention provides a for estimating the similarity between at least two portions of text including the steps of forming a set of syntactic tuples, each tuple including at least two terms and a relation betweeen the two terms; classifying the relation between the terms in the tuples according to a predefined set of relations; establishing the relative agreement between syntactic tuples from the portions of text under comparison according to predefined classes of agreement; calculating a value representative of the similarity between the portions of text of each of the classes of agreement; and establishing a value for the similarity between the portions of text by calculating a weighted sum of the values representative of the similarity between the portions of text for each of the classes of agreement. Preferaly, the step of calculating a value representative of the similarity between the portions of text for each of the classes of agreement includes a weighting based upon the number of matched terms occurring in particular parts of speech in which the text occurs. It is also preferred that the step of calculating a value representative of the similarity between the portions of text for each of the classes of agreement include the application of a weighting factor to the estimate of similarity for each of the classes of agreement and the parts of speech in which matched terms occur.
REFERENCES:
patent: 5293552 (1994-03-01), Aalbersberg
patent: 5297039 (1994-03-01), Kanaegami
patent: 5418716 (1995-05-01), Suematsu
patent: 5519608 (1996-05-01), Kupiec
patent: 5619709 (1997-04-01), Caid et al.
patent: 5675819 (1997-10-01), Schuetze
patent: 5794178 (1998-08-01), Caid et al.
patent: 5799312 (1998-08-01), Rigoutsos
patent: 5835905 (1998-11-01), Pirolli et al.
patent: 5864855 (1999-01-01), Ruocco et al.
patent: 5893095 (1999-04-01), Jain et al.
patent: 5895446 (1999-04-01), Takeda et al.
patent: 5895470 (1999-04-01), Pirolli et al.
patent: 5905863 (1999-05-01), Knowles et al.
patent: 5933822 (1999-08-01), Braden-Harder et al.
patent: 5943669 (1999-08-01), Numata
patent: 5966686 (1999-10-01), Heidorn et al.
patent: 6076051 (2000-06-01), Messerly et al.
patent: 6233546 (2001-05-01), Datig
patent: 6678679 (2004-01-01), Bradford
patent: 6871174 (2005-03-01), Dolan et al.
patent: 0530993 (1992-08-01), None
patent: 0687987 (1995-06-01), None
Chris Buckley et al.,Automatic Routing and AD-hoc Retrieval Using SMART: Trec 2, Text Retrieval Conference, Gaithersburg, MD., (Aug. 31-Sep. 2, 1993), <http://trec.nist.gov/pubs/trec2/papers/txt/04.text>.
Kanagasabai Rajaraman
Pan Hong
Agency for Science Technology and Research
Crockett & Crockett
Crockett, Esq. K. David
Edouard Patrick N.
Yen E.
LandOfFree
Method of text similarity measurement does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method of text similarity measurement, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of text similarity measurement will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3910219