Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Reexamination Certificate
2006-04-18
2006-04-18
McFadden, Susan (Department: 2655)
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
C707S793000, C715S252000
Reexamination Certificate
active
07031909
ABSTRACT:
The present invention provides a method, system and computer program for naming a cluster, or a hierarchy of clusters, of words and phrases that have been extracted from a set of documents. The invention takes these clusters as the input and generates appropriate labels for the clusters using a lexical database. Naming involves first finding out all possible word senses for all the words in the cluster, using the lexical database; and then augmenting each word sense with words that are semantically similar to that word sense to form respective definition vectors. Thereafter, word sense disambiguation is done to find out the most relevant sense for each word. Definition vectors are clustered into groups. Each group represents a concept. These concepts are thereafter ranked based on their support. Finally, a pre-specified number of words and phrases from the definition vectors of the dominant concepts are selected as labels, based on their generality in the lexical database. Therefore, the labels may not necessarily consist of the original words in the cluster. A hierarchy of clusters is named in a recursive fashion starting from leaf clusters. Dominant concepts in child clusters are propagated into their parent to reduce the labeling complexity of parent clusters.
REFERENCES:
patent: 5056021 (1991-10-01), Ausborn
patent: 5237503 (1993-08-01), Bedecarrax et al.
patent: 5721902 (1998-02-01), Schultz
patent: 5794050 (1998-08-01), Dahlgren et al.
patent: 5873056 (1999-02-01), Liddy et al.
patent: 6076088 (2000-06-01), Paik et al.
patent: 6260008 (2001-07-01), Sanfilippo
patent: 6510406 (2003-01-01), Marchisio
patent: 6675159 (2004-01-01), Lin et al.
Chung Christina
Luk Alpha
Mao Jianchang
Taank Sumit
Botjer William L.
McFadden Susan
Verity, Inc.
LandOfFree
Method and system for naming a cluster of words and phrases does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and system for naming a cluster of words and phrases, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for naming a cluster of words and phrases will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3573617