Automatic clustering of tokens from a corpus for grammar...

Data processing: speech signal processing – linguistics – language – Linguistics – Natural language

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S001000, C704S010000, C704S231000, C704S245000

Reexamination Certificate

active

07966174

ABSTRACT:
A system for recognizing patterns is disclosed. Grammar learning from a corpus includes, for the other non-context words, generating frequency vectors for each non-context token in a corpus based upon counted occurrences of a predetermined relationship of the non-context tokens to identified context tokens. Clusters are grown from the frequency vectors according to a lexical correlation or a cluster tree among the non-context tokens. The cluster tree is used for pattern recognition.

REFERENCES:
patent: 5128865 (1992-07-01), Sadler
patent: 5195167 (1993-03-01), Bahl et al.
patent: 5325298 (1994-06-01), Gallant et al.
patent: 5619709 (1997-04-01), Caid et al.
patent: 5835893 (1998-11-01), Ushioda
patent: 5839106 (1998-11-01), Bellegarda
patent: 5860063 (1999-01-01), Gorin et al.
patent: 6014647 (2000-01-01), Nizzari et al.
patent: 6052657 (2000-04-01), Yamro et al.
patent: 6073091 (2000-06-01), Kanevsky et al.
patent: 6094653 (2000-07-01), Li et al.
patent: 6178396 (2001-01-01), Ushioda
patent: 6182091 (2001-01-01), Pitkow et al.
patent: 6470383 (2002-10-01), Leshem et al.
patent: 6816830 (2004-11-01), Kempe
“Dimensions of Meaning,” Hinrich Schutze, Center for the Study of Language and Information, Ventura Hall. Proceedings of the 1992 ACM/IEEE conference on Supercomputing, Minneapolis, Minnesota. 1992.
“Grammar Fragment Acquisition using Syntactic and Semantic Clustering,” Jeremy H. Wright, Giuseppe Riccardi, Allen L. Gorin AT&T Laboratories-Research, 180 Park Ave., Florham Park, NJ 07932, USA, & Kazuhiro Arai NTT Human Interface Laboratories, 1-1 Hikari-no-oka, Yokosuka, Kanagawa 239-0847, Japan. Received Oct. 14, 1997, revised Jul. 16, 1998, accepted Sep. 2, 1998, available online Feb. 17, 1999.
“Improved Clustering Techniques for Class-Based Statistical Language Modeling,” Reinhard Kneser and Hermann Ney. Philips GmbH Forschungslboratorien, Aachen, Germany. Eurospeech '93 Third European Conference on Speech Communication and Technology, Berlin, Germany. Sep. 22-25, 1993.
“Aggregate and Mixed Order Markov Models for Statistical Language Processing,” Lawrence Saul and Fernando Pereira. AT&T Labs Research, 180 Park Ave, D-130, Florham Park, NJ 07932. 1997.
“Empirical Acquisition of Word and Phrase Classes in the Atis Domain,” Michael K. McCandless and James R. Glass, EUROSPEECH '93 Third European Conference on Speech Communication and Technology, Berlin, Germany, Sep. 22-25, 1993.
“Distributional Clustering of English Words,” Fernando Pereira, AT&T Bell Laboratories; Naftali Tishby, Hebrew University; and Lillian Lee, Cornell University. Apr. 25, 1993.
“Automatic Acquisition of Phrase Grammars for Stochastic Language Modeling,” Giuseppe Riccardi and Srinivas Bangalore. AT&T Labs-Research, 180 Park Avenue, Florham Park, NJ 09732. 1998.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Automatic clustering of tokens from a corpus for grammar... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Automatic clustering of tokens from a corpus for grammar..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automatic clustering of tokens from a corpus for grammar... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2666429

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.