Optimization of text-based training set selection for...

Data processing: database and file management or data structures – File or database maintenance

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S802000, C704S231000, C704S243000, C704S260000

Reexamination Certificate

active

07831549

ABSTRACT:
A device and a method provide for selection of a database from a corpus using an, optimization function. The method includes defining a size of a database, calculating a distance using a distance function for each pair in a set of pairs, and executing an optimization function using the distance to select each entry saved in the database until the number of saved entries equals the size of the database. Each pair in the set of pairs includes either two entries selected from a corpus or one entry selected from a set of previously selected entries and another entry selected from a set of a remaining portion of the corpus. The distance function may be a Levenshtein distance function or a generalized Levenshtein distance function.

REFERENCES:
patent: 5329608 (1994-07-01), Bocchieri et al.
patent: 5692097 (1997-11-01), Yamada et al.
patent: 5737723 (1998-04-01), Riley et al.
patent: 5754977 (1998-05-01), Gardner et al.
patent: 6044343 (2000-03-01), Cong et al.
patent: 6073099 (2000-06-01), Sabourin et al.
patent: 6810379 (2004-10-01), Vermeulen et al.
patent: 2002/0069053 (2002-06-01), Dobler et al.
patent: 2005/0267755 (2005-12-01), Suontausta
Hermann Ney, “The Use of the One-Stage Dynamic Programming Algorithm for Connected Word Recognition”, IEEE Transaction Acoustics, Speech and Signal Processing, vol. ASSP-32, No. 2, pp. 263 to 271, 1984.
Data-Driven Approaches for Automatic Detection of Syllable Boundaries, Tian, Audio-Visual Systems Laboratory, Finland, 4 pgs.
Optimal Subset Selection from Text Databases, Tian et al., ICASSP 2005, Finland, pp. I-305-308.
N-Gram and Decision Tree Based Language Identification for Written Words, Häkkinen et al., 2002 IEEE, pp. 335-338.
A Learning Model for Multiple-Prototype Classification of Strings, Cárdenas, Proceedings of the 17thInternational Conference on Pattern Recognition (ICPR'04), Spain, 4 pgs.
Speaker- and Language-Independent Speech Recognition in Mobile Communication Systems, Viikki et al., 2001IEEE, pp. 5-8.
Rose and Paul, A Hidden Markov model based keyword recognition system. IEEE, ICASSP Apr. 3, 1990, p. 129-132.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Optimization of text-based training set selection for... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Optimization of text-based training set selection for..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Optimization of text-based training set selection for... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4173958

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.