Reexamination Certificate
2004-03-03
2008-09-09
Hudspeth, David R. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Multilingual or national language support
C704S009000, C707S793000
active
07424421
ABSTRACT:
A method, computer readable medium and system are provided which collect new words for addition to a lexicon for an agglutinative language. In the method, a log of queries submitted to a search engine is obtained. The log of queries is sorted to obtain sorted queries. The sorted queries are then filtered using a plurality of heuristic criteria to obtain a candidate list of new words. Words from the candidate list of new words are then added to a lexicon.
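The abstract's pipeline (obtain a query log, sort it, filter with several heuristics, add the survivors to the lexicon) can be sketched roughly as follows. This is a minimal illustration, not the patented method: the function name `collect_new_words`, the thresholds `min_count` and `max_len`, and the four specific filters are all assumptions, since the abstract does not say which heuristic criteria are used.

```python
from collections import Counter


def collect_new_words(query_log, lexicon, min_count=2, max_len=8):
    """Return candidate new words from a query log and add them to `lexicon`.

    The thresholds and the specific filters are hypothetical examples.
    """
    # Sort step: count identical queries and order them by frequency,
    # breaking ties alphabetically so the result is deterministic.
    counts = Counter(q.strip() for q in query_log if q.strip())
    sorted_queries = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))

    # Filter step: apply several heuristic criteria (all assumed here).
    candidates = []
    for query, count in sorted_queries:
        if count < min_count:      # too rare to trust
            continue
        if len(query) > max_len:   # too long to be a single word
            continue
        if any(ch.isspace() or ch.isdigit() for ch in query):
            continue               # multi-token or numeric query
        if query in lexicon:       # already known
            continue
        candidates.append(query)

    # Add step: extend the lexicon with the surviving candidates.
    lexicon.update(candidates)
    return candidates


if __name__ == "__main__":
    log = ["tokyo", "tokyo", "keitai", "keitai", "keitai",
           "tokyo tenki", "abc123", "x"]
    known = {"tokyo"}
    print(collect_new_words(log, known))
```

Sorting by frequency first means the filters see the most common queries first, so the candidate list comes out ordered by how often users actually typed each word.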
REFERENCES:
patent: 5029084 (1991-07-01), Morohasi et al.
patent: 5946648 (1999-08-01), Halstead et al.
patent: 6035268 (2000-03-01), Carus et al.
patent: 6374210 (2002-04-01), Chu
patent: 6879951 (2005-04-01), Kuo
patent: 7113950 (2006-09-01), Brill et al.
Langer, “Reverse Queries DATR”, University of Osnabruck, Germany, pp. 1-7, Nov. 17, 1994.
Davis et al., “Linking as Constraints on Word Classes in a Hierarchical Lexicon”, pp. 1-45, Jul. 6, 1999.
Melnik et al., “Building a Distributed Full-Text Index for the Web”, ACM Transactions on Information Systems (TOIS), vol. 19, No. 3, pp. 217-241, 2001.
Agichtein et al., “Learning Search Engine Specific Query Transformations for Question Answering”, Proceedings of the Tenth International World Wide Web Conference, WWW10, May 1-5, 2001.
Nagarajarao et al., “An Inverted Index Implementation Supporting Efficient Querying and Incremental Indexing”, pp. 1-9, May 6, 2002.
Hodge et al., “An Integrated Neural IR System”, ESANN'2001 Proceedings, ISBN 2-930307-01-03, pp. 265-270, Apr. 25-27, 2001.
Technical Note TE25, “How to Construct Word-Break Tables”, pp. 1-4, Nov. 1, 1987.
Siivola et al., “Unlimited Vocabulary Speech Recognition Based on Morphs Discovered in an Unsupervised Manner”, Eurospeech 2003-Geneva, pp. 2293-2296.
Albertalli Brian L.
Hudspeth David R.
Microsoft Corporation
Veldhuis-Kroeze John D.
Westman Champlin & Kelly P.A.
Word collection method and system for use in word-breaking