Word collection method and system for use in word-breaking

Data processing: speech signal processing – linguistics – language – Linguistics – Multilingual or national language support

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S009000, C707S793000

Reexamination Certificate

active

07424421

ABSTRACT:
A method, computer readable medium and system are provided which collect new words for addition to a lexicon for an agglutinative language. In the method, a log of queries submitted to a search engine is obtained. The log of queries is sorted to obtain sorted queries. The sorted queries are then filtered using a plurality of heuristic criteria to obtain a candidate list of new words. Words from the candidate list of new words are then added to a lexicon.

REFERENCES:
patent: 5029084 (1991-07-01), Morohasi et al.
patent: 5946648 (1999-08-01), Halstead et al.
patent: 6035268 (2000-03-01), Carus et al.
patent: 6374210 (2002-04-01), Chu
patent: 6879951 (2005-04-01), Kuo
patent: 7113950 (2006-09-01), Brill et al.
Langer, “Reverse Queries DATR”, University of Osnabruck, Germany, pp. 1-7, Nov. 17, 1994.
Davis et al, “Linking as Constraints on Word Classes in a Hierarchical Lexicon”, pp. 1-45, Jul. 6, 1999.
Melnik et al., “Building a Distributed Full-Text Index for the Web”, ACM Transactions on Information Systems (TOIS), vol. 19, No. 3, pp. 217-241, 2001.
Agichtein et al., “Learning Search Engine Specific Query Transformations for Question Answering”, Proceedings of the Tenth International World Wide Web Conference, WWW10, May 1-5, 2001.
Nagarajarao et al., “An Inverted Index Implementation Supporting Efficient Querying and Incremental Indexing”, pp. 1-9, May 6, 2002.
Hodge et al., “An Integrated Neural IR System”, ESANN'2001 Proceedings, ISBN 2-930307-01-03, pp. 265-270, Apr. 25-27, 2001.
Technical Note TE25, “How to Construct Word-Break Tables”, pp. 1-4, Nov. 1, 1987.
Siivola, et al., “Unlimited Vocabulary Speech Recognition Based on Morphs Discovered in an Unsupervised Manner”, Eurospeech 2003-Geneva, pp. 2293-2296.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Word collection method and system for use in word-breaking does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Word collection method and system for use in word-breaking, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Word collection method and system for use in word-breaking will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3969955

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.