Data processing: speech signal processing – linguistics – language – Linguistics – Translation machine
Reexamination Certificate
2004-11-01
2009-11-17
Hudspeth, David R (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Translation machine
C704S004000, C704S005000, C704S008000, C704S009000, C704S010000, C715S256000, C715S259000, C715S264000, C707S793000
Reexamination Certificate
active
07620539
ABSTRACT:
Various methods formulated using a geometric interpretation for identifying bilingual pairs in comparable corpora using a bilingual dictionary are disclosed. The methods may be used separately or in combination to compute the similarity between bilingual pairs.
REFERENCES:
patent: 5325298 (1994-06-01), Gallant
patent: 5907821 (1999-05-01), Kaji et al.
patent: 6006221 (1999-12-01), Liddy et al.
patent: 6236958 (2001-05-01), Lange et al.
patent: 6523026 (2003-02-01), Gillis
patent: 6611825 (2003-08-01), Billheimer et al.
patent: 6694311 (2004-02-01), Smith
patent: 6876998 (2005-04-01), Evans
patent: 6885985 (2005-04-01), Hull
patent: 6990628 (2006-01-01), Palmer et al.
patent: 7047182 (2006-05-01), Masuichi
patent: 7054803 (2006-05-01), Eisele
patent: 7124073 (2006-10-01), Tokuda et al.
patent: 7249012 (2007-07-01), Moore
patent: 2002/0123982 (2002-09-01), Masuichi
patent: 2002/0128821 (2002-09-01), Ehsani et al.
patent: 2003/0115188 (2003-06-01), Srinivasa et al.
patent: 2005/0021323 (2005-01-01), Li
T. S. Jaakkola and D. Haussler. Exploiting generative models in discriminative classifiers. Preprint, Dept. of Computer Science, Univ. of California, available from http://www.cse.ucsc.edu/˜haussler/pubs.html, 1998. http://citeseer.ist.psu.edu/jaakkola98exploiting.html.
Tiedeman, Jorg. Automatic Lexicon Extraction from Aligned Bilingual Corpora. Thesis. Otto-von-Guericke-Univesittat Magdeburg, Sep. 1997.
Dumais, S.; Letsche, T.; Littman, M. L.; and Lan-dauer, T. 1997. Automatic cross-language retrievalusing latent semantic indexing. In AAAI Symposiumon Cross-Language Text and Speech Retrieval. Amer-ican Association for Articial Intelligence.
Jiang, F. and Littman, M. L. 2000. Approximate Dimension Equalization in Vector-based Information Retrieval. In Proceedings of the Seventeenth international Conference on Machine Learning (Jun. 29-Jul. 2, 2000). P. Langley, Ed. Morgan Kaufmann Publishers, San Francisco, CA, 423-430.
Chiao, Y. and Zweigenbaum, P. 2002. Looking for candidate translational equivalents in specialized, comparable corpora. In Proceedings of the 19th international Conference on Computational Linguistics—vol. 2 (Taipei, Taiwan, Aug. 24-Sep. 1, 2002). International Conference on Computational Linguistics. Association for Computational Lin.
Déjean, H., É Gaussier and F. Sadat, “Bilingual Terminology Extraction: An Approach based on a Multilingual thesaurus Applicable to Comparable Corpora,” Proceedings of the 19th International Conference on Computational Linguistics COLING 2002, Taipei, Taiwan, pp. 218-224.
Probst, K. 2003. Using ‘smart’ bilingual projection to feature-tag a monolingual dictionary. In Proceedings of the Seventh Conference on Natural Language Learning At HLT-NAACL 2003—vol. 4 (Edmonton, Canada). Human Language Technology Conference. Association for Computational Linguistics, Morristown, NJ, 103-110. DOI= http://dx.doi.org/10.3115/1.
Sahlgren, M. and Karlgren, J. 2005. Automatic bilingual lexicon acquisition using random indexing of parallel corpora. Nat. Lang. Eng. 11, 3 (Sep. 2005), 327-341. DOI= http://dx.doi.org/10.1017/S1351324905003876.
Daille, Beatrice et al. “French-English Terminology Extraction from Comparable Corpora”. Lecture Notes in Computer Science. vol. 3651/2005, Springer 2005.
Sadat, F., Yoshikawa, M., and Uemura, S. 2003. Learning bilingual translations from comparable corpora to cross-language information retrieval: hybrid statistics-based and linguistics-based approach. In Proceedings of the Sixth international Workshop on information Retrieval with Asian Languages, vol. 11 (Sapporo, Japan, Jul. 7-7, 2003), 57-64.
Bach and Jordan, “Kernel independent component analysis” in Journal of Machine Learning Research, vol. 3, pp. 1-48, 2002.
Besancon, Rajman and Chappelier, “Textual similarities based on distributional approach”, Proceeding of the Tenth International Workshop on Database and Expert Systems Applications, DEX'99, Florence, Italy, 1999.
Deerwester, Dumais, Furnas, Landauer, and Harshman, “Indexing by latent semantic analysis”, Journal of the American Society for Information Science, 41 (6), pp. 391-407, 1990.
Dejean, Gaussier and Sadat, “An approach based on multilingual thesauri and model combination for bilingual lexicon extraction”, In International Conference on Computational Linguistics, COLING 2002.
Dempster, Laird and Rubin, “Maximum likehood from incomplete data via the EM algorithm”, Journal of the Royal Statistical Society, Series B, 39(1), pp. 1-38, 1977.
Diab, Finch, “A statistical word-level translation model for comparable corpora”, Proceeding of the Conference on Content-Based Multimedia Information Access, 2000.
Dunning, T, “Accurate methods for the statistics of surprise and coincidence” Computational Linguistics, 19(1), pp. 61-74, 1993.
Fung, “Statistical View on Bilingual Lexicon Extraction: from Parallel Corpora to Non-parallel Corpora”, Lecture Notes in Artificial Intelligence, Springer Publisher, vol. 1529, pp. 1-17, 1998.
Gaussier, E., Goutte, C., Popat, K. and Chen, F, “A hierarchical model for clustering and categorizing documents”, ECIR, Glasgow, pp. 229-247, 2002.
Gaussier, Eric, Jean-Michel Renders, Irina Matveeva, Cyril Goutte, Hervé Déjean, “A Geometric view on bilingual lexicon extraction from comparable corpora”, 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, Jul. 25-26, 2004.
Grefenstette, Chapter 5.3 Word Meaning Clustering, in “Explorations in Automatic Thesaurus Discovery”, Kluwer Academic Publishers, pp. 126-131, 1994.
Hofmann, “Probabilistic latent semantic analysis” Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, pp. 289-296, 1999.
Hofmann, “Learning the similarity of documents: An information-geometric approach to document retrieval and categorization”, Research and Development in Information Retrieval, pp. 369-371, 2000.
Jaakkola and Haussler, “Exploiting generative models in discriminative classifiers”, Advances in Neural Information Processing Systems, vol. 11, pp. 487-493, 1999.
Koehn and Knight, “Learning a translation lexicon from monolingual corpora”, ACL Workshop on Unsupervised Lexical Acquisition, 2002.
Lewis, Baxendale and Bennet, “Statistical discrimination of the synonym/antonym relationship between words”, Journal of the ACM, 14(1), pp. 20-44, Jan. 1967.
Peters, Picchi, “Capturing the comparable: A system for querying comparable text corpora”, JADT—3rdInternational Conference on Statistical Analysis of Textual Data, pp. 255-262, 1995.
Rapp, “Identifying word translations in nonparallel texts”, Proceedings of the Annual Meeting of the Association for Computational Linguistics, 1995.
Rapp, “Automatic Identification of word translations from unrelated English and German corpora”, ACL Proceedings, 1999.
Renders, Dejean and Gaussier, “Assessing Automatically Extracted Bilingual Lexicons for CLIR in Vertical Domains: XRCE participation to the GIRT track of CLEF 2002”, M.Braschler, J.Gonzalo and M.Kluck (Eds), Lecture Notes in Computer Science, Springer Verlag, 2003.
Shahzad, Ohtake, Masuyama and Yamamoto, “Identifying translations of compound nouns using non-aligned corpora”, Proceedings of the Workshop MAL, pp. 108-113, 1999.
Tanaka and Iwasaki, “Extraction of lexical translations form non-aligned corpora”, International Conference on Computational Linguistics, COLING 1996.
Ueda and Nakano, “Deterministic annealing variant of the EM algorithm”, Advances in Neural Information Processing Systems 7, pp. 545-552, 1995.
Vinokourov, Shawe-Taylor and Cristianini, “Finding language-independent semantic representation of text using kernel canonical correlation analysis”, Advances in Neural Informa
Dejean Herve
Gaussier Eric
Goutte Cyril
Matveeva Irina
Renders Jean-Michel
Fay Sharpe LLP
Hudspeth David R
Shah Paras
Xerox Corporation
LandOfFree
Methods and apparatuses for identifying bilingual lexicons... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Methods and apparatuses for identifying bilingual lexicons..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Methods and apparatuses for identifying bilingual lexicons... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4087702