Machine translation using learned word associations without...

Data processing: speech signal processing – linguistics – language – Linguistics – Translation machine

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

07356457

ABSTRACT:
A method and computer-readable medium are provided that perform a series of steps associated with machine translation. These steps include using a first text in a first language and a second text in a second language, to produce an association list where words in the first language are associated with words in the second language. A first syntactic structure for a sentence from the first text is aligned with a second syntactic structure for a sentence in the second text based on the association list without referring to a bilingual dictionary of content words. The association list is also used during translations. Specifically, a word in the first language is translated into a word in the second language based on an entry in the association list without referring to a bilingual dictionary that contains content words. Thus, training and translation are performed without the need for a bilingual dictionary of content words.

REFERENCES:
patent: 4800522 (1989-01-01), Miyao et al.
patent: 4914590 (1990-04-01), Loatman et al.
patent: 5128865 (1992-07-01), Sadler
patent: 5267156 (1993-11-01), Nomiyama
patent: 5386556 (1995-01-01), Hedin et al.
patent: 5418717 (1995-05-01), Su et al.
patent: 5477451 (1995-12-01), Brown et al.
patent: 5510981 (1996-04-01), Berger et al.
patent: 5541836 (1996-07-01), Church et al.
patent: 5850561 (1998-12-01), Church et al.
patent: 5867811 (1999-02-01), O'Donoghue
patent: 5907821 (1999-05-01), Kaji et al.
patent: 6041293 (2000-03-01), Shibata et al.
patent: 6092034 (2000-07-01), McCarley et al.
patent: 6236958 (2001-05-01), Lange et al.
patent: 6278967 (2001-08-01), Akers et al.
patent: 6721967 (2004-04-01), Braxton
patent: 6804637 (2004-10-01), Tokuda et al.
patent: 6885985 (2005-04-01), Hull
patent: 7020601 (2006-03-01), Hummel et al.
patent: 7107204 (2006-09-01), Liu et al.
patent: 2002/0107683 (2002-08-01), Eisele
patent: 2002/0123877 (2002-09-01), Xun
patent: 2002/0198701 (2002-12-01), Moore
patent: 2003/0023422 (2003-01-01), Menezes et al.
patent: 2003/0023423 (2003-01-01), Yamada et al.
patent: 2003/0061023 (2003-03-01), Menezes et al.
patent: 2003/0204400 (2003-10-01), Marcu et al.
patent: 2004/0098247 (2004-05-01), Moore
patent: 2004/0172235 (2004-09-01), Pinkham
H. Ney, F.J. Och, S. Vogel, ‘The RWTH System for Statistical Translation of Spoken Dialogues’, Proceedings of the First International Conference on Human Language Technology Research, San Diego, Mar. 18-21, 2001.
P. Charoenpornsawat, V. Sornlertlamvanich, T. Charoenporn, ‘Improving Translation Quality of Rule-based Machine Translation’, COLING-02 on Machine translation in Asia—vol. 16, Taipei, Taiwan, Aug. 24-Sep. 1, 2002.
Martin Kay and Martin Roscheisen, Text-Translation Alignment, Association for Computational Linguistics, 1993.
U.S. Appl. No. 10/300,309, filed Nov. 20, 2002, Moore.
U.S. Appl. No. 10/173,252, filed Jun. 17, 2002, Moore.
I. Dan Melamed,, “Automatic Construction of Clean Broad-Coverage Translation Lexicons,” 2ndConference of the Association for Machine Translation in the Americas (AMTA '96), 10 pages (1996).
I. Dan Malamed, “Automatic Discovery of Non-Compositional Compounds in Parallel Data,” 2ndConference on Empirical Methods in Natural Language Processing (EMNLP '97), 12 pages (1997).
K. Yamamoto et al., “A Comparative Study on Translation Units for Bilingual Lexicon Extraction,” In Proceedings of the Workshops on Data-Driven Machine Translation, 39thAnnual Meeting of the Association for Computational Linguistics, pp. 87-94 (2001).
F. Smadja et al., “Translating Collocations for Bilingual Lexicons: A Statistical Approach,” Computational Linguistics, 22(1): 1-38 (1996).
J. Kupiec, “An Algorithm for Finding Noun Phrase Correspondences in Bilingual Corpora,” In Proceedings of the 31stAnnual Meeting of the Association for Computational Linguistics, pp. 17-22 (1993).
Y. Al-Onaizan and K. Knight. 2002. Named entity translation: extended abstract. In advance papers of Human Language Technology 2002, San Diego, CA. pp. 111-115.
N. Chinchor. 1997. MUC-7 named entity task definition. In Proceedings of the 7thMessage Understanding Conference.
I. Dagan and K. Church. 1997. Termight: coordinating humans and machines in bilingual terminology acquisition. Machine Translation, 12:89-107.
T. Dunning. 1993. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1):61-74.
J. Kupiec. 1993. An algorithm for finding noun phrase correspondences in bilingual corpora. In Proceedings of the 31stAnnual Meeting of the Association for Computational Linguistics. Columbus Ohio, pp. 17-22.
I.D. Melamed. 2000. Models of Translational Equivalence. Computational Linguistics, 26(2):221-249.
R. C. Moore. 2001. Towards a simple and accurate statistical approach to learning translation relationships among words. In Proceedings of the Workshop on Data-Driven Machine Translation, 39thAnnual Meeting Association for Computational Linguistics, Toulouse, France, pp. 79-86.
S. Richardson, W.B. Dolan, M. Corston-Oliver and A. Menezes. 2001. Overcoming the customization bottleneck using example-based MT. In Proceedings of the Workshop on Data-Driven Machine Translation, 39thAnnual Meeting of the Association for Computational Linguistics, Toulouse, France, pp. 9-16.
D. Wu. 1995. Grammarless extraction of phrasal translation examples from parallel texts. In Proceedings of TMI-95, Sixth International Conference on Theoretical and Methodological Issues in Machine Translation, Leuven, Belgium, vol. 2, pp. 354-372.
I. D. Melamed. 1995. Automatic evaluation and uniform filter cascades for inducing N-Best translation lexicons. In proceedings of the Third Workshop on Very Large Corpora, pp. 184-198, Cambridge, MA.
A. Kumano and H. Hirakawa. 1994. Building an MT dictionary from parallel texts based on liguistics and statistical information. In Proceedings of the 15thInternational Conference on Computational Linguistics, pp. 76-81, Kyoto, Japan.
W. Gale and K. Church. 1991. Identifying word correspondences in parallel texts. In Proceedings Speech and Natural Language Workshops, pp. 152-157, Asilomar, CA. DARPA.
P. Fung. 1995. A pattern matching method for finding noun and proper noun translations from noisy parallel corpora. In Proceedings of the 33rdAnnual Meeting, pp. 236-243, Boston, MA. Association for Computational Linguistics.
D. Wu and X. Xia. 1994. Learning an English-Chinese lexicon from parallel corpus. In Proceedings of the 1stConference of the Association for Machine Translation in the Americas, pp. 206-213, Columbia, MD.
Lars Ahrenberg et al., “A Simply Hybrid Aligner for Generating Lexical Correspondence in Parallel Texts,” Proccedings of COLING '98/ACL '98.
Mihoko Kitamura et al., “Automatic Extraction of Word Sequence Correspondences in Parallel Corpora,” Proceedings of the Fourth Annual Workshop on Very Large Corpora, pp. 79-87, 1996.
I. Dan Malamed, “A Word-to-Word Model of Translational Equivalence,” Proceedings of the 35thConference of the Association for Computational Linguistics, pp. 490-497, 1997.
Pascale Fung et al., “K-Vec: A New Approach for Aligning Parallel Texts,” Proceedings from the 15thInternational Conference on Computational Linguistics, pp. 1096-1102, 1994.
Robert C. Moore, “Learning Translations of Named-Entity Phrases from Parallel Corpora,” Microsoft Research.
Dekai Wu, “Aligning a Parallel English-Chinese Corpus Statistically with Lexical Criteria,” Proceedings of the 32ndAnnual Meeting on Association for Computational Linguistics, pp. 80-87, 1994.
Stanley F. Chen, “Aligning Sentences in Bilingual Corpora Using Lexical Information,” Proceedings of the 31stAnnual Meeting on Association for Computational Linguistics, pp. 9-16, 1993.
William A. Gale, et al., “A Program for Aligning Sentences in Bilingual Corpora,” 1993, Association for Computational Linguistics.
Peter F. Brown et al., “Aligning Sentences in Parallel Corpora,&#x

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Machine translation using learned word associations without... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Machine translation using learned word associations without..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Machine translation using learned word associations without... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2777707

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.