Machine translation using elastic chunks

Data processing: speech signal processing – linguistics – language – Linguistics – Translation machine

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S277000, C704S005000

Reexamination Certificate

active

07542893

ABSTRACT:
A machine translation method includes receiving source text in a first language and retrieving text fragments in a target language from a library of bi-fragments to generate a target hypothesis. Each bi-fragment includes a text fragment from the first language and a corresponding text fragment from the second language. Some of the bi-fragments are modeled as elastic bi-fragments where a gap between words is able to assume a variable size corresponding to a number of other words to occupy the gap. The target hypothesis is evaluated with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statically less probable gap sizes.

REFERENCES:
patent: 5140522 (1992-08-01), Ito et al.
patent: 5408410 (1995-04-01), Kaji
patent: 5477451 (1995-12-01), Brown et al.
patent: 5608622 (1997-03-01), Church
patent: 5991710 (1999-11-01), Papineni et al.
patent: 6182026 (2001-01-01), Tillmann et al.
patent: 6236958 (2001-05-01), Lange et al.
patent: 6304841 (2001-10-01), Berger et al.
patent: 6338057 (2002-01-01), Weeks
patent: 6473896 (2002-10-01), Hicken et al.
patent: 6542866 (2003-04-01), Jiang et al.
patent: 6678415 (2004-01-01), Popat et al.
patent: 6848080 (2005-01-01), Lee et al.
patent: 6885985 (2005-04-01), Hull
patent: 6996518 (2006-02-01), Jones et al.
patent: 7191115 (2007-03-01), Moore
patent: 7353165 (2008-04-01), Zhou et al.
patent: 2002/0198699 (2002-12-01), Greene et al.
patent: 2004/0024581 (2004-02-01), Koehn et al.
patent: 2004/0030551 (2004-02-01), Marcu et al.
patent: 2005/0108000 (2005-05-01), Kempe et al.
patent: 2005/0228643 (2005-10-01), Munteanu et al.
patent: 2006/0009963 (2006-01-01), Gaussier et al.
patent: 2007/0033001 (2007-02-01), Muslea et al.
U.S. Appl. No. 11/315,043, filed Dec. 22, 2005, Cancedda, et al.
M. Carl, A. Way, “Workshop on Example-Based Machine Translation”, MT-Summit VIII, Santiago de Compostela, Spain, Sep. 18-22, 2001.
M. Simard, N. Cancedda, B. Cavestro, M. Dymetman, E. Gaussier, C. Goutte, K. Yamada, “Translating with Non-Continuous Phrases”, (English Translation of Traduction Automatique Statistique Avec Des Segments Discontinues) 12thAnnual Conference on the Automatic Treatment of Natural Language (TALN), pp. 233-242, Dourdan, France, (Jun. 2005).
P.F.Brown, S.A.D.Pietra, V.J.D.Pietra, R.L.Mercer,The Mathematics of Statistical Machine Translation: Parameter Estimation, Computational Linguistics, 19(2):263-311, 1993.
C.Goutte, K.Yamada, E.Gaussier,Aligning Words Using Matrix Factorisation, Proc. ACL'04, pp. 503-510, (2004).
P.Koehn,Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translations Models, USC Information Science Institute, Dec. 2003.
F.J.Och, H.Ney,A Systematic Comparison of Various Statistical Alignment Models, Computational Linguistics, 29(1):19-51, Mar. 2003.
F.J.Och, H.Ney,The Alignment Template Approach to Statistical Machine Translation, Computational Linguistics, 30(4):417-449, 2004.
C.Tillmann, F.Xia,A Phrase-Based Unigram Model for Statistical Machine Translation, Proc. of the HLT-NAACL 2003 Conference, Edmonton, Canada, 2003.
N.Ueffing, F.J.Och, H.Ney,Generation of Word Graphs in Statistical Machine Translation, Proc. of the 7thConference on Empirical Methods in Natural Language Processing (EMNLP), pp. 156-163, Philadelphia, 2002.
R.Zens, H.Ney,Improvements in Phrase-Based Statistical Machine Translation, Proc. of the Human Language Technology Conference (HLT-NAACL), 2004.
C.Chenon,Vers une meilleure utilisabilite des memoires de traduction fondee sur un alignement sous-phrastique. These de Doctorat, Universite Joseph Fourier, Oct. 2005 (Abstract).
D.Chiang,A Heirarchical Phrase-Based Model For Statistical Machine Translation, Proc. 43rdMeeting of the Association for Computational Linguistics (ACL), Ann Harbor, Michigan, Jun. 2005.
B.Efron, C.Morris,Stein's Paradox In Statistics, Scientific American, 236, 119-127, 1977.
U.S. Appl. No. 11/431,393, filed May 10, 2006, Cancedda, et al.
U.S. Appl. No. 11/789,089, filed Apr. 20, 2007, Pacull.
Brown, Example-Based Machine Translation in the Pangloss System, InProc. of the 16thIntl. Conf. on Computational Linguistics(COLING-96), pp. 169-174, Denmark, Aug. 5-9, 1996.
http://en.wikipedia.org/wiki/Poisson—distribution, downloaded Mar. 22, 2006.
A.Crabtree, J.O'Neill, P.Tolmie, S.Castellani, T.Colombino, A.Grasso, The Practical Indispensability of Articulation Work to Immediate and Remote Help-Giving,Proc. of the 2006 ACM Conference on Computer Supported Cooperative Work, Banff: ACM Press, 2006.
A.Kempe, Acronym-Meaning Extraction from Corpora Using Multitape Weighted Finite-State Machines,Xerox Corporation, Research Report 2006/019, Jul. 26, 2006.
A.Kempe, F.Guingne, F.Nicart, Algorithms for Weighted Multi-Tape Automata,Xerox Corporation, Research Report 2004/031, Meylan, France, 2004.
A.Kempe, J.M.Champarnaud, J.Eisner, A Note on Join and Auto-Intersection of n-ary Rational Relations, In B. Watson and L. Cleophas, editors,Proceedings Eindhoven FASTAR Days, No. 04-40 in TU/e CS TR, pp. 64-78, Eindhoven, Netherlands, 2004.
A.Kempe, J.M.Champarnaud, J.Eisner, F.Guingne, F.Nicart, A Class of Rational n-WFSM Auto-Intersections,In FSMNLP, Helsinki, Finland, 2005.
A.P.Dempster, et al., Maximum Likelihood from Incomplete Data Using the EM Algorithm,Journal of the Royal Statistical Society, Series B(Methodological), vol. 39, No. 1, pp. 1-38, 1977.
D.Yarowsky, G.Ngai, R.Wicentowski, Inducing Multilingual Text Analysis Tools Via Robust Projection Across Aligned Corpora,In Proceedings of HLT, San Diego, CA, 2001.
É.Gaussier, Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora,In Proceedings of the 36thAnnual Meeting of the Association for Computational Linguistics and 17thIntl Conference on Computational Linguistics, pp. 444-450, San Francisco, CA, 1998.
E.Hovy, et al., Principles of Context-Based Machine Translation Evaluation,Machine Translation, 17:43-75, Netherlands, 2002.
F.J.Och, C.Tillman, H.Ney, Improved Alignment Models for Statistical Machine Translation,In Proceedings of EMNLP/VLC 1999, pp. 20-28, University of Maryland, College Park, MD, 1999.
F.J.Och, et al., A Comparison of Alignment Models for Statistical Machine Translation,In Proc. of the 18thIntl. Conf. on Computational Linguistics, vol. 2, pp. 1086-1090, Morristown, NJ, 2000.
H.Dejean, E.Gaussier, J.M.Renders, F.Sadat, Automatic Processing of Multilingual Medical Terminology: Applications to Thesaurus Enrichment and Cross-Language Information Retrieval,Artificial Intelligence in Medicine, vol. 33, No. 2, pp. 111-124, 2005.
H.Dejean, F.Sadat, E.Gaussier, Bilingual Lexicon Extraction: Using and Enriching Multilingual Thesauri,Proceedings of Terminology Knowledge Extraction, Nancy, France, Aug. 2002.
I.Dagan, K.Church, Termight: Coordinating Man and Machine in Bilingual Terminology Acquisition,Journal Tex, Jul. 24, 1996.
J.Martin, R.Mihalcea, T.Pedersen, Word Alignment for Languages with Scarce Resources,In Proceedings of ACL Workshop on Building and Using Parallel Text, Ann Arbor, MI, 2005.
Lodhi, et al., Text Classification Using String Kernels,Advances in Neural Information Processing Systems, 13 (pp. 563-569) MIT Press (2001).
M.Simard, P.Plamondon, Bilingual Sentence Alignment: Balancing Robustness and Accuracy,Machine Translation, vol. 13, No. 11, Mar. 1998.
N.Aswani, R.Gaizauskas, A Hybrid Approach to Align Sentences and Words in English-Hindi Parallel Corpora,In ACL Workshop on Building and Using Parallel Text, 2005.
K.Papineni, et al., Bleu: A Method for Automatic Evaluation of Machine Translation,Proc. of the 40thAnnual Meeting of the Assn. for Computational Linguistics(ACL), Philadelphia, pp. 311-318, Jul. 2002.
P.Koehn, F.J.Och, D.Marcu, Statistical Phrase-Based Translation,In Proceedings of HLT-NAACL, Edmonton, Canada, 2003.
R.C.Moore, A Discriminative Framework for Bilingual Word Alignment,Proceedin

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Machine translation using elastic chunks does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Machine translation using elastic chunks, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Machine translation using elastic chunks will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4055289

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.