Method and system for transforming a source language...

Data processing: speech signal processing – linguistics – language – Linguistics – Translation machine

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S007000, C704S009000, C704S277000

Reexamination Certificate

active

06330530

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to language translation systems. More specifically, the present invention relates to the transformation of a source language linguistic structure into a target language linguistic structure.
BACKGROUND
To With the continuing growth of multinational business dealings where the global economy brings together business people of all nationalities and with the ease and frequency of today's travel between countries, the demand for a machine-aided interpersonal communication system that provides accurate near real-time language translation is a compelling need. This system would relieve users of the need to possess specialized linguistic or translation knowledge.
A typical language translation system functions by using natural language processing. Natural language processing is generally concerned with the attempt to recognize a large pattern or sentence by decomposing it into small sub-patterns according to linguistic rules. A natural language processing system uses considerable knowledge about the structure of the language, including what the words are, how words combine to form sentences, what the words mean, and how word meanings contribute to sentence meanings.
Morphological knowledge concerns how words are constructed from more basic units called morphemes. Syntactic knowledge concerns how words can be put together to form correct sentences and determines what structural role each word plays in the sentence and what phrases are sub-parts of what other phrases. Typical syntactic representations of language are based on the notion of context-free grammars, which represent sentence structure in terms of what phrases are sub-parts of other phrases. This syntactic information is often presented in a tree form. Typically, semantic knowledge concerns what words mean and how these meanings combine in sentences to form sentence meanings. This is the study of context-independent meaning—the meaning a sentence has regardless of the context in which it is used. The representation of the context-independent meaning of a sentence is called its logical form. The logical form encodes possible word senses and identifies the semantic relationships between the words and phrases.
Natural language processing systems further include interpretation processes that map from one representation to the other. For instance, the process that maps a sentence to its syntactic structure and logical form is called parsing, and it is performed by a component called a parser. The parser uses knowledge about word and word meaning, the lexicon, and a set of rules defining the legal structures, the grammar, in order to assign a syntactic structure and a logical form to an input sentence.
Formally, a context-free grammar of a language is a four-tuple comprising nonterminal vocabularies, terminal vocabularies, a finite set of production rules, and a starting symbol for all productions. The nonterminal and terminal vocabularies are disjoint. The set of terminal symbols is called the vocabulary of the language.
A natural language processor receives an input sentence in a source language, lexically separates the words in the sentence, syntactically determines the types of words, semantically understands the words, and creates an output sentence in a target language that contains the content of the input sentence. The natural language processor employs many types of knowledge and stores different types of knowledge in different knowledge structures that separate the knowledge into organized types.
In transferring a linguistic representation of a source language (such as English) to the linguistic representation for a target language (such as Japanese), a significant amount of linguistic knowledge needs to be incorporated in order to achieve a high-quality translation. In a prior method for transferring, an transfer-driven method was developed. In this method, transfer rules that operated at a string, pattern, or semantic grammar rule level. The input sentence was analyzed using the transfer rules, and the rules that developed the best analyses were used to generate the target-language output. In addition, example expressions were used to annotate the transfer rules directly.
In another prior method, a dependency tree representation was used to store examples of the source linguistic structures. During transfer, this method selected a set of example fragments that completely coverts the input. The target-language expression was then constructed from the target-language portions of the selected fragments. The dependency trees created are not detailed enough to account for many natural language expressions. This method also requires exact matches between the input and examples. Because of the variability of natural languages, exact matches are hard to achieve or require extremely large databases of examples.
What is required is a method and system that incorporates the ease and accuracy of the example-based method with the ability to manipulate the transfer rules to allow for a variety of attempts at translation.
SUMMARY OF THE INVENTION
A system and method for transforming input source language linguistic structures (SLS) to target linguistic language linguistic structures (TLS) are described. In one embodiment, the SLS is matched to at least one existing source language example linguistic structure (SEF). The SLS is aligned with the at least one SEF, and the TLS is generated using at least one target language example linguistic structure (TEF) corresponding to the at least one SEF.


REFERENCES:
patent: 5088038 (1992-02-01), Tanaka et al.
patent: 5101349 (1992-03-01), Tokuume et al.
patent: 5426583 (1995-06-01), Uribe-Echebarria Diaz De Mendiil
patent: 5510981 (1996-04-01), Berger et al.
patent: 5528491 (1996-06-01), Kuno et al.
patent: 5535120 (1996-07-01), Chong et al.
patent: 5659765 (1997-08-01), Nii
patent: 5727222 (1998-03-01), Maxwell, III
patent: 5768603 (1998-06-01), Brown et al.
patent: 5819210 (1998-10-01), Maxwell, III et al.
patent: 5864788 (1999-01-01), Kutsumi
patent: 5903860 (1999-05-01), Maxwell, III et al.
patent: 5907821 (1999-05-01), Kaji et al.
patent: 5963894 (1999-10-01), Richardson et al.
patent: 5983169 (1999-11-01), Kozma
patent: 6064953 (2000-05-01), Maxwell, III et al.
patent: 6161083 (2000-12-01), Franz et al.
patent: 6223150 (2001-04-01), Duan et al.
patent: 0805403 A (1997-11-01), None
S. Kurohashi, T. Nakamura, Y. Matsumoto, M. Nagao. Improvements of Japanese Morphological Analyzer JUMAN. In: “Proceedings of the International Workshop on Sharable Natural Language Resources”, p. 22-28, Nara, Japan,1994.
Kenneth W. Church, “A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text”, in Proceedings of the Second Applied Natural Language Processing Conference, Austin, TX, 1988.
Edited by Karen Jensen, George E. Heidorn, Stephen D. Richardson, “Natural Language Processing: The PLNLP Approach”, Kluwer Academic Publishers, 1993, 22 pages.
Stuart M. Shieber, An Introduction to Unification-based Approaches to Grammar, CSLI, 1986, 23 pages.
M. Tomita, T. Mitamura, H. Musha, M. Kee, “The Generalized LR Parser/Compiler Version 8.1: User's Guide”, CMU-CMT-88-MEMO, Apr. 20, 1988, 44 pages.
M. Tomita, K. Knight, “Pseudo-Unification and Full-Unification”, CMU, 1987, 10 pages.
M. Ishii, K. Ohta, H. Saito, “An Efficient Parser Generator for Natural Language”, COLING 1994, 3 pages.
O. Furuse, H. Iida, “An Example-Based Method for Transfer-Driven Machine Translation”, Proceedings of the Conference on Theoretical and Methodological Issues on Machine Translation (TMI-92), 1992, p. 139-150.
P. Resnik, “Using Information Content to Evaluate Semantic Similarity in a Taxonomy”, Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI-95), 1995.
T.C. Bell, J. G. Cleary, I. H. Witten, “Text Compression”, Prentice Hall, 1990, 19 pages.
H. Maruyama, H. Watanabe, “Three Cover Search Algorithm for Example-Based Translation”, in Proceedings of the Fourth International Conference on Theoretical and Methodological Issues i

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and system for transforming a source language... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and system for transforming a source language..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for transforming a source language... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2573416

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.