System, method, and product for dynamically aligning...

Data processing: speech signal processing – linguistics – language – Linguistics – Translation machine

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S007000

Reexamination Certificate

active

06345244

ABSTRACT:

RELATED APPLICATION
The following application is related to the present application.
U.S. patent application entitled “SYSTEM, METHOD, AND PRODUCT FOR DYNAMICALLY PROPAGATING TRANSLATIONS IN A TRANSLATION-MEMORY SYSTEM,” Ser. No. 09/085,468, naming as inventor Jonathan Clark, assigned to the assignee of the present invention and filed concurrently herewith.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates generally to language translation and, more particularly, to translation-memory tools.
2. Related Art
Technology-driven industries have increasingly relied upon translation, localization and related services to bring products to the global markets. The need to quickly and efficiently create foreign language versions of products has increased dramatically as global competition increases, upgrades are developed and released more frequently, and the time in which products become obsolete decreases.
Historically, the expertise of language translators, engineers and publishers have been utilized to translate a document from a source to a target language. More recently, advances in computer software and hardware have enabled the growth of processor-based language translation tools. Traditionally, two types of language translation tools generally have been available: machine-translation tools and translation- memory tools (also referred to herein as translation memory systems).
Generally, machine translation tools use natural language translation techniques to perform language translation; accordingly, they are also referred to as natural language translation tools. Machine-translation tools perform in-depth morphological, grammatical, syntactical and some semantical analysis of text written in a source language. The machine translation tool then attempts to parse the source language into a target language using extensive glossaries and a complex set of linguistic rules. However, despite the many types of machine-translation tools that so far have been developed, there are a number of limitations that have prevented machine-translation tools from being fully successful.
First, machine-translation systems are expensive to set up, operate and maintain. Furthermore machine translation typically performs below publication-grade translation, even when operating under optimal conditions. As a result, machine-translation has been proven to be effective only when used to translate very controlled input text. However, this is time consuming and expensive since providing such controlled input generally requires careful planning.
Translation memory tools are software programs that recycle existing translations provided by a human translator-operator. Conventional translation memory tools generally utilize well-known text search and replace methodologies to perform language translation. For each file in a group of files, referred to herein as a project, translation memory tools contain a database of text strings that are to be translated. The user-operator searches for a particular string throughout a text file and, for each occurrence, replaces the found string with a translated text. Generally, the translation memory tools are utilized when the input files include text having substantial duplication of text strings, such as in technical texts, or when upgrades are performed.
Although translation memory tools overcome the computational burdens of machine translation tools, there are a number of problems with translation memory tools that compromise their effectiveness in today's rapidly changing global markets. One such drawback to conventional translation memory tools is that typically they are completely manual; that is, the operator must provide all of the target language translations. Unfortunately, the time involved in providing such translations is extensive, making it difficult to translate a document efficiently and cost-effectively. To reduce this burden, some conventional translation memory tools provide techniques to address multiple occurrences of a given text string in the file being translated. However, these systems still require the operator to manually address each occurrence of the text string. In addition, the operator must perform the same functions in each of the files in a project.
Another drawback to conventional translation memory tools is that the integrity of the translation is dependent upon each operator entry. This drawback makes such systems sensitive to inconsistent translations provided by the same user-operator over time as well as by different translator-operators. Furthermore, this drawback often yields a translated text which is either incorrect, misleading or at least inconsistent with itself.
What is needed, therefore, is a translation memory tool that accurately translates text quickly and efficiently and is not sensitive to variations in the source language or to different operators.
SUMMARY OF THE INVENTION
To overcome these and other drawbacks of conventional language translation systems, the present invention, in one embodiment, is an aligner for a translation memory system. The aligner associates words, phrases, or other characters to be translated (referred to herein as “translatable source segments”) with previous translations of such words, phrases, or other characters (referred to herein as “corresponding target segments”), if a previous translation exists. The translatable source segments, and their identifying attributes (such as, for example, formatting information) are extracted from one or more source files. The corresponding target segments, and their identifying attributes, are extracted from one or more target files. If a previous translation does not exist for a translatable source segment, the aligner copies the translatable source segment and its attributes to create a corresponding target segment and its attributes.
The aligner associates each extracted translatable source segment with a corresponding target segment based upon commonality of attributes of the segments. In one implementation, the aligner also associates the source and target segments based upon their relative locations in their respective files.
In one embodiment, the aligner stores each translatable source segment and its attributes with its corresponding target segment in a source-target pair record of a source-target pair database. Each source-target pair record may include a propagation flag identifying whether the corresponding target segment stored in the source-target pair record is to be propagated to other occurrences of the associated translatable source segment in the source-target pair database. Each source-target pair record may also include a pointer to a page of an occurrence book having pages, wherein each page includes pointers to a common translatable source-segment in records of the source-target pair database.
In one embodiment, the aligner assigns a unique identifier to each translatable source segment. The aligner generates such unique identifier based on the attributes of the translatable source segment. The aligner may also generate such unique identifier based on the location of each translatable source segment in its source file. The aligner may associate each translatable source segment with one corresponding target segment by matching their unique identifiers.
The aligner may include a project identifier that selects the source and target files from files in a file system. The file system may be local, or it may be remote. In one embodiment, the project identifier identifies legacy files, if any, associated with one or more of the source and target files. Generally, legacy files refer to previous translations of source files including translatable source segments. In one implementation, legacy files refer to a source file and a corresponding target file wherein one or more translatable source segments have been translated and stored in a corresponding target segment.
In one embodiment, the aligner includes a parser-extractor that extracts each translatable source segment and its attributes from a sour

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System, method, and product for dynamically aligning... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System, method, and product for dynamically aligning..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System, method, and product for dynamically aligning... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2941532

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.