Extracting translations from Transit XV reference material

Despite Transit XV is quite obsolete tool, it is still used by numerous (mostly) German companies. Not all of my colleagues, whom I give some jobs, are working with Transit, so from time to time I am facing a need to convert Transit project into SDL Trados 2007 or SDL Trados Studio. Here is how to proceed to create a new translation memory containing translations from Transit reference material.

Assume we have a Transit NXT project (German-Slovak pair) imported into the Transit XV with NXT plugin. Create a folder on your disk and copy language files from the Transit project folder. In our example, we would copy files file.DEU.NXT and file.SKY.NEXT. Rename these files to DEU.txt and SKY.txt.

1. Using Unicode editor, such as Notepad+, open both files and remove header section, i.e. everything at the beginning of the file ending  with <BODY> (including this tag), and tags </Body></Transit> at the end. Save these files.

2. Open updated TXT files in Word (Unicode encoding).

3. Using Search and Replace tool in Word, replace </Seg> with empty string (to remove this end tag).

4. Now we need to remove text between <Seg and “>.
Check the Regular expression search checkbox.
Search for: \<Seg*\”\>
And replace all with <TAG>.

5. Now we simplify the most common tags in files to make the text easier to read.
We will be working with these tags:
<Tab> </Tab>, <Tag </Tag>, <FontTag </FontTag>, <NL </NL> (new line), <WS />
Search for: \<Tab*Tab\> and replace all with <TAG>.
Search for: \<Tag*Tag\> and replace all with <TAG>.
Search for: \<FontTag*FontTag\> and replace all with <TAG>.
Search for: \<NL*NL\> and replace will with <EOL>.
Search for: \<WS*\/\> and replace with an empty string.
Do this for both TXT files.

6. Save the files. Now we have ?clean? source and target texts exported from the reference material.

7. Open TXT files in Excel and create a bilingual worksheet ? the first column should contain the source language content, and second one the target language content.

8. Export the bilingual table from Excel as Unicode TXT file.

9. Convert this TXT file into TMX. Follow instructions in this article.

10. Create a new translation memory in Translator?s Workbench or SDL Trados Studio.

This entry was posted in Computer-aided Translation, Software-related. Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *