GarboktransSpain Local time: 03:56Member (2003) French to English + ...

Nov 11, 2016

I need to split a 50k document to be translated by several translators. I am working from a new TM in Trados Studio 2015. The analysis shows 12200 repetitions. Is there a way of knowing where these repetitions are in the document so as not to assign repeated segments? I mean, is there a way of clicking on a segment and Trados "telling" you that, for example segment 245 is a repetition of segment 552.

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

Yes, you can use the Display Filter for this. Use the predefined filter "Repetitions - All" and you will then see only the repetitions. You may then for example lock all these segments to prevent all translators from translating them differently (preferably translate them before you lock them).

Walter

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

Walter Blaser wrote:
Yes, you can use the Display Filter for this. Use the predefined filter "Repetitions - All" and you will then see only the repetitions.

No, the "All" setting displays all segments, not all repetitions. There is no built-in filter to show all repetitions (and you can't run multiple filters simultaneously AFAIK).

The Display Filter is on the Review tab, by the way. Don't ask why. It just is.

[Edited at 2016-11-12 12:53 GMT]

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

Thomas PfannUnited Kingdom Local time: 02:56Member (2006) English to German + ...

Export repetitions and translate them first

Nov 12, 2016

During the file analysis you can export 'frequently occurring units' (FOUs). Those are then placed into a separate xliff file and can be translated first - of course, that means that they are going to be translated out of context so the translator might spend more time cross-checking the source file for context, but it's still a very handy feature when files with lots of internal repetitions need to get split between several translators.

When this option is selected, the analysis process checks all files in the project to identify frequently occurring segments. The frequently occurring units are placed in an XLIFF file for translation. A separate XLIFF file is produced for every language pair. The XLIFF files are placed in the project's Exports folder.
The XLIFF files can be translated and the translation units added to the translation memories before work begins on other project tasks.

A unit has to occur in the project source files more times than or equal to the number set in the Number of occurrences box.

Garboktrans wrote:
I need to split a 50k document to be translated by several translators. I am working from a new TM in Trados Studio 2015. The analysis shows 12200 repetitions. Is there a way of knowing where these repetitions are in the document so as not to assign repeated segments?

Trados has very limited filters with regard to repetitions. Basically, there are only two: "first occurrences" (which show only the first occurrences of repetitions, but no non-repeating segments) and "excluding first occurrences" (which excludes all first occurrences of repetitions, but *does not* exclude any non-repeating segments). So there is no filter that shows only non-first occurrences of repetitions.

If you have lots and lots of time:

A. If you don't care about translating in context:
Use the method mentioned by Thomas. Right-click the project, select "Analyse", and click Next until you get to the screen where you would normally click Finish. On that screen, select Analyze Files on the left, and then select "Export frequent segments" and set it to 2. This will result in a file that contains only the first occurrences of repetitions and no other segments whatsoever (the file won't open automatically, but will be located in the Exports subfolder of the project folder).

B. If you do want to translate in context:
Use the "excluding first occurrences" filter and lock all those segments. This means that the entire file is visible to your translators, but only the first occurrences of repetitions can be translated by them.

Then, but only after they have finished translating the repetitions, you can import their file to a new TM and then divide the original file among the other translators (and provide the TM, or pre-translate against the TM).

It is not possible (natively) to create a version of the file in which only non-repeating segments are unlocked.

If you want all translators to work at the same time:

Here's an idea about how to "remove" repeating segments from the file, but I haven't tested it very extensively.

1. Start with a project that contains a copy of the SDLXLIFF file. Then right-click the project and select Batch Tasks > Pseudo-translation. Select "Deterministic" and "Dollar sign" when prompted. In the "Append characters to the start" field, add something unique, e.g. "_#!#!#". This will "translate" all segments with dollar signs. The _#!#!# will be useful later to filter segments that still contain those characters.
2. Go to the file itself, and use the "excluding first occurrences" filter. Then select all segments in that view, and clear the target fields (if you don't know how to select multiple segments, just ask -- there's a trick to it that's not logical).
3. Now use the "first occurrences" filter, and then mark all those segments as "translated".
4. Create a new TM and import the SDLXLIFF file into that TM (or use your favourite method of adding all translated segments to the TM)
5. Now, create a new project with the original file in it, and pre-translate (Batch Tasks) it against that TM (and specify a match threshold of 100%).

The resulting file will have dollar signs in all repeating segments, and all non-repeating segments will be empty. If you want to make first occurrences also empty (a good idea), simply use the "first occurrences" filter on that file, and then clear the target text from those segments.

Don't forget at the end to delete all translations that start with _#!#!#, and then rerun the SDLXLIFF file against the TM so that those empty segments get translated.

I mean, is there a way of clicking on a segment and Trados "telling" you that, for example segment 245 is a repetition of segment 552.

Not as far as I know, no.

[Edited at 2016-11-12 13:37 GMT]

[Edited at 2016-11-12 13:38 GMT]

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

Yes, you can use the Display Filter for this. Use the predefined filter "Repetitions - All" and you will then see only the repetitions. You may then for example lock all these segments to prevent all translators from translating them differently (preferably translate them before you lock them).

Not that it matters for this thread, as the task at hand can be achieved with the filter currently available in Studio 2015, but Studio 2017 will have an Advanced Display Filter where one can combine several criteria to refine the filter, such as Content, Segment Status, Segment Origin, Repetitions, Segment Review, Segment Locking, Comments, Document Structure, so, for example, it would be possible to filter for segments that include a key word, are repetitions and are unlocked.

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

Walter Blaser wrote:
Yes, you can use the Display Filter for this. Use the predefined filter "Repetitions - All" and you will then see only the repetitions.

No, the "All" setting displays all segments, not all repetitions. There is no built-in filter to show all repetitions.

Walter is right.
"General - All Segments" displays all segments.
"Repetitions - All" displays all repetitions.
"Repetitions - First Occurrences" displays only first occurrences of all repetitions.

My test file is this: (I put the repetitions in italics here, to make them stand out)

This is the house that Jack built.This is the ho2use that Jack built.
This is the ho3use that Jack built.This is the house that Jack built.This is the ho2use that Jack built.
This is the ho4use that Jack built.This is the house that Jack built.
This is the ho5use that Jack built.

If I select the "Repetitions - All" filter, then all of these segments remain in the view, even though segments 3, 6 and 8 are not repetitions. If I select "Repetitions - First occurrences", only segments 1 and 2 are displayed. If I select "Repetitions - Excluding first occurrences", then all segments excluding segments 1 and 2 are displayed, even though segments 3, 6 and 8 are not repetitions. I'm using Trados 2015.

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

My test file is this: (I put the repetitions in italics here, to make them stand out)

This is the house that Jack built.This is the ho2use that Jack built.
This is the ho3use that Jack built.This is the house that Jack built.This is the ho2use that Jack built.
This is the ho4use that Jack built.This is the house that Jack built.
This is the ho5use that Jack built.

If I select the "Repetitions - All" filter, then all of these segments remain in the view, even though segments 3, 6 and 8 are not repetitions. If I select "Repetitions - First occurrences", only segments 1 and 2 are displayed. If I select "Repetitions - Excluding first occurrences", then all segments excluding segments 1 and 2 are displayed, even though segments 3, 6 and 8 are not repetitions. I'm using Trados 2015.

This is actually normal behavior for Studio. Your sample sentences are in fact all repetitions, because by adding a number to the word "house", you've turned "hou2se", "hou3se", etc. it into a placeable/token. In fact, if you translate and confirm segment 2 (keeping the token), you will see that segment 3 is autopropagated from segment 2.

Try something like this instead:

This is the house that Jack built.
This is the car that Jack bought.
This is the house that Jack grew up in.
This is the car Jack drove.
This is the house that Jack built.
This is the car that Jack bought.
This is the house that Jack built.
This is the car that Jack bought.

Subject:

Comment:

The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)

To report site rules violations or get help, contact a site moderator: