Tokens and Word Counts in Studio
The Studio-aligned TM mode can automatically substitute recognized tokens (a piece of text in one of a pre-defined set of formats) for the text in a translatable item (file, asset, database entry, etc.) when WorldServer computes a translation proposal. The TM then recognizes and counts these tokens as a single word.
The following token types are recognized:
- Dates
- Times
- Numbers (in numerals)
- Measurements
- Acronyms and URLs
This treatment can reduce the word counts (and hence costs) for some types of content. The Studio-aligned mode handles simple differences (such as international formatting) in these tokens in a way that results in better matches for translated information. For certain types of content this could enhance project processing performance and quality.
The Studio-aligned TM mode recognizes this special kind of content and auto-substitutes (replaces) appropriate recognized tokens. Recognition and related settings can be controlled through the tm.properties file.