Repetitions and markup tags

Recognizing segment repetitions is important if you want to estimate the effort needed for translating a document. In WorldServer, certain types of markup tags affect whether or not a segment is a genuine repetition of another.

WorldServer treats markup tags as interchangeable when they are of the same type (standalone, opening, or closing) and appear at the same location in the segment, and it considers such tags equivalent in the repetition analysis. Equivalence depends on tag type and location, not on the content of the tag placeholder.

For example, the following sentences are likely to be repetitions in WorldServer:
  • This is a <b>simple sentence</b> with tags.
  • This is a <i>simple sentence</i> with tags.
On the other hand, the following sentences are not likely to be repetitions in WorldServer, because the closing tag appears at a different location in each segment:
  • This is a <b>simple sentence</b> with tags.
  • This is a <b>simple</b> sentence with tags.
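or, because the tags are of different types (opening and closing in the first segment, standalone in the second):
  • This is a <b>simple sentence</b> with tags.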
  • This is a <br>simple sentence<br> with tags.
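
The rule illustrated by these examples can be sketched in a few lines of Python. This is only an illustration of the described behavior, not WorldServer's actual implementation: the function names (`repetition_key`, `is_repetition`), the `STANDALONE` tag set, and the `{OPEN}`, `{CLOSE}`, and `{STANDALONE}` markers are all assumptions made for the example. Each tag is replaced by a marker for its type, so two segments compare as equal only when their text matches and tags of the same type sit at the same locations.

```python
import re

# Matches simple HTML-like tags such as <b>, </b>, <br>, or <br/>.
TAG_RE = re.compile(r"<(/?)([A-Za-z][A-Za-z0-9]*)[^<>]*?(/?)>")

# Hypothetical set of tag names treated as standalone (no closing partner).
STANDALONE = {"br", "hr", "img"}

def tag_kind(match):
    """Return a marker for the tag's type, ignoring its name and attributes."""
    closing, name, self_closing = match.group(1), match.group(2).lower(), match.group(3)
    if closing:
        return "{CLOSE}"
    if self_closing or name in STANDALONE:
        return "{STANDALONE}"
    return "{OPEN}"

def repetition_key(segment):
    """Replace every tag with its type marker, keeping the text and tag positions."""
    return TAG_RE.sub(tag_kind, segment)

def is_repetition(first, second):
    """Two segments repeat each other if text, tag types, and tag locations match."""
    return repetition_key(first) == repetition_key(second)

if __name__ == "__main__":
    base = "This is a <b>simple sentence</b> with tags."
    print(is_repetition(base, "This is a <i>simple sentence</i> with tags."))
    print(is_repetition(base, "This is a <b>simple</b> sentence with tags."))
    print(is_repetition(base, "This is a <br>simple sentence<br> with tags."))
```

Under these assumptions, the first comparison reports a repetition and the other two do not, which mirrors the three example pairs above.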