Documentation Center

XML: Any XML settings

The XML: Any XML settings file type has the *.xml extension. This XML v1-based file type will be deprecated in 2022. Use XML v2-based file types instead.

Detection

Setting
The file detection settings for this XML file type are pre-defined and cannot be changed.

Parser

Setting
The parser rules for this XML File Type are auto-generated and cannot be changed. If you required manual control over the rules, create a new XML file type and import the rules from a suitable source file. Suitable source files are: XML (*.xml), XML Schema (*.xsd), SDLX ANX (*.anl), Trados INI (*.ini) or ITS (*.xml).

Writer settings

SettingInstruction
Unicode UTF-8 byte order mark (BOM)From the list, choose an option to determine how BOM is handled during translation: Preserve; don't add if not originally present, Preserve: add if not originally present, Remove if present.
Values of xml:lang and lang attributeFrom the list, choose an option to determine how 'lang' is handled during translation: Change matching source language to target language, Always change to target language, Do not change.

Whitespace settings

SettingInstruction
Whitespace in contentDecide how whitespace is treated during translation: Normalize unless xml:space='preserve' ; Always preserve; Always normalize.
Normalize whitespace in tagsSelect the check box to ensure that whitespace is normalized in tags.

Namespace settings

SettingInstruction
The namespace mappings for this XML File Type are auto-generated and cannot be changed. If you require manual control over the namespace mappings, create a new XML file type.

Validation

SettingInstruction
Perform schema and DTD validation during file detectionDecide how schemas and DTD files are validated:
  • Treat all validation warnings as file parsing errors
  • Report warning if no DTD/schema can be found
Perform schema validation when verifying translation
Manually specify schemaEnable this option to specify schemas manually. Decide how the schemas are used:
  • Use for all XML documents
  • Use only for documents which do not specify DTD/schema
Master SchemasKeep the default values or add more files.
Dependency Schemas and DTDsKeep the default values or add more files.

Entity settings

SettingInstruction
Enable entity conversionEntity settings specify whether special characters are converted to their corresponding HTML entity. Select or clear the check box depending on whether you want to have entity conversion enabled or not.
Convert numeric entity references to inline placeholder tagsSelect or clear the check box depending on whether you want to have the numeric entity conversion enabled or not.
Add an entitySelect plus sign, add the character, and decide whether the available check boxes are enabled or not: READ AS CHARACTER, WRITE AS ENTITY. When you add a new entity, the WRITE AS ENTITY setting is disabled automatically. The WRITE AS ENTITY setting only works for the default entities, and is disabled for the ones you add, which means that for the default ones characters will always be converted to entities.
  • READ AS CHARACTER - This setting specifies which entities are converted to their respective characters during parsing.
  • WRITE AS ENTITY - This setting specifies which characters are converted to their respective entities during writing.
Edit an entityAdjust the given values.

Embedded Content

SettingInstruction
Allow processing of content embedded in a document into tagged markup according to a set of regular expression rules. Embedded content is identified using document structure information.Toggle the button to enable the use of regular expressions when processing embedded content as translatable content.
Defined by document structure information - Document structure informationSelect or configure the document structure and then select plus sign.
Defined by document structure information - Tag definition rulesEnable this option to set the selected embedded content processors to extract embedded content from specific context. Create extraction rules based on the document structure information. Make sure that the document structure information you specify here is covered by a parser rule in the Parser settings section for your file type. Trados Studio and Trados Online Editor can only extract embedded content that is recognized by the file type parser.
  1. Select Add New Rule.
  2. From the Tag Type list, choose a value: Placeholder or Tag Pair.
  3. In the Regular expression field, enter the regular expression.
  4. Select the Ignore Case check box to ignore the case of the identified content. Otherwise, keep the default value (clear check box).
  5. From the Segmentation Hint list, choose a value from the list to determine how segmentation is performed: May Exclude (default), Include, Include With Text, Exclude. The value you specify here is available when you configure a parser rule under Parser > Add New Rule > Properties > Advanced Settings > Segmentation Hint. Check this topic to learn what each segmentation hint does.
  6. Select Save.