Embedded content processors

Embedded content is content that uses a different syntax from the one native to the file type that contains it. For example, a complex XML file may contain chunks of HTML content inside a CDATA section.
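To make the CDATA case concrete, the following minimal sketch parses a small XML document with Python's standard library. The document, its element names and its text are hypothetical and serve only as an illustration. It shows that an XML parser returns the CDATA payload as plain character data, so the HTML inside it is never parsed as markup.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML document: <body> carries HTML inside a CDATA section.
XML_SOURCE = """<article>
  <title>Release notes</title>
  <body><![CDATA[<p>Click <b>Save</b> to continue.</p>]]></body>
</article>"""

root = ET.fromstring(XML_SOURCE)
body = root.find("body")
embedded = body.text

# The XML parser treats the CDATA payload as text, not as child elements.
print(embedded)   # <p>Click <b>Save</b> to continue.</p>
print(len(body))  # 0
```

Because the HTML arrives as an opaque string, a tool that understands only the XML syntax cannot tell the `<b>` tag from translatable text without a second, HTML-aware pass.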

Trados Studio can use separate processors to handle content embedded inside XML, XHTML, Java and Excel files. This enables Trados Studio to process embedded content as a separate file within the XML file, and allows you to configure different extraction and display settings for main and embedded content.

Available embedded content processors

The Embedded Content page containing the settings for configuring embedded content processing is available for the following file types:

New embedded content processors

File types:
  • XML: Microsoft .NET Resources
  • XML: OASIS DITA 1.2 Compliant
  • XML: OASIS DocBook 4.5 Compliant
  • New XML (Embedded Content) file types

These processors:
  • extract embedded content with a dedicated processor, specific to the type of embedded content found inside the XML file
  • extract embedded content as a separate, individual file
  • enable you to specify which parts of the embedded content should be processed
  • enable you to define custom settings for processing the extracted embedded content

Legacy embedded content processor

File types:
  • XML: Any XML
  • Microsoft Excel
  • Java Resources
  • New XML (Legacy Embedded Content) file types

These processors:
  • extract embedded content with the file type parser, and then process the extracted content with a generic embedded content processor
  • do not differentiate between the types of embedded content extracted from the file, which prevents you from specifying custom extraction and display settings for different types of embedded content
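
The difference between a dedicated and a generic processor can be sketched as a simple dispatch on content type. This is an illustration only, not Trados Studio's implementation: the function names, the `DEDICATED` table and the type labels `"html"` and `"text"` are all hypothetical.

```python
import html
import re
from typing import Callable, Dict, List, Optional

def process_html(content: str) -> List[str]:
    """Hypothetical HTML processor: drop tags, decode entities, keep text runs."""
    parts = re.split(r"<[^>]+>", content)
    return [html.unescape(p).strip() for p in parts if p.strip()]

def process_plain_text(content: str) -> List[str]:
    """Hypothetical plain-text processor: one unit per non-empty line."""
    return [line.strip() for line in content.splitlines() if line.strip()]

def process_generic(content: str) -> List[str]:
    """Legacy-style fallback: every payload is handled the same way, as one unit."""
    return [content.strip()]

# Dedicated processors keyed by the (assumed) type of embedded content.
DEDICATED: Dict[str, Callable[[str], List[str]]] = {
    "html": process_html,
    "text": process_plain_text,
}

def extract(content: str, content_type: Optional[str] = None) -> List[str]:
    """Use a dedicated processor when the type is known, else the generic one."""
    handler = DEDICATED.get(content_type, process_generic)
    return handler(content)
```

With this sketch, `extract("<p>Tom &amp; Jerry</p>", "html")` yields the decoded text without tags, while the same call without a content type returns the raw string unchanged, which is why a generic path leaves no room for type-specific settings.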

Why should I use embedded content processors?

Trados Studio 2022 SP1 and later include separate embedded content processors, dedicated to handling HTML 5 content and Plain Text inside XML files. These dedicated processors enable Trados Studio to handle such sub-content as a separate file within the XML file. This gives you settings to:
  • Configure what information Trados Studio extracts from the embedded content.
  • Tag embedded content as translatable or untranslatable when opened in the Editor view.
  • Use custom formatting for each embedded content element.
  • Segment HTML content correctly before converting it to a translatable format.
  • Add context information to structure HTML elements so that this reference information is available in the Document Structure column.
  • Display HTML character entities in the Editor view.
  • Control HTML whitespace characters in embedded content.
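
Several of the points above (segmentation, structure context, character entities and whitespace) can be illustrated with a small sketch built on Python's standard `html.parser` module. It is not how Trados Studio works internally: the `SegmentCollector` class, the `segment_html` helper and the `BLOCK_TAGS` set are hypothetical names chosen for this example.

```python
from html.parser import HTMLParser

# Assumed set of structure elements that start a new segment.
BLOCK_TAGS = {"p", "li", "h1", "h2", "h3", "div"}

class SegmentCollector(HTMLParser):
    """Collect (structure_tag, text) pairs from an HTML fragment."""

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; into characters.
        super().__init__(convert_charrefs=True)
        self.segments = []
        self._tag = None
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self._flush()
            self._tag = tag

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self._flush()
            self._tag = None

    def handle_data(self, data):
        self._buffer.append(data)

    def _flush(self):
        # Collapse runs of whitespace, as HTML rendering would.
        text = " ".join("".join(self._buffer).split())
        if text:
            self.segments.append((self._tag, text))
        self._buffer = []

def segment_html(fragment):
    parser = SegmentCollector()
    parser.feed(fragment)
    parser.close()
    parser._flush()  # emit any trailing text outside a block element
    return parser.segments

print(segment_html("<h1>Setup</h1><p>Tom &amp; Jerry\n   run <b>fast</b>.</p>"))
# [('h1', 'Setup'), ('p', 'Tom & Jerry run fast.')]
```

Each pair keeps the enclosing structure tag next to the text, which is the kind of context a Document Structure column could display. Inline tags such as `<b>` are dropped here for brevity, whereas a real processor would keep them as tags in the segment.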