Configuring the HTML 5 Embedded content processor

When you are working with an *.xml file that contains chunks of embedded HTML 5 content, for example in a CDATA section, you can assign a dedicated embedded content processor to handle this sub-content separately.

About this task

When you assign an embedded content processors for HTML 5 in an *.xml file, Studio 2014 SP1 and later handles embedded HTML 5 content as a different file within the *.xml document. This enables you to set custom settings for extracting and displaying correctly HTML 5 content embedded in the *.xml file.

To assign the embedded content processor for HTML 5 content in an *.xml file:

Choose the XML file type to process your *.xml document

Procedure

  1. Click File > Options > File Types to change the global XML settings or Project Settings > File Types to change the XML settings for the current project only.
  2. Expand the XML file type relevant for your *.xml document (XML: Microsoft .Net Resources, XML: Oasis DITA 1.2 Compliant, XML: Oasis DocBook 4.5 Compliant) or create a new XML (Embedded Content) file type. For more information about creating a custom file type, see Creating a File Type.

Enable the Embedded content processor to handle the HTML 5 content embedded in your *.xml document

Procedure

  1. On the Embedded content page of your XML file type, enable the Process embedded content using the following processor checkbox and choose HTML Embedded Content 5.2.0.0 from the drop-down menu.
  2. Specify where SDL Trados Studio should look for embedded content in the *.xml file. Choose the CDATA section options to process all HTML 5 content embedded in CDATA sections. Choose Document structure > Add... to define a more specific element where to look for embedded HTML 5.

Set processing rules for the HTML 5 embedded content processor

Procedure
  1. On the Options or Project Settings dialog box, expand Embedded Content Processors > HTML 5.
  2. On the Parser page, add parser rules to define how to extract for translation the HTML 5 content embedded in an *.xml file. For more information about setting HTML parser rules, see HTML Parser and Add/Edit/Copy Rule Page.
  3. On the Entities page, specify how to display HTML character entities in the Editor view. For more information about entity conversion and entity mapping, see HTML Entities.
  4. On the Whitespace page, define how Studio should deal with any extra whitespace characters it finds in the HTML 5 content embedded in the *.xml file. For more information about normalizing whitespace, see HTML Whitespace.
  5. Click OK to save your changes and close the Options/Project Settings dialog box.