Documentation Center

HTML custom settings

The HTML custom file type has the *.htm, *.html, *.asp, *.jsp, *.aspx, *.ascx, *.inc, *.php, *.hhk and *.hhc extensions.

Detection

SettingInstruction
ON/OFFToggle the button on or off.
DOCTYPE declarationType a declaration.
Root element namesType an element name and select plus sign.
Namespace declaration (xmlns) URIsType a namespace declaration and select plus sign.

Parser

OperationsInstruction
Add a ruleParsing rules define how elements are handled. To add a rule:
  1. Select Add New Rule.
  2. Under the Rule field, type the rule name and condition.
  3. Under Attributes, type an attribute name, select the Translate check box to make it translatable, and select plus sign.
  4. Under Properties, edit the following:
    • Translate: Yes (default), No, Inherit.
    • Tag Type: Structure (default), Inline.
    • Whitespace: Inherit from Parent (default), Always preserve, Always normalize, Normalize unless xml:space='preserve'
    • Segmentation Hint - This option is available only if you work with inline tags. Select one of the available options: May Exclude (default), Include, Include With Text, Exclude. This option is available only if you work with inline tags and only if you specified the extraction rule (segmentation hint) in the Embedded Content section of your file type. Select one of the available options: May Exclude (default), Include, Include With Text, Exclude. Check this topic to learn what each segmentation hint does.
  5. This option is available only if you work with inline tags. Under Formatting, specify: the size, the color, the position (Inherit, Normal, Superscript, Subscript), and the style (Bold, Italic, Strikethrough, Underline).
  6. This option is available only if you work with structure tags. Under Structure Information:
    1. Select Add new.
    2. Under Properties, select one of the available structure elements from the Name list, and then specify a code, an identifier, a description, color.
    3. Under Formatting, specify: the size, the color, the position (Inherit, Normal, Superscript, Subscript), and the style (Bold, Italic, Strikethrough, Underline).
  7. Select Save.

Writer settings

SettingInstruction
Unicode UTF-8 byte order mark (BOM)From the list, choose an option to determine how BOM is handled during translation: Preserve; don't add if not originally present, Preserve: add if not originally present, Remove if present.
Values of xml:lang and lang attributeFrom the list, choose an option to determine how 'lang' is handled during translation: Change matching source language to target language, Always change to target language, Do not change.
<META charset> tagsFrom the list, choose an option.

Whitespace settings

SettingInstruction
Whitespace in contentDecide how whitespace is treated during translation: Normalize unless xml:space='preserve' ; Always preserve; Always normalize.
Normalize whitespace in tagsSelect the check box to ensure that whitespace is normalized in tags.

Entity settings

SettingInstruction
Enable entity conversionEntity settings specify whether special characters are converted to their corresponding HTML entity. Select or clear the check box depending on whether you want to have entity conversion enabled or not.
Convert numeric entity references to inline placeholder tagsSelect or clear the check box depending on whether you want to have the numeric entity conversion enabled or not.
Add an entitySelect plus sign, add the character, and decide whether you need the READ AS CHARACTER enabled or not. When you add a new entity, the WRITE AS ENTITY setting is disabled automatically.
  • READ AS CHARACTER - This setting specifies which entities are converted to their respective characters during parsing.
  • WRITE AS ENTITY - This setting specifies which characters are converted to their respective entities during writing.
Edit an entityAdjust the given values.

Embedded Content

SettingInstruction
Server side scripts usingIf you select this, the default option is Embedded Content Plain Text v. 1.0.0.0.
Other elements identified by > Parser rulesChoose a parser rule, an embedded content processor, and select plus sign.
Other elements identified by > Structure information > Document structure informationChoose an element and select plus sign.
Other elements identified by > Structure information > Tag definition rulesTo add a tag definition rule:
  1. Under the Start Tag Expression and End Tag Expression columns, type the relevant information.
  2. From the Tag Type list, choose a value: Placeholder or Tag Pair.
  3. Under the Translate column, select the check box to enable the content for translation.
  4. Under the Ignore Case column, select the check box to ignore the case of the identified content.
  5. Under the Segmentation Hint column, choose a value from the list to determine how segmentation is performed: Include, May Exclude, Include With Text, Exclude. The value you specify here is available when you configure a parser rule under Parser > Add New Rule > Properties > Advanced Settings > Segmentation Hint. Check this topic to learn what each segmentation hint does.
  6. If you selected the Tag Pair option, under the Can Hide column, select the check box if the content can be hidden.
  7. If you selected the Tag Pair option, under the Formatting column, select Edit formatting, and then in the Font Formatting dialog:
    • Under Specify size, choose a font size from the list.
    • From the Position list, choose a value: Inherit, Normal, Superscript, Subscript.
    • Select the Specify color check box and select a color.
    • Select one, none or all of the following check boxes: Bold, Italic, Strikethrough, Underline.