Filter Determination for Segmentation
Content filters process the data in project assets to expose translatable content and hide non-translatable content. Translatable content is presented in segments; the process that content filters perform on assets is called segmentation. Filter configurations allow you to customize how the filters process the data.
Segmentation takes place during operations such as:
- Opening an asset in the Browser Workbench
- Generating a scoping report for an asset
- Exporting an asset to a translation kit
When an asset is segmented, the following steps occur:
- WorldServer inspects the MIME type data for each asset to obtain a filter instance of the appropriate type.
- WorldServer determines which filter configuration to use.
- WorldServer segments the asset using the specified configuration.
Filter association with MIME type
Segmentation is performed by a filter appropriate to the file type of the asset. For example, the default MIME type for the .xml file extension is text/xml, so for an asset with an .xml extension WorldServer by default uses the filter associated with text/xml in the MIME type table, which is the Any XML File Type filter.
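The lookup described above can be pictured as two table lookups: extension to MIME type, then MIME type to filter. The following is only an illustrative sketch, not WorldServer code. The two dictionaries and the function name are hypothetical, only the .xml row is taken from this topic, and treating extensions as case-insensitive is an assumption.

```python
import os

# Hypothetical stand-ins for the MIME type table.
# Only the .xml / text/xml / Any XML File Type row comes from this topic.
MIME_BY_EXTENSION = {".xml": "text/xml"}
FILTER_BY_MIME = {"text/xml": "Any XML File Type"}


def filter_for_asset(path):
    """Return (mime_type, filter_name) for an asset path.

    Either value is None when the tables have no matching entry.
    """
    # Assumption: extensions are matched without regard to case.
    extension = os.path.splitext(path)[1].lower()
    mime_type = MIME_BY_EXTENSION.get(extension)
    return mime_type, FILTER_BY_MIME.get(mime_type)


print(filter_for_asset("docs/guide.xml"))
```

In a real installation the mapping is whatever the MIME type table contains, so an administrator can associate a different filter with text/xml than the default shown here.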
Filter configuration selection
- Upload Files and Create Projects on the Home page
The filter configuration is that specified in the project type for the project.
- Create New Project on the page
As in the first case, the filter configuration is that specified in the project type for the project.
- or ad hoc in the WorldServer Explorer
An AIS property can be assigned to specify a filter configuration for a target asset or directory, in the Change Properties dialog accessed via . (This dialog is also accessible from .)
Filter Configurations drop-down list to appear in the Change Properties dialog.
When WorldServer consults the Filter Configuration property for a project, MIME type, or AIS folder, it checks whether the assigned configuration is associated with the filter for the asset's file type. If it is, the assigned configuration is used. If it is not, the filter's default configuration is used instead. See the "Filter Groups" topic for information on applying more than one filter configuration to a directory in AIS.
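The check in the preceding paragraph amounts to a fallback rule: use the assigned configuration only if it belongs to the filter that handles the asset, otherwise use that filter's default. A minimal sketch of the rule follows. The class, function, and configuration names are hypothetical, and "HTML" is used only as an example of a second filter.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FilterConfiguration:
    name: str
    filter_name: str  # the filter this configuration is associated with


def choose_configuration(assigned, asset_filter, defaults):
    """Return the assigned configuration if it matches the asset's filter.

    Otherwise fall back to the default configuration of that filter.
    `defaults` maps a filter name to its default configuration.
    """
    if assigned is not None and assigned.filter_name == asset_filter:
        return assigned
    return defaults[asset_filter]


# Hypothetical configurations, for illustration only.
defaults = {
    "Any XML File Type": FilterConfiguration("Default XML", "Any XML File Type"),
}
custom_xml = FilterConfiguration("Custom XML", "Any XML File Type")
custom_html = FilterConfiguration("Custom HTML", "HTML")

# Matching association: the assigned configuration is used.
print(choose_configuration(custom_xml, "Any XML File Type", defaults).name)
# Mismatched association: the filter's default configuration is used.
print(choose_configuration(custom_html, "Any XML File Type", defaults).name)
```

A practical consequence is that a mismatched assignment does not cause an error; the asset is still segmented, just with the default configuration rather than the one you intended.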
Asset Resegmentation
WorldServer resegments assets when it detects that the filter configuration has changed. However, not every change is detected automatically: in some cases you have to touch the file or clear the cache yourself. A filter configuration can depend on many factors; for example, changing sentence-breaking rules affects segmentation. Unless you are absolutely sure that an asset has been resegmented after configuration changes, it is best to force resegmentation by touching the source file to change its timestamp.
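If the source files are reachable on a file system where you have write access, touching a file can be scripted. The sketch below is one way to do it with the Python standard library; the function name and the example path are hypothetical, and it does not cover clearing the cache.

```python
import os


def touch(path):
    """Set the access and modification times of an existing file to now.

    Raises FileNotFoundError if the file does not exist, so a mistyped
    path does not silently create an empty asset.
    """
    os.utime(path, None)


# Example call (hypothetical path):
# touch("/path/to/source/guide.xml")
```

os.utime is used here rather than pathlib.Path.touch because the latter creates the file when it is missing, which is not wanted for an existing source asset.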