Archiving Web site Publications
For basic configuration of the Archive Manager you need to specify the Web site, or Web sites, you want to archive. You do this by adding a <Publication> element for each Web site in the Archive Manager cd_archivemanager_conf.xml configuration file.
Procedure
- Open your Archive Manager
cd_archivemanager_conf.xmlconfiguration file, by default located in your Tridion Installation\configdirectory, in a text editor: - Add a
<Publication>child element in the<Publications>section for each Web site you want to archive. The<Publication>element requires the following attributes:Id— is the item ID of the Publication for which you want all content that is published or unpublished from this publication to be archived.BaseUrl— defines the start of a URL and typically contains the protocol, host, and port parts of the URL; any published page which contains this URL is archived.
You can also set a defaultBaseUrlon the<Publications>element which is used if a<Publication>element does not have aBaseUrlspecified. For example:<Publications BaseUrl="http://www.mywebsite.com:8080"> <Publication Id="37"/> <Publication Id="20" BaseUrl="http://www.myotherwebsite.com:8082"/> </Publications>In the example above:- all URLs published from Publication 37 that start with the URL
http://www.mywebsite.com:8080are archived - all URLs published from Publication 20 that start with the URL
http://www.myotherwebsite.com:8082are archived.
Note: If you use the default port for your protocol (port 80 for HTTP, or port 443 for HTTPS), do not specify the port in theBaseUrlattribute.Note: The Content Manager uses a single URI for each Content Manager object to uniquely identify an item in the system. You can see URI of an item in the tooltip when you hover over an item, or in the item Info tab. For example, in the Publication URItcm:0-17-1, the item ID is 17. The format of a Content Manager URI is:tcm:<PublicationID>-<ItemID>[-<ItemType>][-v<VersionNumber>] - Save and close
cd_archivemanager_conf.xml.
- Including URLs
Inclusion rules allow you to define additional URLs to be archived when certain parts of the site, or certain resources, are published. You need to define inclusion rules for each Web site Publication you are archiving by adding<InclusionRule>elements as child elements a<Publication>. - Excluding URLs
You can define the conditions under which URLs are excluded from being archived by defining exclusion patterns. Exclusion patterns allow you to prevent certain parts of your Web site, or specific file types, from being archived. For example, to preserve storage space you may want to exclude private information, video files, and downloadable files such as drivers from being archived. You define exclusions by adding<Exclude>elements in the<capture>section of thecd_archivemanager_conf.xmlfile.