Archiving process
Archiving of content is integrated into the publishing and deployment process.
The archiving process works as follows:
- Authors create content in Content Manager and publish it.
- Content Manager sends the content as a Transport Package to the Content Deployer, which receives and analyzes content and reads instructions about the publish action.
- As part of deployment, the Archive Manager Modules find out what to archive. Specifically, this means sending content to the Archive Manager Queue, flagging content that has been unpublished, and keeping track of:
- the content used to compose a Page (Components and Component Templates, jointly called Component Presentations)
- linking information (managed and unmanaged links in the Page)
- the dependencies between pages and resources
- user-defined configuration (inclusions, exclusions, personalization)
- The Archiver picks up items from the Archive Manager Queue and actually archives them. It archives all content (used in the composition of a page such as images and CSS files) published from the Content Manager. To optimize performance, you can install the Archiver on a separate machine as part of the Archive Manager Server Role.
- At the same time (the Archiver works asynchronously), the Content Deployer sends the content to the Storage Layer for storage. The Storage Layer stores content in the Content Data Store and passes content to the Web site where content is presented to visitors.
- The Archiver gets a list of affected URLs (resources updated as part of the publish action) and all pages that are assembled dynamically from the Content Data Store. It then archives all the artifacts used to construct the Page, such as the URLs, the content, and linking information.
- You can use Archive Manager Explorer to view the Web site archived in the archiver database. Archive Manager Explorer accesses the database directly or via the Archive Manager Web Service, depending on your environment. In the Archive Manager Explorer you can:
- See when information went online or offline (published or unpublished)
- Identify all modifications made in a given period
- Locate the Content Manager resources used for a specific archived page
- View personalized pages