Capturing
Capturing defines how content is captured and which content is in scope. The Archive Manager captures and archives Uniform Resource Locators (URLs).
These URLs include HTML, JSP, ASP, .NET, and ASP.NET Pages, and all other files ("artifacts") used to compose Web pages. Artifacts include, for example, CSS, JS, XML, and XSL files, as well as files of every MIME type supported by the Content Manager, such as GIF, JPEG, and MPEG files.
The capture process gets a list of all affected URLs whenever content is published, unpublished, or republished. The affected URLs are all resources that have changed as a result of the publish action. The capture process also covers the following cases:
- Archiving Pages when a Component is published: all Pages on which the Component is displayed are archived. This includes Dynamic Component Presentations in which the Components are statically included on the Page.
- Archiving Pages regardless of deployment order: if a Page is captured and archived before one or more of its artifacts has been published, the Page is re-archived when the linked artifact or resource is eventually published (but only if that resource changes the Page).
- Archiving Pages if a dynamic link changes: for example, if the priority or directory location changes, or if the Component links to a Component that has not yet been published (so that only the link text is displayed).
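The idea of collecting affected URLs can be illustrated with a minimal sketch. It is not the Archive Manager's implementation: the `Page` class, the `affected_urls` function, and the identifiers used in the example are all hypothetical, and the sketch assumes each Page simply records the Components and artifacts it depends on.

```python
from dataclasses import dataclass, field


@dataclass
class Page:
    """Hypothetical record of a Page and the items it depends on."""
    url: str
    # Component identifiers and artifact URLs that the Page uses
    dependencies: set = field(default_factory=set)


def affected_urls(changed_item, pages):
    """Return the sorted URLs of all Pages to capture after `changed_item`
    (a Component identifier, artifact URL, or Page URL) is published."""
    hits = {
        page.url
        for page in pages
        if page.url == changed_item or changed_item in page.dependencies
    }
    return sorted(hits)


pages = [
    Page("/index.html", {"component-1", "/css/site.css"}),
    Page("/about.html", {"component-2"}),
]

# Publishing a Component captures every Page that displays it.
print(affected_urls("component-1", pages))

# Publishing an artifact later captures the Pages that use it again.
print(affected_urls("/css/site.css", pages))
```

The sketch treats every dependency hit as a reason to capture the Page. It does not model the condition that a Page is re-archived only if the late-published resource actually changes it.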
Supported MIME types
The Archive Manager can archive all resources in a Web site that are built with the major Web technologies, such as HTML, JSP, ASP.NET, and Flash. The following table describes the supported binary file types:
| Name | MIME type | Possible extensions |
|---|---|---|
| Access Database | application/octet-stream | mdb |
| Bitmap Image | application/octet-stream | bmp |
| Excel Sheet | application/ms-excel | xls |
| Executable | application/octet-stream | exe |
| Flash File | application/x-shockwave-flash | swf |
| Gif Image | image/gif | gif |
| Jpeg Image | image/jpeg | jpg, jpeg, jpe |
| MP3 Music | audio/x-mpeg | mp3 |
| Mpeg Video | video/mpeg | mpg |
| PDF Document | application/pdf | pdf |
| Plain Text | text/plain | txt |
| Png Image | image/png | png |
| PowerPoint Presentation | application/ms-powerpoint | ppt |
| QuickTime Movie | video/quicktime | mov, qt |
| Real Player | video/vnd.rn-realmedia | rm, ram, ra, rv |
| Rich Text | text/rtf | rtf |
| Sound File | application/x-wav | wav |
| Word Document | application/msword | doc |
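As an illustration of how the table can be used, the following sketch maps a file extension to its MIME type. It copies only a few rows of the table, and the names `MIME_BY_EXTENSION` and `mime_type_for` are hypothetical. They are not part of any Archive Manager API.

```python
import posixpath

# A few rows of the table above, keyed by extension (illustrative subset)
MIME_BY_EXTENSION = {
    "gif": "image/gif",
    "jpg": "image/jpeg",
    "jpeg": "image/jpeg",
    "jpe": "image/jpeg",
    "png": "image/png",
    "swf": "application/x-shockwave-flash",
    "txt": "text/plain",
    "mov": "video/quicktime",
    "qt": "video/quicktime",
}


def mime_type_for(url_path):
    """Return the MIME type for the extension of `url_path`,
    or None if the extension is not in the lookup table."""
    extension = posixpath.splitext(url_path)[1].lstrip(".").lower()
    return MIME_BY_EXTENSION.get(extension)


print(mime_type_for("/images/logo.GIF"))
print(mime_type_for("/media/intro.mov"))
```

The lookup ignores case, so `logo.GIF` and `logo.gif` resolve to the same MIME type. Extensions that are not listed return `None` rather than a guessed type.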