Storage

Archiving a Web site transfers its content to persistent storage in a relational database (RDBMS), specifically Microsoft SQL Server or Oracle. The storage of records is optimized by archiving only content that is unique.

The following scenarios describe how the Archive Manager stores records:

Publish event | Archive Manager response
Initial publication | Archive Manager stores the page as a record in the database along with each artifact it links to (such as JSPs, ASPs, HTML files, images, and other binary file types).
Republication (content unchanged) | If neither the contents of the page nor its artifacts have changed, Archive Manager creates a new version of the record to indicate that the resource has been republished, but the content itself is not rearchived.
Republication (content changed) | Archive Manager creates a new version of the record and a new entry for its content.
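
The three scenarios can be illustrated with a minimal in-memory sketch. This is not Archive Manager's actual implementation or schema: the class ArchiveSketch, its publish method, and the use of a SHA-256 digest to decide whether content is unique are all assumptions made for illustration, and a real deployment would keep these records in SQL Server or Oracle tables rather than Python dictionaries.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class ArchiveSketch:
    """Hypothetical model of version records that share unique content."""

    # content digest -> payload; each unique payload is stored only once
    contents: dict = field(default_factory=dict)
    # resource URL -> list of content digests, one entry per published version
    versions: dict = field(default_factory=dict)

    def publish(self, url: str, payload: bytes) -> int:
        """Record a publish event and return the new version number."""
        # Assumption: uniqueness is detected by hashing the payload.
        digest = hashlib.sha256(payload).hexdigest()
        # Store the payload only if this exact content is not archived yet.
        self.contents.setdefault(digest, payload)
        # Every publish event adds a version, even when content is unchanged.
        history = self.versions.setdefault(url, [])
        history.append(digest)
        return len(history)


archive = ArchiveSketch()
archive.publish("/index.html", b"<h1>Hello</h1>")   # initial publication
archive.publish("/index.html", b"<h1>Hello</h1>")   # republication, unchanged
archive.publish("/index.html", b"<h1>Hello!</h1>")  # republication, changed

# Three versions are recorded, but only two content entries are stored.
print(len(archive.versions["/index.html"]), len(archive.contents))
```

In this sketch, a linked artifact such as an image would be published the same way under its own URL, so an unchanged artifact would likewise gain a version without a second copy of its content.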