Data archiving

CodeTwo Backup saves backed-up data in local repositories called storages. As you continue to use the program, the storage volume increases. For safety reasons, backed-up data can be stored only on local disk drives. To avoid situations in which all of your local disk drives are full, CodeTwo Backup (starting from version 1.2) features a data retention policy and archiving.

Archive jobs

Archive jobs are used to create exact (1:1) copies of storages. Combined with the program's data retention policy, they form a control mechanism for storage data growth: only the most recent backup data is kept locally while the rest is stored in an external resource.

Archive jobs are defined separately for each storage by using an archive job configuration wizard. Each storage is treated as a whole, so an exact copy (snapshot) of the current storage state is created in the archive process - all items, including all of their versions and folder structure, are archived.

Info

Every time an archive job is started, a completely new archive is created - the incremental backup mechanism is available only for the backup jobs, not for the archive jobs.

Unlike storages, in which items are kept as binary files, archives store data as versioned XML files and (always encrypted) FTS binary data files. Data format conversion during the archiving (and importing) process is transparent to the user. Moreover, archives keep metadata about their own structure and version, so future versions of the program will be able to import old archives and convert them to the current formats.

Other properties of archive jobs:

  • Each archive job must use a separate target folder for the archived data - it is not possible to share an archive location between multiple archive jobs.
  • Unlike storages, archives can be kept on removable drives or network resources.
  • Archive jobs cannot be paused and resumed. Once started, the archive process has to finish. If stopped by force, an archive will be considered corrupted and automatically removed by the program.
  • While an archive job is running, the storage retention policy is suspended. It resumes after the job is complete.
  • Backup jobs can be performed at the same time as archive jobs.
  • Indexer data for a particular storage is not archived. The Indexer will re-index all data in a new storage once an archive is imported.
  • You can protect an archive with a password so that only you can import it.
  • Aside from the storage retention policy that keeps tabs on the items' age in a storage, you can configure an archive retention policy to manage the number of versions stored for your archive.
  • Archive jobs can be scheduled.
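
The idea of an archive retention policy that limits the number of archive versions can be pictured with a short sketch. This is only an illustration, not CodeTwo Backup's implementation: the function name, the (name, creation date) tuple layout, and the assumption that the newest archives are the ones kept are all hypothetical.

```python
from datetime import datetime


def select_archives_to_remove(archives, max_versions):
    """Return the archives that fall outside the newest `max_versions` entries.

    `archives` is a list of (name, created_at) tuples. Both the tuple layout
    and this function are hypothetical, used for illustration only.
    """
    if max_versions < 1:
        raise ValueError("max_versions must be at least 1")
    # Sort newest first, so everything past the cutoff is the oldest surplus.
    newest_first = sorted(archives, key=lambda a: a[1], reverse=True)
    return newest_first[max_versions:]


archives = [
    ("archive-1", datetime(2024, 1, 1)),
    ("archive-2", datetime(2024, 2, 1)),
    ("archive-3", datetime(2024, 3, 1)),
]
surplus = select_archives_to_remove(archives, max_versions=2)
print([name for name, _ in surplus])  # prints ['archive-1']
```

With a limit of two versions, only the oldest of the three example archives is selected for removal.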

PST archive jobs

With CodeTwo Backup you can also archive to PST files. As PST is a proprietary file format of Microsoft, archiving to PST comes with some limitations:

  • A PST archive job creates a separate PST file for every mailbox selected from a storage. It is not possible to archive a whole storage, containing data from multiple mailboxes, to a single PST file.
  • PST files are not encrypted. Although CodeTwo Backup features password protection for PST archives, this should not be considered a solution for safe storage of data. PST archive password protection uses a standard defined by Microsoft for this file type. Unfortunately, the methods defined by the PST file type standard were cracked a long time ago. It is not difficult to find software that breaks PST passwords literally within seconds. CodeTwo did not implement better password protection for PST archives because, if we did, it would not be possible to import such PST files to MS Outlook.
  • Only the current state of a mailbox from a storage is copied to a PST archive. There is no item versioning, because it is not allowed by the PST file format.
  • Archiving data to PST files is a one-way operation, i.e. it is not possible to import data from a PST file back to a storage.
  • A PST archive file cannot be larger than 10 GB, so if there is more data to be archived, the archive job will automatically create multiple PST files, each no bigger than 10 GB. This is by design, even though MS Outlook accepts PST files as large as 50 GB. During our extensive testing, we discovered inconsistent behavior of Outlook when importing PST files bigger than 10 GB. Also, due to the imperfections of the PST format, PST files corrupt rather easily. The problem becomes more visible for larger PST files. Therefore, for data safety we limited the archived PST file size to 10 GB.
  • PST archives were implemented to allow easy import of archived items directly to MS Outlook. Considering this and the above-mentioned limitations, you probably should not think of PST archives as a proper and safe long-term data archiving method.

Except for the above, PST archive jobs work in a similar way to standard archive jobs and are configured using a wizard as well.
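
The 10 GB splitting rule from the list above can be pictured with a minimal sketch. It is not how CodeTwo Backup actually divides mailbox data: the function name, the greedy in-order grouping, and the reading of "10 GB" as binary gigabytes are assumptions made for illustration.

```python
# 10 GB limit taken from the text; binary gigabytes are an assumption.
MAX_PST_BYTES = 10 * 1024**3


def split_into_files(item_sizes, limit=MAX_PST_BYTES):
    """Group item sizes (in bytes) into consecutive files of at most `limit` bytes.

    Hypothetical helper: items are kept in their original order and a new
    file is started whenever the next item would push the total over the limit.
    """
    files, current, current_size = [], [], 0
    for size in item_sizes:
        if size > limit:
            raise ValueError("a single item exceeds the file size limit")
        if current and current_size + size > limit:
            # The current file is full enough; close it and start a new one.
            files.append(current)
            current, current_size = [], 0
        current.append(size)
        current_size += size
    if current:
        files.append(current)
    return files


GB = 1024**3
parts = split_into_files([4 * GB, 5 * GB, 3 * GB, 6 * GB, 2 * GB])
print(len(parts))  # prints 3
```

Here 20 GB of example data ends up in three files (9 GB, 9 GB and 2 GB), none of which exceeds the limit.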

Importing archives

Standard archives (not the PST ones) can be imported to review or restore old data. Importing works as a one-time job and creates a new storage containing the imported items. Importing to an existing storage is prohibited to avoid item version conflicts. Imported items are always stamped with the current date and time. Learn more about importing archived storages.
