This guide will show you how Stash takes snapshot of PersistentVolumeClaims and restore them from snapshot using Kubernetes VolumeSnapshot API.
At first, you need to have a Kubernetes cluster and ensure that a CSI driver that implements snapshots is deployed on your cluster. You can find a list of CSI drivers that supports snapshots here. In this guide we are going to use GCE Persistent Disk CSI Driver.
You need to enable the Kubernetes VolumeSnapshotDataSource alpha feature via Kubernetes feature gates
--feature-gates=VolumeSnapshotDataSource=trueInstall Stash in your cluster following the steps here.
You should be familiar with the following Stash concepts:
You should be also familiar with the following Kubernetes concepts:
The following diagram shows how Stash creates VolumeSnapshot via Kubernetes native API. Open the image in a new tab to see the enlarged version.
The VolumeSnapshot process consists of the following steps:
At first, a user creates a BackupConfiguration crd which specifies the targeted workload or targeted PVC.
Stash operator watches for BackupConfiguration crd.
When it finds a BackupConfiguration crd, it creates a CronJob to take a periodic backup of the target volumes.
The CronJob triggers backup on each scheduled time slot by creating a BackupSession crd.
Stash operator watches for BackupSession crd.
When it finds a BackupSession crd, it creates a volume snapshotter Job to take snapshot of the targeted volumes.
The volume snapshotter Job creates VolumeSnapshot crd for each PVC of the target and waits for the CSI driver to complete snapshotting. These VolumeSnasphot crd names follow the following format:
<PVC name>-<BackupSession creation timestamp in Unix epoch seconds>
CSI external-snapshotter controller watches for VolumeSnapshot.
When it finds a VolumeSnapshot object, it backups VolumeSnapshot in the respective cloud storage.
Once the snapsotting is completed, Stash Operator updates the status.phase field of the BackupSession crd.
The following diagram shows how Stash restores PersistentVolumeClaims from snapshot using Kubernetes VolumeSnapshot API. Open the image in a new tab to see the enlarged version.
The restore process consists of the following steps:
At first, a user creates a RestoreSession crd which specifies the volumeClaimTemplates. VolumeClaimTemplates hold the information about VolumeSnapshot.
Stash operator watches for RestoreSession crd.
When it finds a RestoreSession crd, it creates a Restore Jobto restore PVC from the snapshot.
The restore Job creates PVC with spec.dataSource field set to the respective VolumeSnapshot name.
CSI external-snapshotter controller watches for PVC.
When it finds a new PVC with spec.dataSource field set, it reads the information about the VolumeSnapshot.
The controller downloads the respective data from the cloud and populate the PVC with it.
Once restore process is completed, the Stash operator updates the status.phase field of the BackupSession crd.