Fraud-detection with encrypted data
PERSONA: Application developer
This example focuses on the same use case as before, fraud detection, but doesn't run the container image used so far. Instead, it runs the workload as a Jupyter notebook or an OpenShift AI workbench.
The Confidential Workflow
The main difference here is that the Jupyter notebook has two sealed secrets attached: one for Azure storage access, and another for the dataset decryption key.
- The fraud-detection container starts.
- Two of the volumes attached to this pod are sealed secrets (the Azure credentials and the decryption key).
  - The CoCo internal components find the sealed secrets and begin attestation to retrieve their contents.
  - Attestation starts with the confidential container generating a report that proves it is a genuine CoCo running on a secure, trusted platform.
  - The CoCo then sends this report to Trustee. Only after Trustee verifies that the report is genuine and correct does it confirm that the container is secure by releasing the requested secret.
  - The secret is then inserted into the corresponding sealed secret, which acts as a normal volume mount.
- The container downloads the public model.
- It then tries to pull the encrypted dataset. The Azure credentials are not available in the container; they are held remotely by Trustee.
  - The Azure SAS (a connection string in a real deployment) required to access the blob has been loaded as a sealed secret in the CoCo pod. If everything went well, it is available in a volume defined in the podspec.
- Once the blob is accessed, the dataset still needs to be decrypted. The decryption key is not present in the container either; it too is held remotely by Trustee.
  - The key required to decrypt the dataset has been loaded as a sealed secret in the CoCo pod. If everything went well, it is available in a volume defined in the podspec.
  - In addition, in the Jupyter notebook it is also possible to try lazy attestation and fetch the key manually.
- The container uses the released key to decrypt the credit card datasets in memory.
- The (now-decrypted) private data is fed into the public model for processing, all within the protected container.
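The lazy-attestation option mentioned in the workflow can be sketched in Python. This is a minimal sketch, assuming the in-guest confidential-data-hub REST endpoint listens on its default port 8006 with a /cdh/resource path layout; both are assumptions based on guest-components defaults, and the call itself only succeeds inside a running CoCo pod:

```python
# Sketch of "lazy" attestation from inside the notebook: instead of reading a
# sealed-secret volume, fetch the key on demand from the in-guest
# confidential-data-hub (CDH) REST endpoint. Port and URL layout are
# ASSUMPTIONS (guest-components defaults); only works inside a CoCo pod.
import urllib.request

CDH_BASE = "http://127.0.0.1:8006"   # assumed confidential-data-hub listener

def cdh_resource_url(repo: str, resource_type: str, tag: str) -> str:
    """Build the CDH URL for a KBS resource kbs:///<repo>/<type>/<tag>."""
    return f"{CDH_BASE}/cdh/resource/{repo}/{resource_type}/{tag}"

url = cdh_resource_url("default", "fraud-dataset", "dataset_key")
# Inside the CoCo pod, this GET triggers attestation and returns the key:
# key = urllib.request.urlopen(url).read()
```

The GET itself is left commented out because it can only succeed inside an attested pod.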
CoCo-specific implementation steps
The main goal here is to show how much work is actually needed to convert the plain application to securely run with CoCo.
There are three main changes added to the podspec:
- runtimeClassName: kata-remote in the podspec. This single line is all it takes to enable CoCo for the pod.
- Sealed secrets. These are added to the podspec as normal secrets, but, as we saw before, they contain only a reference to the actual secret provided by Trustee.
- Persistent storage. For CoCo in a public cloud using peer-pods, the pod VM is external to the worker node while PVCs are mounted on the worker node, so there is no proper, maintainable mechanism to mount the PVC in the pod VM instead and secure it from the cluster. It is left to the application to mount the storage directly and use client-side encryption/decryption.
  - In this example, we replace the PVCs with podvm storage, meaning the CVM's encrypted disk is used to store data. However, once the pod terminates, the data is lost too.
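Putting the three changes together, a podspec for this example could look roughly like the following sketch. The image, volume names, and mount paths are illustrative assumptions, not the workshop's exact manifest:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: fraud-encrypted-datasets
spec:
  runtimeClassName: kata-remote        # change 1: the single line that enables CoCo
  containers:
  - name: notebook
    image: quay.io/example/fraud-notebook:latest   # illustrative image
    volumeMounts:
    - name: sealed-azure-sas
      mountPath: /sealed/azure-sas     # unsealed inside the pod VM after attestation
    - name: sealed-dataset-key
      mountPath: /sealed/dataset-key
    - name: scratch
      mountPath: /data                 # change 3: podvm storage instead of a PVC
  volumes:
  - name: sealed-azure-sas             # change 2: sealed secrets mounted like normal secrets
    secret:
      secretName: sealed-azure-sas
  - name: sealed-dataset-key
    secret:
      secretName: sealed-dataset-key
  - name: scratch
    emptyDir: {}                       # backed by the CVM's encrypted disk; lost on pod termination
```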
-
Add the application secrets into Trustee
Let's add the application secrets (the decryption key and the Azure credentials) into Trustee. Here we are in the trusted cluster.
If you haven't done it before, download the decryption key and upload it into Trustee. Remember that FD_SECRET_NAME=fraud-dataset.
Now let’s also add the Azure storage secret.
### Azure SAS - sealed secret
AZURE_SAS_SECRET_NAME=fraud-azure-sas
oc create secret generic $AZURE_SAS_SECRET_NAME \
--from-literal azure-sas="sp=r&st=2025-10-27T15:42:27Z&se=2028-10-27T22:57:27Z&spr=https&sv=2024-11-04&sr=b&sig=vjaRotd7de%2B3QwlzHVaHF2GVyehw1xb3fFiXe9E7YOI%3D" \
-n trustee-operator-system
And then instruct Trustee to load that secret into its deployment, by updating the KbsConfig and restarting the Trustee deployment.
echo "Default Kbsconfig - kbsSecretResources:"
oc get kbsconfig trusteeconfig-kbs-config -n trustee-operator-system -o json \
| jq '.spec.kbsSecretResources'
echo ""
oc patch kbsconfig trusteeconfig-kbs-config \
-n trustee-operator-system \
--type=json \
-p="[
{\"op\": \"add\", \"path\": \"/spec/kbsSecretResources/-\", \"value\": \"$AZURE_SAS_SECRET_NAME\"}
]"
echo ""
echo "Updated Kbsconfig - kbsSecretResources:"
oc get kbsconfig trusteeconfig-kbs-config -n trustee-operator-system -o json \
| jq '.spec.kbsSecretResources'
oc rollout restart deployment/trustee-deployment -n trustee-operator-system
You should see a fraud-azure-sas and fraud-dataset secret in the KbsConfig.
Create the sealed secret
Let's now create the sealed secret that contains the pointer to the actual secret in Trustee. Here we move to the untrusted cluster.
AZ_SECRET=$(podman run -it quay.io/confidential-devhub/coco-tools:0.3.0 /tools/secret seal vault --resource-uri kbs:///default/${AZURE_SAS_SECRET_NAME}/azure-sas --provider kbs | grep -v "Warning")
FD_SECRET_NAME=fraud-dataset
KEY_SECRET=$(podman run -it quay.io/confidential-devhub/coco-tools:0.3.0 /tools/secret seal vault --resource-uri kbs:///default/${FD_SECRET_NAME}/dataset_key --provider kbs | grep -v "Warning")
# namespace here is fraud-detection!
oc create namespace fraud-detection
oc create secret generic sealed-azure-sas --from-literal=azure-sas=$AZ_SECRET -n fraud-detection
oc create secret generic sealed-dataset-key --from-literal=key=$KEY_SECRET -n fraud-detection
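It can be instructive to look at what these sealed secrets actually contain. The sketch below assumes the token follows the CoCo guest-components sealed-secret layout (sealed.fakejwsheader.&lt;base64url JSON&gt;.fakesignature); the field names are assumptions if the tool's format differs:

```python
# Sketch: a "sealed secret" is a JWS-like token whose payload is only a
# POINTER to the real secret in Trustee -- never the secret itself.
# The token layout and field names are ASSUMPTIONS based on the CoCo
# guest-components sealed-secret convention.
import base64
import json

def unseal_pointer(sealed: str) -> dict:
    """Decode the payload segment of a sealed secret (no signature check)."""
    _, _, payload, _ = sealed.split(".")
    padded = payload + "=" * (-len(payload) % 4)   # restore base64url padding
    return json.loads(base64.urlsafe_b64decode(padded))

# Build a sample token the way the seal tool would (assumed structure):
body = {"version": "0.1.0", "type": "vault", "provider": "kbs",
        "name": "kbs:///default/fraud-azure-sas/azure-sas"}
token = ("sealed.fakejwsheader."
         + base64.urlsafe_b64encode(json.dumps(body).encode()).decode().rstrip("=")
         + ".fakesignature")

print(unseal_pointer(token)["name"])
```

Because the token carries only the kbs:/// resource URI, it is safe to store it as a plain Kubernetes secret in the untrusted cluster.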
Deploy the notebook
Let’s create a notebook and run it as CoCo.
This notebook specifically uses the Python SDK to download the encrypted data from Azure, for two reasons:
- It closely aligns with regular interactive AI workflows, which use Python SDKs to download data from S3, Azure, MinIO, and so on.
- It provides an example of programmatic storage access for AI workloads when using the peer-pods approach.
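The download step of the first notebook can be sketched as follows. The sealed-secret mount path, storage account, container, and blob names are assumptions for illustration, not the workshop's exact values:

```python
# Sketch of the notebook's download step: read the SAS token from the
# sealed-secret volume mount and fetch the encrypted blob with the Azure SDK.
# Mount path, account, container, and blob names are ASSUMPTIONS.
from pathlib import Path

def blob_url(account: str, container: str, blob: str, sas: str) -> str:
    """Compose a full blob URL from its parts plus a SAS query string."""
    return f"https://{account}.blob.core.windows.net/{container}/{blob}?{sas.lstrip('?')}"

sas_file = Path("/sealed/azure-sas/azure-sas")       # sealed-secret mount (assumed path)
sas = sas_file.read_text().strip() if sas_file.exists() else "sp=r&sig=placeholder"

url = blob_url("fraudstore", "datasets", "card_transactions.csv.encrypted", sas)
# Inside the pod, the actual download would be:
# from azure.storage.blob import BlobClient          # pip install azure-storage-blob
# encrypted = BlobClient.from_blob_url(url).download_blob().readall()
```

Note that the SAS token only ever exists inside the attested pod VM; the notebook code never needs it hard-coded.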
Before running this notebook, ensure that ROOT_VOLUME_SIZE in the peer-pods configmap is set to at least 20 GB, as the steps in the guide install a lot of Python packages. If you modify that value, remember as always to restart the OSC deployment!
There are two ways to deploy the notebook:
- Via an OpenShift AI (OAI) workbench: everything is handled by OpenShift AI. A Notebook object is created, and OAI takes care of deploying it and exposing it to the user. This way, we integrate CoCo with traditional OpenShift AI workbenches, which take care of most of the work.
- Via a plain Jupyter notebook: a simple custom pod, with networking handled by a custom service and route. This is the simplest and fastest way to deploy.
Openshift AI workbench
Prerequisites
PERSONA: Operational security expert
First of all, we need to make sure we can install OpenShift AI. The main requirement for OAI is worker nodes big enough to run OAI, OSC, and Trustee at the same time. In ARO, 3 workers of size Standard_D8s_v5 should be enough. If you don't have them, you can manually resize the worker nodes, deploy a new cluster with bigger workers, or add more worker nodes.
Install OAI
PERSONA: Application developer
To simplify the installation of OAI, and since it's not the focus of this workshop, we provide a script that handles it automatically:
curl -L https://raw.githubusercontent.com/confidential-devhub/workshop-on-ARO-showroom/refs/heads/main/helpers/install-oai.sh -o install-oai.sh
chmod +x install-oai.sh
./install-oai.sh
Once completed, you will receive a link to the RHOAI dashboard and also a direct link to the notebook itself.
Note: the script also creates a new image signature verification policy. The OAI workbench runs other signed images from registry.redhat.io, so we can create a new policy for them.
Now you can go through the notebook.
Plain Jupyter notebook
If you open the YAML for the plain Jupyter notebook, you will notice that, once again, the only difference from a normal deployment is runtimeClassName: kata-remote.
oc apply -f https://raw.githubusercontent.com/confidential-devhub/workshop-on-ARO-showroom/refs/heads/main/helpers/fraud-encrypted-datasets/notebook.yaml
Switch to the newly created fraud-detection namespace.
oc project fraud-detection
Wait for the pod to be created.
watch oc get pods/fraud-encrypted-datasets
The pod is ready when its STATUS is Running.
The Jupyter notebook will be available at the following URL, and the login password is aro_workshop123:
FD_ROUTE=$(oc get route fraud-encrypted-datasets-route -n fraud-detection -o jsonpath='{.spec.host}')
echo ""
echo "Click on the following URL to open the notebook in a new tab:"
echo "https://${FD_ROUTE}"
Run the notebook
Starting from fraud-detection/1_download_data.ipynb, go through the notebooks in order:
- fraud-detection/1_download_data.ipynb: download the encrypted datasets
- fraud-detection/2_decrypt_data.ipynb: decrypt the datasets
- fraud-detection/3_run_model.ipynb: run the model
- fraud-detection/4_cleanup.ipynb: clean everything up to restart the demo
Considerations
Note how a secret like /sealed/azure-value/azure-sas can be read and used in the Jupyter notebook, but if you try to oc exec into the pod and read it, it won't work.
The difference is that the notebook runs inside the container, whereas oc exec is executed from the outside. This shows exactly the CoCo threat model: an application or developer inside the CoCo pod is, of course, allowed to read the secrets granted to it. A cluster/infra/platform admin, however, is not trusted, so there is no way for them to access this data.