NeuralTrust | The leading security platform for generative AI

NeuralTrust Platform runs on any Kubernetes 1.24+ distribution — managed clouds, on-prem clusters, bare metal, k3s / RKE2, IBM Cloud Kubernetes Service, Oracle OKE, Civo, DigitalOcean Kubernetes, and so on. This guide covers the cloud-agnostic path; if you’re on EKS / AKS / GKE / OpenShift, use the dedicated guide for richer integrations:

Pick your path

Hybrid

Data Plane + TrustGate + Firewall in your cluster. Control Plane runs on NeuralTrust SaaS. Fastest to first dashboard.

Self-hosted

Full stack including Control Plane API, UI, and Scheduler in your cluster. For air-gapped, on-prem, or sovereignty mandates.

If you’re unsure which model fits your environment, see Deployment models.

Cluster prerequisites

Resource	Recommended
Kubernetes version	1.24+
CPU pool node spec	8 vCPU / 32 GiB recommended; ≥ 4 nodes for hybrid (CPU Firewall), ≥ 5 nodes for self-hosted (CPU Firewall). Subtract one when using GPU Firewall workers.
GPU pool (optional)	4 vCPU / 16 GiB + 1 × NVIDIA GPU per node — 5 nodes (one per default Firewall worker)
Storage class	A default `StorageClass` with `ReadWriteOnce` PV provisioning (`local-path`, Longhorn, Rook/Ceph, NFS CSI, etc.)
Ingress controller	NGINX, Traefik, HAProxy, or any conformant ingress
TLS	cert-manager with Let’s Encrypt or internal CA, or pre-existing TLS secrets
DNS	Any DNS provider that can resolve your domain to the ingress LB/node IPs
Image pull	`gcr-secret` (docker-registry) with `gcr-keys.json` from NeuralTrust, OR a mirrored internal registry for air-gapped
Optional	NVIDIA device plugin for GPU Firewall workers

Required cluster setup

NGINX + cert-manager
Traefik
MetalLB (bare metal)

# NGINX ingress
helm repo add ingress-nginx https://kubernetes.github.io/ingress-nginx
helm install ingress-nginx ingress-nginx/ingress-nginx \
  -n ingress-nginx --create-namespace

# cert-manager
helm repo add jetstack https://charts.jetstack.io
helm install cert-manager jetstack/cert-manager \
  -n cert-manager --create-namespace \
  --set installCRDs=true

Create a ClusterIssuer:

apiVersion: cert-manager.io/v1
kind: ClusterIssuer
metadata:
  name: letsencrypt-prod
spec:
  acme:
    server: https://acme-v02.api.letsencrypt.org/directory
    email: [email protected]
    privateKeySecretRef:
      name: letsencrypt-prod-key
    solvers:
      - http01:
          ingress:
            class: nginx

helm repo add traefik https://helm.traefik.io/traefik
helm install traefik traefik/traefik \
  -n traefik --create-namespace \
  --set ports.web.redirectTo=websecure \
  --set [email protected] \
  --set certificatesResolvers.letsencrypt.acme.httpChallenge.entryPoint=web \
  --set certificatesResolvers.letsencrypt.acme.storage=/data/acme.json

For bare-metal clusters without a cloud LB, install MetalLB to provide LoadBalancer service IPs:

kubectl apply -f https://raw.githubusercontent.com/metallb/metallb/v0.14.5/config/manifests/metallb-native.yaml

Then create an IPAddressPool for your IP range. After MetalLB is configured, NGINX/Traefik will receive an IP automatically.

Architecture

All workloads run inside your cluster. Data never leaves your environment.

Kubernetes-specific defaults

When global.platform: "kubernetes":

Ingress class: not set automatically — you set className per ingress (typically nginx, traefik, etc.).
No cloud annotations are applied automatically. You can add any annotations through ingress.annotations.
TLS: when a per-ingress secretName is set, the chart references it directly; when not set and global.ingress.tls.autoGenerate: true (default), the chart creates a shared self-signed kubernetes.io/tls secret.
Storage class: uses the cluster default unless you set global.storageClass.

Common configuration

Ingress with cert-manager

trustgate:
  ingress:
    enabled: true
    className: "nginx"
    annotations:
      cert-manager.io/cluster-issuer: "letsencrypt-prod"
      nginx.ingress.kubernetes.io/ssl-redirect: "true"

Pre-existing TLS secret

trustgate:
  ingress:
    enabled: true
    className: "nginx"
    tls:
      secretName: "wildcard-platform-example-com"

Make sure the Secret exists in the neuraltrust namespace.

Storage class

global:
  storageClass: "local-path"           # k3s default
  # storageClass: "longhorn"           # Rancher Longhorn
  # storageClass: "rook-ceph-block"    # Rook/Ceph
  # storageClass: "nfs-client"         # NFS CSI

Per-component override for ClickHouse on faster storage:

clickhouse:
  persistence:
    storageClass: "rook-ceph-block"
    size: 200Gi

GPU node label for Firewall workers

kubectl label nodes <gpu-node-name> gpu-node-pool=gpu-pool

Install the NVIDIA device plugin, then enable Firewall GPU workers (see Firewall deployment).

Backup and data lifecycle

Because there’s no cloud-managed backup story on vanilla Kubernetes, choose an explicit backup strategy:

PostgreSQL: schedule pg_dump CronJobs to your S3-compatible object store, or use CloudNativePG for built-in PITR.
ClickHouse: enable clickhouse.backup.enabled: true with any S3-compatible endpoint (MinIO, Wasabi, on-prem Ceph RGW, etc.).
Kafka: standard MirrorMaker or a managed Kafka offering.
PVC snapshots: use Velero with restic for cluster-wide backup and DR.

External-infra reference: Configuration scenarios.

Verification

kubectl get pods -n neuraltrust
kubectl get ingress -n neuraltrust -o wide

curl https://data-plane-api.platform.example.com/health

Common issues

Symptom	Likely cause	Fix
Ingress IP `pending`	No LB controller / MetalLB on bare metal	Install MetalLB or expose ingress via NodePort + external proxy
`PVC` stuck `Pending`	No default storage class	`kubectl get storageclass`; mark one default with `storageclass.kubernetes.io/is-default-class=true`
cert-manager challenge fails	DNS not resolving, or HTTP-01 challenge can’t reach ingress	Check `kubectl get certificate -A` and `kubectl describe challenge`
`ImagePullBackOff`	`gcr-secret` missing or wrong	Recreate with the JSON key from NeuralTrust; or use a mirrored internal registry
Pods OOMKilled	Under-sized nodes	Raise node spec; deploy to dedicated nodes with `nodeSelector`

Next steps

Hybrid deployment on vanilla Kubernetes — Data Plane only, Control Plane on SaaS
Self-hosted deployment on vanilla Kubernetes — full stack including CP
Deployment models — hybrid vs self-hosted in depth
Feature flags reference — local vs external Postgres/Redis/Kafka/CH, image registry, storage, secrets
Image catalog — what runs where, including air-gapped mirroring
Firewall deployment — GPU workers on vanilla Kubernetes
Secrets management — auto-generation, External Secrets Operator

​Pick your path

Hybrid

Self-hosted

​Cluster prerequisites

​Required cluster setup

​Architecture

​Kubernetes-specific defaults

​Common configuration

​Ingress with cert-manager

​Pre-existing TLS secret

​Storage class

​GPU node label for Firewall workers

​Backup and data lifecycle

​Verification

​Common issues

​Next steps