NeuralTrust Platform runs on Amazon EKS using AWS-native primitives — Application Load Balancer for ingress, ACM for certificates, EBS for persistent storage, IRSA for IAM, and Route 53 for DNS.Documentation Index
Fetch the complete documentation index at: https://docs.neuraltrust.ai/llms.txt
Use this file to discover all available pages before exploring further.
Pick your path
Hybrid (recommended)
Data Plane + TrustGate + Firewall in your EKS cluster. Control Plane runs on NeuralTrust SaaS. Fastest to first dashboard.
Self-hosted
Full stack including Control Plane API, UI, and Scheduler in your EKS cluster. For sovereignty and air-gapped requirements.
Cluster prerequisites
| Resource | Recommended starting point |
|---|---|
| EKS version | 1.28 or newer |
| CPU pool node type | m5.2xlarge or m6i.2xlarge (8 vCPU / 32 GiB) |
| Min CPU nodes | ≥ 4 for hybrid (CPU Firewall), ≥ 5 for self-hosted (CPU Firewall). Subtract one when using GPU Firewall workers. ≥ 3 AZs for HA. |
| GPU pool (optional) | g4dn.xlarge (4 vCPU / 16 GiB / 1 × T4) — 5 nodes (one per default Firewall worker) |
| VPC | At least 3 private subnets across 3 AZs |
| Storage | EBS CSI driver installed; gp3 storage class |
| Ingress | AWS Load Balancer Controller v2.6+ |
| DNS | Route 53 (or any DNS provider) for the platform base domain |
| Certificates | ACM certificate covering *.<your-domain> |
m5.xlarge (4 vCPU / 16 GiB) workers also work but require 7–8 nodes to fit the same workload. See Deployment models › Sizing baseline for the math.
For GPU Firewall workers, add a managed node group with g5.xlarge / g6.xlarge and the NVIDIA device plugin.
Required cluster add-ons
Architecture
All workloads run inside your AWS account and VPC. Data never leaves your environment.AWS-specific defaults
Whenglobal.platform: "aws":
- Ingress class:
alb(AWS Load Balancer Controller v2). - TLS: prefers ACM via
alb.ingress.kubernetes.io/certificate-arn. When no ARN is set, the chart provisions a self-signedkubernetes.io/tlssecret. - Service annotations:
alb.ingress.kubernetes.io/target-type: ipfor pod-direct routing; configurable scheme, target group, WAF, etc. - Storage class:
gp3recommended for cost/perf;io2for ClickHouse high-throughput.
Common configuration
ACM certificates
*.<your-domain>) so every platform hostname terminates against the same cert.
Storage class
io2:
Internal-only ingress
For VPC-internal endpoints (no internet exposure):IRSA for managed services
For Cloud SQL alternatives (Aurora, RDS) and S3-backed ClickHouse backups, prefer IRSA over static credentials. Annotate the chart service accounts with the IAM role ARN:GPU node group for Firewall workers
Region availability
NeuralTrust runs in any AWS commercial region with EKS support. Choose the region closest to your traffic and target LLM endpoints, or one that meets your data-residency obligations. For GovCloud or specific compliance regions, contact [email protected].Backup and data lifecycle
For production, configure backups against the persistent stores rather than relying on EBS snapshots alone:- PostgreSQL: use Amazon RDS for PostgreSQL externally; disable
neuraltrust-control-plane.infrastructure.postgresql.deploy. - ClickHouse: enable
clickhouse.backup.enabled: truewith S3 storage (backup.storage.s3.endpoint), or run ClickHouse Cloud externally. - Kafka: use MSK and set
infrastructure.kafka.deploy: false.
Verification
Common issues
| Symptom | Likely cause | Fix |
|---|---|---|
| Ingress doesn’t get an ALB | AWS LB Controller not installed or wrong IAM | Check kube-system logs for the controller |
Subnet kubernetes.io/role/elb tag missing | ALB Controller can’t find target subnets | Tag at least 2 subnets with the correct role label |
PVC stuck Pending | EBS CSI driver missing or wrong IAM | Verify driver installed and IRSA role attached |
ImagePullBackOff | Missing or wrong gcr-secret | Recreate the secret with the JSON key from NeuralTrust |
Next steps
- Hybrid deployment on EKS — Data Plane only, Control Plane on SaaS
- Self-hosted deployment on EKS — full stack including CP
- Deployment models — hybrid vs self-hosted in depth
- Feature flags reference — local vs external Postgres/Redis/Kafka/CH, image registry, storage, secrets
- Image catalog — what runs where
- Firewall deployment — GPU workers on EKS