Unified control plane for LLM traffic — model routing, team token budgets, guardrails, and observability via Kubernetes-style CRDs.
| Name | Status |
|---|---|
| gpt-4-turbo | Active |
| claude-3 | Active |
| llama-3.1-70b | Shadow |
Visualize how your configuration affects traffic flow and enforcement.
Create teams, assign budgets, and apply rate limits.
Add upstream models, providers, and per-model controls.
Toggle enforcement modules and configure thresholds.
Enable tracing/metrics/logging exports and sampling.