This validated performance benchmark shows how F5 BIG-IP Next for Kubernetes, deployed on NVIDIA BlueField-3 DPUs, improves AI inference economics by increasing token throughput, reducing time to first token, and lowering end-to-end latency compared with widely used open-source and commercial data plane solutions.
Minimize latency and maximize return on investment.
Protect AI workloads without compromise.
Improve throughput per watt and reduce power consumption.