NVIDIA inference software cuts token cost 5x on Blackwell platform in one month
NVIDIA has achieved a 5x reduction in token costs for DeepSeek v4 on its Blackwell platform within just one month of the model's release. Leading AI companies including Baseten, Cognition, Deep Infra, and Together AI are already leveraging these full-stack inference software improvements to deliver superior performance across reasoning, coding, and large-scale workloads.