The Software Frontier

The Software Frontier

The Llama 3.3 70B Benchmark Problem

What a single H100 SXM5 can and cannot do with Llama 3.3 70B at FP8, a first-principles audit of vLLM, SGLang, and TensorRT-LLM, with the deployment decisions that follow

Lorenzo Bradanini's avatar
Lorenzo Tettamanti's avatar
Lorenzo Bradanini and Lorenzo Tettamanti
May 28, 2026
∙ Paid

Introduction

User's avatar

Continue reading this post for free, courtesy of Lorenzo Bradanini.

Or purchase a paid subscription.
© 2026 Lorenzo Bradanini · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture