Skip to content

2026

Diagnosing Inference Energy Consumption with the ML.ENERGY Leaderboard v3.0

With The ML.ENERGY Benchmark v3.0 we released in December 2025, we expanded our scope to up-to-date important models, tasks, and GPU hardware. This included 46 models across 7 tasks, producing 1,858 configurations on NVIDIA H100 and B200 GPUs.1 As always, latest benchmarking results are public and can be browsed on The ML.ENERGY Leaderboard.

In this post, we first present empirical observations from measurements, and then develop a reasoning framework that explains why we observe certain energy behaviors.