LLM Inference Energy: A Longitudinal Analysis
The ML.ENERGY Leaderboard went from v2.0 (September 2024) to v3.0 (December 2025) with major changes: up-to-date models, hardware, software, and datasets. The v3.0 blog post covered the details of the v3.0 results, but how to they compare to the times v2.0? Are we making progress on energy efficiency? In this short post, we would like to look at the impact of software optimizations on energy efficiency over time, using the Llama 3.1 family as a case study.