The original was posted on /r/machinelearning by /u/pmv143 on 2025-09-10 19:39:51+00:00.
NVIDIA dropped MLPerf results for Blackwell Ultra yesterday: 5× throughput on DeepSeek-R1 over Hopper, record runs on Llama 3.1 and Whisper, plus some clever tricks like FP8 KV-cache and disaggregated serving. The raw numbers are insane.
But I wonder whether these benchmark wins actually translate into lower real-world inference costs.
In practice, workloads are bursty: GPUs sit idle, batching only helps with steady traffic, and orchestration across models is messy. You can have the fastest chip in the world, but if it's underutilized 70% of the time, the economics don't look so great to me.
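
To make the utilization point concrete, here's a minimal back-of-envelope sketch in Python. All the numbers (hourly prices, peak throughputs, utilization fractions) are hypothetical, not from NVIDIA's results:

```python
# Back-of-envelope cost model: effective cost per token depends on
# utilization as much as on raw peak throughput.
# All figures below are made up for illustration.

def cost_per_million_tokens(gpu_hourly_usd: float,
                            peak_tokens_per_sec: float,
                            utilization: float) -> float:
    """Cost to serve 1M tokens given a GPU's hourly price, its peak
    throughput, and the fraction of time it does useful work."""
    effective_tps = peak_tokens_per_sec * utilization
    tokens_per_hour = effective_tps * 3600
    return gpu_hourly_usd / tokens_per_hour * 1_000_000

# A chip with 5x the peak throughput, but running at 30% utilization
# on bursty traffic, vs. a cheaper baseline kept at 80% utilization:
fast = cost_per_million_tokens(gpu_hourly_usd=10.0,
                               peak_tokens_per_sec=5000,
                               utilization=0.30)
base = cost_per_million_tokens(gpu_hourly_usd=4.0,
                               peak_tokens_per_sec=1000,
                               utilization=0.80)
print(f"fast chip:     ${fast:.2f} per 1M tokens")   # ~ $1.85
print(f"baseline chip: ${base:.2f} per 1M tokens")   # ~ $1.39
```

With these (hypothetical) inputs, the 5×-faster chip ends up more expensive per token than the slower one, which is exactly the gap benchmark throughput numbers don't capture.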