본문으로 건너뛰기
All news

NVIDIA Kicks Off Blackwell Ultra B300 Mass Production — 50x Hopper Throughput per Watt

Summary: NVIDIA started mass production of its Blackwell Ultra B300 GPU on June 12 — the next data-center generation designed explicitly for agentic AI inference and long-context reasoning workloads.

Key Facts

  • 288 GB HBM3e on the B300, up from 192 GB on the B200 — essential headroom for running larger models and longer context windows
  • DGX B300 delivers 192 petaFLOPS for inference and 70 petaFLOPS for training
  • 50x higher throughput per megawatt and 35x lower cost per token vs. NVIDIA Hopper on low-latency agentic workloads
  • Production start triggers the H2 2026 data-center build cycle; first deployments expected in enterprise AI clusters before year-end

Why It Matters

As frontier AI shifts from chatbots toward multi-step reasoning agents, inference efficiency — not just peak benchmark scores — determines which providers can profitably serve at scale. The B300's economics change the math on what it costs to run an AI agent continuously, directly affecting the price floor for AI services and the competitive position of hyperscalers racing to build out capacity.

Read More

뉴스레터 구독

곧 오픈 예정 (Coming soon)

매일 AI 뉴스를 메일로 받아보세요

매일 아침 AI·LLM 핵심 소식을 받아보실 수 있어요.