본문으로 건너뛰기
All news

OpenAI and Broadcom Unveil Jalapeño, a Custom LLM Inference Chip

One-liner: OpenAI and Broadcom introduced Jalapeño, OpenAI's first custom AI accelerator — a reticle-sized ASIC designed from the ground up for LLM inference, not repurposed from training hardware.

Key Facts

  • Full-reticle ASIC purpose-built for LLM inference; designed in nine months using OpenAI's own AI models to accelerate chip development
  • Developed with Broadcom (manufacturer) and Celestica (production scale-up) — early testing shows better performance per watt than current state-of-the-art chips
  • Microsoft will purchase 40% of initial production; full deployment targeted for late 2026
  • OpenAI aims for 10 gigawatts of Jalapeño-powered compute by 2029, roughly the output of ten nuclear reactors

Why It Matters

This is OpenAI's clearest move yet toward hardware self-sufficiency at the inference layer. Training still depends on Nvidia GPUs, but inference is where serving costs compound at scale — every ChatGPT query runs it. A purpose-built inference ASIC could meaningfully reshape OpenAI's unit economics as it approaches its IPO, while signaling that Nvidia's dominance at the edge of the AI stack is no longer guaranteed.

Read More

뉴스레터 구독

무료 뉴스레터

매주 핵심 AI 소식, 한 번에 받기

쏟아지는 AI·LLM 뉴스 중 꼭 알아야 할 것만 골라 메일로 보내드려요. 뉴스레터 발송이 시작되면 구독자분들께 가장 먼저 보내드립니다.