OpenAI introduces its first AI chip designed for LLM workloads: Check details
OpenAI has officially introduced Jalapeno, its first custom AI chip built for large language models (LLMs) workloads.
he chip has been developed in partnership with Broadcom
OpenAI and Broadcom developed Jalapeno from the initial design to manufacturing tape-out in just nine months.
OpenAI has officially introduced Jalapeno, its first custom AI chip built for large language models (LLMs) workloads. The chip has been developed in partnership with Broadcom and marks a major step in OpenAI’s effort to build more of the technology behind ChatGPT and its other AI products. According to the company, Jalapeno is not a general-purpose AI chip. Instead, it has been designed from the ground up for AI inference.
SurveyOpenAI describes Jalapeno as its “first Intelligence Processor: an accelerator architected around OpenAI’s vision for the future of LLM inference, and the first AI accelerator in a multi-generation compute platform the companies are building together to make advanced AI faster, more reliable, and more accessible to more people.”
Also read: GTA 6 pre orders now live: India price, benefits and other details
OpenAI said engineering samples of Jalapeno are already running AI workloads in its labs, including GPT-5.3-Codex-Spark. While the company is still testing the final performance, early results show that the chip offers significantly better performance per watt than current state-of-the-art. A detailed technical report will be shared in the coming months.
“Jalapeno is part of our long-term full-stack infrastructure strategy to make compute more abundant, resulting in AI which is faster, more reliable, more affordable for people and businesses, and can be used to solve more important problems,” said Greg Brockman, President and Co-Founder of OpenAI. “By designing more of the stack ourselves, we can serve more intelligence with greater efficiency and keep pushing advanced AI toward broader access.”
One of the biggest highlights is the speed of development. OpenAI and Broadcom developed Jalapeno from the initial design to manufacturing tape-out in just nine months. The companies believe this is the fastest ASIC development cycle ever achieved for advanced high-performance semiconductor chips.
“If AI can help engineers design better chips faster, it can lower the cost of compute across the industry and help democratize access to advanced AI,” OpenAI said.
According to OpenAI, Jalapeno is the “first step in a multi-generation compute platform.” The company plans to start deploying the new platform by the end of 2026.
“The point of this work is simple: inference is where AI reaches people. Every improvement in cost, speed, and reliability can show up as a faster ChatGPT answer, a Codex task that can take more steps with less waiting, an API product that is cheaper to build, or more dependable access when demand is high,” the company said.
Ayushi works as Chief Copy Editor at Digit, covering everything from breaking tech news to in-depth smartphone reviews. Prior to Digit, she was part of the editorial team at IANS. View Full Profile
