Abstract illustration of Nvidia’s new inference chip architecture

Nvidia Just Admitted GPUs Aren’t Enough — Its $20B Groq Bet Changes Everything

For a decade, Nvidia sold the world a simple story: GPUs are all you need. Training? GPUs. Inference? Also GPUs. That story built a $3 trillion empire. On March 16 at GTC 2026 in San Jose, Jensen Huang is expected to blow it up himself. Nvidia will reportedly unveil a dedicated inference processor — not a GPU — built on technology from Groq, the inference startup it absorbed in a $20 billion deal last December. OpenAI is lined up as the first major customer. And the implications for the entire AI hardware ecosystem are enormous. ...

March 4, 2026 · 6 min · DBBS Tech
Abstract illustration of AI model architecture hard-wired into silicon

Taalas HC1: The Chip That Bakes AI Models Directly Into Silicon at 17,000 Tokens Per Second

What if instead of running an AI model on a chip, you turned the model into the chip? That’s the bet Taalas just went public with — and the numbers are making the entire semiconductor industry sit up straight. This 25-person startup out of Toronto emerged from stealth with $169 million in funding and a working product called the HC1: a chip that hard-wires a large language model directly into silicon transistors. No software stack. No HBM memory. No liquid cooling. Just raw, physics-level inference. ...

February 22, 2026 · 5 min · DBBS Tech