Google's TurboQuant Compresses AI Memory by 6x — And It's Crashing Chip Stocks
Remember Pied Piper from HBO’s Silicon Valley? The fictional startup that built a compression algorithm so good it basically broke the internet? Google just built the real thing. Except instead of compressing video files, it’s compressing AI’s brain.

On Tuesday, Google Research unveiled TurboQuant, a compression algorithm that reduces the memory footprint of large language models by at least 6x while delivering up to 8x faster performance on Nvidia H100 GPUs. The kicker: zero accuracy loss. ...
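To put the 6x figure in concrete terms, here is a back-of-the-envelope sketch in Python. It only does the arithmetic behind the headline number and is not Google's algorithm. The 48 GB starting footprint and the 16-bit baseline precision are assumptions chosen for illustration, and the helper names are hypothetical.

```python
def compressed_size_gb(original_gb: float, ratio: float) -> float:
    """Return the memory footprint after compressing by `ratio`."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return original_gb / ratio


def effective_bits_per_value(baseline_bits: int, ratio: float) -> float:
    """Return the average bits stored per value after compression."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return baseline_bits / ratio


# Assumed inputs for illustration only; neither figure comes from Google.
original_gb = 48.0    # hypothetical memory footprint before compression
baseline_bits = 16    # assumes values start out in a 16-bit format
ratio = 6.0           # the "at least 6x" reduction reported above

print(f"{original_gb:.0f} GB -> {compressed_size_gb(original_gb, ratio):.0f} GB")
print(f"about {effective_bits_per_value(baseline_bits, ratio):.2f} bits per value")
```

Under those assumptions, a 6x reduction means each stored value averages fewer than 3 bits instead of 16, which is why the same hardware could hold several times as much model state.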