Huffman Coding Data Compression

Nvidia says it can shrink LLM memory 20x without changing model weights

Nvidia researchers have introduced a new technique that dramatically reduces how much memory large language models need to track conversation history — by as much as 20x — without modifying the model ...

InfoQ

Google’s TurboQuant Compression May Support Faster Inference, Same Accuracy on Less Capable Hardware

Google Research unveiled TurboQuant, a novel quantization algorithm that compresses large language models’ Key-Value caches ...

TechCrunch

Google unveils TurboQuant, a new AI memory compression algorithm — and yes, the internet is calling it ‘Pied Piper’

If Google’s AI researchers had a sense of humor, they would have called TurboQuant, the new, ultra-efficient AI memory compression algorithm announced Tuesday, “Pied Piper” — or, at least that’s what ...

Mashable

Google AI breakthrough shows why we don't need more data centers

We have seen the future of AI via Large Language Models. And it's smaller than you think. That much was clear in 2025, when we first saw China's DeepSeek — a slimmer, lighter LLM that required way ...

Bloomberg L.P.

AI Coding Startup Cursor Plans New Model to Rival Anthropic, OpenAI

Cursor, a leading artificial intelligence startup for coding, is set to release a more efficient AI model for software development in a bid to keep pace with larger firms like Anthropic PBC and OpenAI ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results