TurboQuant Doesn’t Work on Qwen 3.5 — So I Made It Work
How I built a routing proxy that brings Google’s TurboQuant compression to hybrid-attention models running locally on a Mac.

A month ago, Google published a blog post about a paper...