Implementing TurboQuant in llama.cpp: CUDA Scars and What Actually Ships

3 weeks of porting TurboQuant to CUDA, 5 scars, and what actually ships for document processing on T4 GPUs

Apr 6, 2026 · 11 min read