PRODUCT · COMPRESSION
TurboQuant
Two-stage extreme quantisation algorithm — 3-bit zero-loss KV-cache compression with no training.
CAPABILITIES
- PolarQuant random rotation
- QJL 1-bit residual correction
- Near-optimal distortion across bit-widths
- Deployable in real-time production
STACK
TritonCUDAPyTorch