PRODUCT · COMPRESSION

TurboQuant

Two-stage extreme quantisation algorithm: 3-bit KV-cache compression with zero accuracy loss and no training required.

CAPABILITIES
  • PolarQuant random rotation
  • QJL 1-bit residual correction
  • Near-optimal distortion across bit-widths
  • Deployable in real-time production
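
A minimal sketch of how the two stages could fit together, for illustration only. It is not the TurboQuant implementation: the randomised Hadamard transform stands in for the random rotation, a plain uniform scalar quantiser stands in for PolarQuant's quantiser, and the 2 + 1 split of the 3-bit budget is an assumption. All function names are hypothetical.

```python
import math
import random

def fwht(v):
    """Normalised fast Walsh-Hadamard transform (orthogonal and self-inverse).
    len(v) must be a power of two."""
    v, n, h = list(v), len(v), 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a, b = v[j], v[j + h]
                v[j], v[j + h] = a + b, a - b
        h *= 2
    return [x / math.sqrt(n) for x in v]

def rotate(v, signs):
    # Stage 1a: random sign flips, then Hadamard (a cheap random rotation).
    return fwht([x * s for x, s in zip(v, signs)])

def unrotate(v, signs):
    # Inverse of rotate(): Hadamard first, then the same sign flips.
    return [x * s for x, s in zip(fwht(v), signs)]

def quantise(x, bits):
    # Stage 1b: uniform scalar quantiser over [-c, c] (assumed stand-in).
    levels = 2 ** bits
    c = max(abs(t) for t in x) or 1.0
    step = 2 * c / levels
    return [min(levels - 1, int((t + c) / step)) for t in x], c

def dequantise(codes, c, bits):
    step = 2 * c / 2 ** bits
    return [-c + (k + 0.5) * step for k in codes]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def compress(key, signs, S, bits=2):
    y = rotate(key, signs)
    codes, c = quantise(y, bits)
    resid = [a - b for a, b in zip(y, dequantise(codes, c, bits))]
    # Stage 2: QJL-style 1-bit code, the sign of each Gaussian projection
    # of the residual, plus the residual norm as one extra scalar.
    sign_bits = [1 if dot(row, resid) >= 0 else -1 for row in S]
    return codes, c, sign_bits, math.sqrt(dot(resid, resid))

def estimate_dot(query, comp, signs, S, bits=2):
    codes, c, sign_bits, rnorm = comp
    q = rotate(query, signs)  # rotations preserve inner products
    coarse = dot(q, dequantise(codes, c, bits))
    # For Gaussian s: E[sign(s.r) * (s.q)] = sqrt(2/pi) * <q, r> / ||r||,
    # so rescaling the average gives an unbiased estimate of <q, r>.
    acc = sum(b * dot(row, q) for b, row in zip(sign_bits, S))
    return coarse + math.sqrt(math.pi / 2) * rnorm / len(S) * acc

rng = random.Random(0)
d = 8
signs = [rng.choice((-1.0, 1.0)) for _ in range(d)]
S = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(d)]
key = [rng.gauss(0, 1) for _ in range(d)]
query = [rng.gauss(0, 1) for _ in range(d)]

comp = compress(key, signs, S)
exact = dot(query, key)
approx = estimate_dot(query, comp, signs, S)
print(exact, approx)
```

With one sign bit per coordinate (S is d × d here), each key costs 2 + 1 = 3 bits per coordinate plus two scalars. Nothing is fitted to data: the rotation and projection are drawn once from a seeded generator, which is how a scheme of this shape can run without training.
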
STACK
Triton · CUDA · PyTorch