KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks
Paper β’ 2606.03458 β’ Published β’ 49
hey dark , actually if you're keen it wouold be nice to get this into mobile apps , but i do need help with the ux