<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>KV Cache on Text Matrix</title><link>https://txtmix.com/tags/kv-cache/</link><description>Recent content in KV Cache on Text Matrix</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sat, 23 May 2026 08:55:34 +0800</lastBuildDate><atom:link href="https://txtmix.com/tags/kv-cache/index.xml" rel="self" type="application/rss+xml"/><item><title>TurboQuant+ 深度解读：LLM KV 缓存极限压缩的工程实践</title><link>https://txtmix.com/posts/tech/turboquant-plus-kv-cache-compression-guide/</link><pubDate>Thu, 23 Apr 2026 21:07:12 +0800</pubDate><guid>https://txtmix.com/posts/tech/turboquant-plus-kv-cache-compression-guide/</guid><description>&lt;h2 id="项目概览">项目概览&lt;/h2>
&lt;p>&lt;a href="https://github.com/TheTom/turboquant_plus" target="_blank" rel="noopener noreffer ">TurboQuant+&lt;/a> 是对 Google Research &lt;a href="https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/" target="_blank" rel="noopener noreffer ">TurboQuant&lt;/a> 论文（ICLR 2026）的开源实现与扩展工程。截至 2026 年 4 月，该项目已获得 &lt;strong>6,482 Stars&lt;/strong> 和 &lt;strong>872 Forks&lt;/strong>，是近期最具影响力的 LLM 推理优化开源项目之一。&lt;/p></description></item></channel></rss>