Quantization on Text Matrix

The Complete Guide to Quantization: From Principles to LLM Practice
https://txtmix.com/posts/tech/llm/quantization-llm-model-compression-guide/
Sun, 29 Mar 2026 23:28:00 +0800
<hr> <h2 id="一先看一个惊人的事实">1. First, a surprising fact</h2> <p><strong>Qwen-3-Coder-Next</strong> is an 80-billion-parameter model:</p> <ul> <li><strong>Size: 159.4GB</strong></li> <li>It needs at least 159GB of memory to run</li> <li>And this does not even count as a "large" model: frontier models reportedly exceed <strong>1 trillion</strong> parameters and need <strong>2TB+</strong> of memory</li> </ul> <p><strong>But what if I told you:</strong></p>