<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>投机解码 on Text Matrix</title><link>https://txtmix.com/tags/%E6%8A%95%E6%9C%BA%E8%A7%A3%E7%A0%81/</link><description>Recent content in 投机解码 on Text Matrix</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sat, 23 May 2026 08:55:34 +0800</lastBuildDate><atom:link href="https://txtmix.com/tags/%E6%8A%95%E6%9C%BA%E8%A7%A3%E7%A0%81/index.xml" rel="self" type="application/rss+xml"/><item><title>DFlash：块扩散模型加速LLM推理——让大模型推理速度提升2-3倍</title><link>https://txtmix.com/posts/tech/dflash-block-diffusion-speculative-decoding/</link><pubDate>Fri, 17 Apr 2026 16:35:00 +0800</pubDate><guid>https://txtmix.com/posts/tech/dflash-block-diffusion-speculative-decoding/</guid><description>&lt;h1 id="dflash块扩散模型加速llm推理">DFlash：块扩散模型加速LLM推理&lt;/h1>
&lt;blockquote>
&lt;p>&lt;strong>目标读者&lt;/strong>：LLM推理优化工程师、ML平台架构师、MLOps实践者
&lt;strong>前置知识&lt;/strong>：深度学习基础、LLM原理、对投机解码有基本了解
&lt;strong>技术栈&lt;/strong>：Python / PyTorch / vLLM / SGLang / Transformers / MLX
&lt;strong>难度定位&lt;/strong>：⭐⭐⭐⭐ 专家设计&lt;/p></description></item></channel></rss>