<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Flash Attention on Text Matrix</title><link>https://txtmix.com/tags/flash-attention/</link><description>Recent content in Flash Attention on Text Matrix</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sat, 23 May 2026 08:55:34 +0800</lastBuildDate><atom:link href="https://txtmix.com/tags/flash-attention/index.xml" rel="self" type="application/rss+xml"/><item><title>Flash Attention: 40K Stars · Invented by Tri Dao · 2-4x Speedup · O(N) Memory</title><link>https://txtmix.com/posts/tech/flash-attention-fast-exact-attention-guide/</link><pubDate>Sun, 12 Apr 2026 02:31:39 +0800</pubDate><guid>https://txtmix.com/posts/tech/flash-attention-fast-exact-attention-guide/</guid><description>&lt;h1 id="flash-attention40k-starstri-dao发明2-4倍加速on内存transformer标配llamamistralcodellama内置">Flash Attention: 40K Stars · Invented by Tri Dao · 2-4x Speedup · O(N) Memory · Standard in Transformers · Built into Llama/Mistral/CodeLlama&lt;/h1>
&lt;h2 id="一项目概述">1. Project Overview&lt;/h2>
&lt;h3 id="11-flash-attention-是什么">1.1 What Is Flash Attention&lt;/h3>
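&lt;p>&lt;strong>Flash Attention&lt;/strong> is a &lt;strong>fast, memory-efficient, exact attention algorithm&lt;/strong> invented by &lt;strong>Tri Dao&lt;/strong> (Stanford University). "Exact" means it computes the same output as standard attention rather than an approximation.&lt;/p></description></item></channel></rss>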