<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Policy Gradient on Text Matrix</title><link>https://txtmix.com/tags/policy-gradient/</link><description>Recent content in Policy Gradient on Text Matrix</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sat, 23 May 2026 08:55:34 +0800</lastBuildDate><atom:link href="https://txtmix.com/tags/policy-gradient/index.xml" rel="self" type="application/rss+xml"/><item><title>Mathematical Foundations of Reinforcement Learning：强化学习的数学基石——从入门到精通的完整指南</title><link>https://txtmix.com/posts/tech/mathematical-foundations-of-reinforcement-learning-book/</link><pubDate>Fri, 17 Apr 2026 16:05:00 +0800</pubDate><guid>https://txtmix.com/posts/tech/mathematical-foundations-of-reinforcement-learning-book/</guid><description>&lt;h1 id="mathematical-foundations-of-reinforcement-learning强化学习的数学基石">Mathematical Foundations of Reinforcement Learning：强化学习的数学基石&lt;/h1>
&lt;blockquote>
&lt;p>&lt;strong>目标读者&lt;/strong>：计算机科学/人工智能研究生、RL研究者、工程师
&lt;strong>前置知识&lt;/strong>：概率论、线性代数基础
&lt;strong>特色&lt;/strong>：网格世界（Grid World）贯穿全书的统一示例，数学严谨但叙述友好
&lt;strong>难度定位&lt;/strong>：⭐⭐⭐⭐ 专家设计&lt;/p></description></item></channel></rss>