<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>推理服务器 on Text Matrix</title><link>https://txtmix.com/tags/%E6%8E%A8%E7%90%86%E6%9C%8D%E5%8A%A1%E5%99%A8/</link><description>Recent content in 推理服务器 on Text Matrix</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sat, 23 May 2026 08:55:34 +0800</lastBuildDate><atom:link href="https://txtmix.com/tags/%E6%8E%A8%E7%90%86%E6%9C%8D%E5%8A%A1%E5%99%A8/index.xml" rel="self" type="application/rss+xml"/><item><title>oMLX：macOS菜单栏管理13k星的LLM推理服务器，连续批处理+SSD缓存</title><link>https://txtmix.com/posts/tech/omlx-apple-silicon-llm-inference-server/</link><pubDate>Mon, 11 May 2026 13:10:00 +0800</pubDate><guid>https://txtmix.com/posts/tech/omlx-apple-silicon-llm-inference-server/</guid><description>&lt;blockquote>
&lt;p>&amp;ldquo;我试过的每个 LLM 服务器都要我在便利性和控制性之间二选一。我想把常用模型常驻内存，把重的模型自动 swap 到 SSD，还能设置上下文限制——全部从菜单栏管理。这就是我造 oMLX 的原因。&amp;rdquo;&lt;/p></description></item></channel></rss>