<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <id>https://matrixhub.ai/zh-CN/blog</id>
    <title>MatrixHub Blog</title>
    <updated>2026-04-27T00:00:00.000Z</updated>
    <generator>https://github.com/jpmonette/feed</generator>
    <link rel="alternate" href="https://matrixhub.ai/zh-CN/blog"/>
    <subtitle>MatrixHub Blog</subtitle>
    <icon>https://matrixhub.ai/zh-CN/img/favicon.ico</icon>
    <entry>
        <title type="html"><![CDATA[DeepSeek v4 跑不起来？99% 的人都卡在分发]]></title>
        <id>https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution</id>
        <link href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution"/>
        <updated>2026-04-27T00:00:00.000Z</updated>
        <summary type="html"><![CDATA[为什么企业环境里的 DeepSeek 落地经常卡在分发层，以及 MatrixHub 能解决什么问题。]]></summary>
        <content type="html"><![CDATA[<p>最近 DeepSeek 发布了 DeepSeek v4，不少团队都在第一时间尝试接入。</p>
<p>但如果你是在企业环境，尤其是内网或私有化部署里，很快就会发现一件事：</p>
<blockquote>
<p>模型不是最大的问题，分发才是。</p>
</blockquote>
<p>我们在内网落地 DeepSeek v4，踩了一堆坑，整理下来，本质其实就三类问题。</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="一你以为是下载问题其实是架构问题">一、你以为是“下载问题”，其实是架构问题<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E4%B8%80%E4%BD%A0%E4%BB%A5%E4%B8%BA%E6%98%AF%E4%B8%8B%E8%BD%BD%E9%97%AE%E9%A2%98%E5%85%B6%E5%AE%9E%E6%98%AF%E6%9E%B6%E6%9E%84%E9%97%AE%E9%A2%98" class="hash-link" aria-label="一、你以为是“下载问题”，其实是架构问题的直接链接" title="一、你以为是“下载问题”，其实是架构问题的直接链接" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="hugging-face-在企业环境并不好用">Hugging Face 在企业环境并不好用<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#hugging-face-%E5%9C%A8%E4%BC%81%E4%B8%9A%E7%8E%AF%E5%A2%83%E5%B9%B6%E4%B8%8D%E5%A5%BD%E7%94%A8" class="hash-link" aria-label="Hugging Face 在企业环境并不好用的直接链接" title="Hugging Face 在企业环境并不好用的直接链接" translate="no">​</a></h3>
<ul>
<li class="">网络不稳定甚至断网</li>
<li class="">下载慢，大文件容易中断</li>
<li class="">权限不可控</li>
</ul>
<p>看起来是“下载慢”，本质是：</p>
<blockquote>
<p>Hugging Face 不是为企业分发设计的，它的设计目标是研究协作，不是企业分发。</p>
</blockquote>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="二你开始自救但问题更大">二、你开始自救，但问题更大<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E4%BA%8C%E4%BD%A0%E5%BC%80%E5%A7%8B%E8%87%AA%E6%95%91%E4%BD%86%E9%97%AE%E9%A2%98%E6%9B%B4%E5%A4%A7" class="hash-link" aria-label="二、你开始自救，但问题更大的直接链接" title="二、你开始自救，但问题更大的直接链接" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="常见方案都会踩坑">常见方案都会踩坑<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E5%B8%B8%E8%A7%81%E6%96%B9%E6%A1%88%E9%83%BD%E4%BC%9A%E8%B8%A9%E5%9D%91" class="hash-link" aria-label="常见方案都会踩坑的直接链接" title="常见方案都会踩坑的直接链接" translate="no">​</a></h3>
<ul>
<li class="">手动拷贝会带来版本混乱，也不可审计</li>
<li class="">NFS 和 NAS 会遇到 IO 瓶颈，而且没有缓存层</li>
<li class="">每台机器各自下载会迅速耗尽带宽，冷启动也会更慢</li>
</ul>
<p>尤其在 vLLM 和 SGLang 场景下：</p>
<blockquote>
<p>每个节点都在重复下载模型，会把带宽压力放大 N 倍。</p>
</blockquote>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="三真正的问题其实只有一个">三、真正的问题其实只有一个<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E4%B8%89%E7%9C%9F%E6%AD%A3%E7%9A%84%E9%97%AE%E9%A2%98%E5%85%B6%E5%AE%9E%E5%8F%AA%E6%9C%89%E4%B8%80%E4%B8%AA" class="hash-link" aria-label="三、真正的问题其实只有一个的直接链接" title="三、真正的问题其实只有一个的直接链接" translate="no">​</a></h2>
<p>所有问题，本质都可以归结为一句话：</p>
<blockquote>
<p>缺一个“模型分发基础设施层”，就像容器依赖镜像仓库一样。</p>
</blockquote>
<p>就像你不会在生产里直接用 Docker Hub，而是会用私有镜像仓库一样。但在模型领域，这一层长期是缺失的。</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="四我们的解法">四、我们的解法<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E5%9B%9B%E6%88%91%E4%BB%AC%E7%9A%84%E8%A7%A3%E6%B3%95" class="hash-link" aria-label="四、我们的解法的直接链接" title="四、我们的解法的直接链接" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="核心思路">核心思路<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E6%A0%B8%E5%BF%83%E6%80%9D%E8%B7%AF" class="hash-link" aria-label="核心思路的直接链接" title="核心思路的直接链接" translate="no">​</a></h3>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">公网模型源（Hugging Face）</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">        ↓</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">模型代理 / 缓存层</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">        ↓</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">企业内部统一分发</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">        ↓</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">vLLM / 推理服务</span><br></span></code></pre></div></div>
<p>这个架构其实复用了一个已经被验证过的模式：</p>
<ul>
<li class="">Docker -&gt; Docker Hub -&gt; Harbor</li>
<li class="">Maven -&gt; Central -&gt; Nexus</li>
<li class="">PyPI -&gt; pip -&gt; 私有仓库</li>
</ul>
<p>模型分发，本质是同一类问题。</p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="关键能力">关键能力<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E5%85%B3%E9%94%AE%E8%83%BD%E5%8A%9B" class="hash-link" aria-label="关键能力的直接链接" title="关键能力的直接链接" translate="no">​</a></h3>
<p>这个分发层需要：</p>
<ol>
<li class="">代理 Hugging Face，而不是替代它</li>
<li class="">自动缓存模型</li>
<li class="">支持断点续传</li>
<li class="">支持权限控制</li>
<li class="">支持内网分发</li>
<li class="">兼容 vLLM 和 SGLang</li>
</ol>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="五我们把它做成了一个项目">五、我们把它做成了一个项目<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E4%BA%94%E6%88%91%E4%BB%AC%E6%8A%8A%E5%AE%83%E5%81%9A%E6%88%90%E4%BA%86%E4%B8%80%E4%B8%AA%E9%A1%B9%E7%9B%AE" class="hash-link" aria-label="五、我们把它做成了一个项目的直接链接" title="五、我们把它做成了一个项目的直接链接" translate="no">​</a></h2>
<p><a href="https://github.com/matrixhub-ai/matrixhub" target="_blank" rel="noopener noreferrer" class="">MatrixHub</a> 本质上就是：</p>
<blockquote>
<p>企业版 Hugging Face 代理 + 模型分发加速层。</p>
</blockquote>
<p>它提供：</p>
<ul>
<li class="">Hugging Face 代理，解决公网访问问题</li>
<li class="">模型缓存层，减少重复下载</li>
<li class="">企业统一接入入口，处理权限和治理</li>
</ul>
<p>你可以把它理解为：</p>
<ul>
<li class="">模型领域的 Harbor</li>
<li class="">或者 AI 时代的镜像仓库</li>
</ul>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="六快速上手">六、快速上手<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E5%85%AD%E5%BF%AB%E9%80%9F%E4%B8%8A%E6%89%8B" class="hash-link" aria-label="六、快速上手的直接链接" title="六、快速上手的直接链接" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-1启动服务">Step 1：启动服务<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#step-1%E5%90%AF%E5%8A%A8%E6%9C%8D%E5%8A%A1" class="hash-link" aria-label="Step 1：启动服务的直接链接" title="Step 1：启动服务的直接链接" translate="no">​</a></h3>
<p>下载 <a href="https://matrixhub.ai/deploy/docker/docker-compose.yaml" download="docker-compose.yaml"><code>docker-compose.yaml</code></a> 和 <a href="https://matrixhub.ai/deploy/docker/config.yaml" download="config.yaml"><code>config.yaml</code></a>，并保证二者在同一目录下。</p>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">docker compose -f docker-compose.yaml up -d</span><br></span></code></pre></div></div>
<p>默认服务地址：</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">http://127.0.0.1:3001</span><br></span></code></pre></div></div>
<p>验证：</p>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">curl http://127.0.0.1:3001</span><br></span></code></pre></div></div>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-2登录">Step 2：登录<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#step-2%E7%99%BB%E5%BD%95" class="hash-link" aria-label="Step 2：登录的直接链接" title="Step 2：登录的直接链接" translate="no">​</a></h3>
<ul>
<li class="">用户名：<code>admin</code></li>
<li class="">密码：<code>changeme</code></li>
</ul>
<p>建议立即修改密码。</p>
<p><img decoding="async" loading="lazy" alt="登录" src="https://matrixhub.ai/zh-CN/assets/images/login-ef198b8de99c9a9d150ae480d08fea6a.png" width="794" height="930" class="img_ev3q"></p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-3创建远程仓库">Step 3：创建远程仓库<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#step-3%E5%88%9B%E5%BB%BA%E8%BF%9C%E7%A8%8B%E4%BB%93%E5%BA%93" class="hash-link" aria-label="Step 3：创建远程仓库的直接链接" title="Step 3：创建远程仓库的直接链接" translate="no">​</a></h3>
<p>关键配置：</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">Remote URL: https://hf-mirror.com ( 或 https://huggingface.co )</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">Type: HuggingFace</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">推荐名称：huggingface</span><br></span></code></pre></div></div>
<p>作用：</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">请求 -&gt; MatrixHub -&gt; Hugging Face -&gt; 回源</span><br></span></code></pre></div></div>
<p><img decoding="async" loading="lazy" alt="远程仓库1" src="https://matrixhub.ai/zh-CN/assets/images/remote1-bb687efcb9ba5609ed56cdd89b0702f3.png" width="945" height="317" class="img_ev3q">
<img decoding="async" loading="lazy" alt="远程仓库2" src="https://matrixhub.ai/zh-CN/assets/images/remote2-6fe579276bb8b4c03dd32989f95f5792.png" width="947" height="280" class="img_ev3q">
<img decoding="async" loading="lazy" alt="远程仓库3" src="https://matrixhub.ai/zh-CN/assets/images/remote3-ce382af00273da708a071d92aa4cda21.png" width="929" height="889" class="img_ev3q">
<img decoding="async" loading="lazy" alt="远程仓库4" src="https://matrixhub.ai/zh-CN/assets/images/remote4-02a21a55c75522ffdc111f0d03b35ade.png" width="934" height="299" class="img_ev3q"></p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-4创建-proxy-项目">Step 4：创建 Proxy 项目<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#step-4%E5%88%9B%E5%BB%BA-proxy-%E9%A1%B9%E7%9B%AE" class="hash-link" aria-label="Step 4：创建 Proxy 项目的直接链接" title="Step 4：创建 Proxy 项目的直接链接" translate="no">​</a></h3>
<p>作用：</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">用户 -&gt; 代理项目 -&gt; 远程仓库（HF） -&gt; 缓存</span><br></span></code></pre></div></div>
<p>创建项目时：</p>
<ul>
<li class="">选择刚才创建的 <code>huggingface</code> 远程仓库</li>
<li class="">填写代理模型组织：<code>deepseek-ai</code></li>
</ul>
<p><img decoding="async" loading="lazy" alt="创建项目1" src="https://matrixhub.ai/zh-CN/assets/images/creprojcet1-0fc527adb0bce4ecfc45fe40f4efb851.png" width="1139" height="281" class="img_ev3q">
<img decoding="async" loading="lazy" alt="创建项目2" src="https://matrixhub.ai/zh-CN/assets/images/creprojcet2-bde671d369b2833723d93399133068f8.png" width="463" height="404" class="img_ev3q"></p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-5客户端接入">Step 5：客户端接入<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#step-5%E5%AE%A2%E6%88%B7%E7%AB%AF%E6%8E%A5%E5%85%A5" class="hash-link" aria-label="Step 5：客户端接入的直接链接" title="Step 5：客户端接入的直接链接" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">export HF_ENDPOINT="http://127.0.0.1:3001"</span><br></span></code></pre></div></div>
<p><img decoding="async" loading="lazy" alt="客户端1" src="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAisAAAApCAIAAAB7m/5dAAAdYElEQVR4nOydC1xMW/v4194zzdSkK90bqulKiqSQEpVyyf0tHOTyulVuhxPC73R0DuGcw8/t8DpHkmt0kYgT5yUUDiWVJroL3ZRGTc1tz/8zs7ONuTUUnf/P+n58PmY/8+z1rL1Ws5512+shJSQknDt3DnQrBgYG169fFwgE1dXVNBpNTU2Nw+Eo0Tc2NkZRVLmOEkgkkqWlpUAgIFIwNjbmcrkCgQC/pNPpJBKpV69eAoHA2dk5KSmJy+WWlpbq6OjQaDRnZ+dz585VVFSUl5d/Wgb++dBoNBcXFw6Hw2azCSGKov37929ubiYK6qM0uw6FQunbt29LSwuGYZ/VUFeARQeBfD6QhISEoKCg7k3UwMAgNTWVuBQKhSNGjOheEx9FWFhYcHBwfn4+m812c3OrqamZO3fuH3/8YW1tTehs3rz5+vXrPZhJCAQC+dr4LB4IQRAbGxviks/nl5WVda+Jj0JHR2fMmDGDBw9WV1fPy8tLT09vbGw0MTHR0tIidCoqKrhcbg9mEgKBQL46EhISejoLEAgEAvkaQXs6AxAIBAL5SoEeCAKBQCA9A/RAEAgEAukZOjyQ55SlPZ2TzrFeuTIw9zFKpSrR0XIZ4p2UTKXTO03NwG+sV2IymUbrSpZM6BPGT73WlRQAAGpqWiNG7beymf+xN64a4JA/awaVpKwboanV32dcAo1m27U89gwIgqDo+6dDxRAfCBAEUZSClGanaUolJXmjJNra2l5eXqamph9rSC6KMu/o6Dhr1iwajTZt2rRRo0YBAPT09GbNmqWhofExpdgN6Onp+fj4kMlkJULZ0mMwGN7e3gMGDCCRSISOis+OF7Kbm5tkIXddU24+ZSGTyYMGDfL09JRKU64hRdalDMnWJkRU1Ph/s1fvZD68Uf+8+MvnYPy9v8sSzzF37uxU037+Qn2ngcYTJr5MSlSkYzV9er/JU8pTUyuPxSpPzWbOHMspUx47D3qTnfVJGRcnYv8vej9vMpnG579/BSRw+o2iwvgS5h+d3o4g6PRvHvUxGMjns+tq8z7W+sKBDo59+kw0N0usfK5Era+Fr439v9itNQnxbu1tyjS/JKtWrbpz586DBw+U6MyZMyc0NHT8+PFNTU0AgNTU1Ly8vE2bNiUkJJiZmRFqJ06cOHDggNwUbt26JdnMhYSEuLu7h4aGbtiw4ebNmwCAuLi4qqqqtLS0X3/9FQDAZrOZTObly5cvXbpEp9PxfToYhtXW1ubl5W3fvp3L5eK5whuXq1ev/vDDD1ZWVidOnJC0GxMTo62tLWto0KBB+vr6UpkkHlAKX1/f4ODgvLy8lStXMpnMmzdvDhw4cOXKlbm5uUwmU5XyVKWQVeHHH3+0sLC4c+cOn8+XFWpqakZERIwZM+by5cvR0dH47tO4uDgjIyNcs6ysbPHixdbW1ocPH5ZM9s2bN+PGjZNrUbaQhUJhFzV1dHSk8ikXc3Pz06dPE5719OnTe/fuVWRIrlCuIdna7KzUvwo6Spmq3its+5mtIUMxAf8L58DQ1VXQxlbFA2VMDjQe46PE/Xx5MjPmZ2ZIj11MzD2aWRUqeCAkOKRYR9fywvkJr55f/gTrEy9c8unM/bS+fRL7mz6NZj1zQc43Cwvjj1hzuXWfYKvbCQ4Obm9vV944qqmp4R1S/JJEIuGfURQtLi7G2wUAQFVVlaIUUBTNyso6efIkfllWVjZy5EgAwIoVK27duoVhGFkMbuiPP/6ora319/ePjIxksVjV1dUAgOTk5OzsbFdX16CgIIFAcP78+bCwsEePHsXHx/v5+QUEBOTn51+4cCE0NJTBYKxduzYhIeHmzZvFxcXBwcGyhrZu3aqhoUGn00NDQy9evJiVJer9sFgsuZl/+fIlAKCysrKhoaGmpuYTylOVQu6UiRMnuri4REVFtbe3ywpRFE1KSuLxeHgF4d9yudzz589nZ2c3NDTMnz9/5syZ/v7+V69e3bFjB65AJpPXrl1bW1sr16K9vX1YWNiNGzf27t0bHh7u7++fn5+fmCjnh6+6Jo1Gk82nXOrr63/++eeioiI+n79+/fqgoKC4uDgTExNZQ4WFhbLC9PR0uYY+qja/Ht4Pq/taO836ds/JXeGERMtV1+/U+DvrbtamvlA9xWklZfzWVm0bWwCEzy9dspw6tYnJvODYX93a2vfsOT17e5RMFgiwygvJpefOGQwZIgRA28LK+adtAICSc+daH+VOLixCSKS22hojN3dUTe3V7VtPT8R77N0varOFwpMnT2Dijpjv1QxjT8+nsbH3w5ZbLV02YveemszM+oeiH1tff3+PX3dTtLU5jY1Jbq62CxY6fxeRNHQIu7BQ1Jd521p+IUUofnXcZvZs28tX1LR6tdbWpvS35zU3y30oBEHHTfnTxMydTNbAMEFx0ZnCvN+nzcwQv+3UFntQD1czMvUzNh2GIGifPo7OrlsAAM+enGazS0Q/FcfwUb7/W/zk1I0/5+LKnmN+19WzTk+dRbif8nmzWvl8W11dIRBeKquYasNgNjY5nEywpmmcnzTOXk+fjKKYUJhcUnqtsmq/j7f4bV9worSCL34xPry/7S/eXscKnsy0t9OmUl63tTvGn6kRnxPBZpeciXWZs/jJpKBr5084Kao7HR2djRs3Dhw4kM8XpKdfPnTo0MKFC/38/OLi4q5cueLo6BgZGZmRkVFSUrJ8+fLDhw/PmTPHwsIiLy8vMjKSy+V6enouXrzY1NS0sbExJSXl1KlTAIDIyEgSiVRfXz9mzBg9Pb3z58//+eefdDodQZB+/fp5eXkBAPLy8poVlLwiWCxWTk6OKpp1dXWymmZmZrNnz5YauDx9+jQzMzM9PT0tLW3ixImHDh3Cm4xbYnx9fe3t7adNm4Zh2IYNG5qbm7OzswcPHjxhwoTExMTc3Fz8KISqqipJc1KG7t27BwCws7MLDQ0tLi6+ceOGkpxXVlbW19e3trZWVFRIellHR8fVq1czGAwmkxkREWFsbCxbnvr6+rJCJycnuRWH99AjIiISExOlhik0Gi08PPzvv/++evWqXCGJRIqOjr59+7bkex1tbW3EI1PF0+YlJSVsNjslJQUXRkVF8Xi87du3y31wvJBjYmK4XK6Tk+jPFS/krmhyOBzZfOJIPTuHw7lw4YKoAdTSolAoGIa1tbXJNWRnZycrTElJkWtIUW1+5XwwsTtm+rL8uxmPb13AL2mmmtr9tPzPTsiYfflVcrWKKdIMDUmamoDPB2SyxbRpAh5P184OpVCm3ftbTVeX19xck51lNnqMJp3uvj1Gx9paCACNbj54w0YAgKa5eVbIPA0jI6qenq6NzZtnJbrWDKp+79ePHtdkZhoMGULR1wckEhB7oKzVK/+Vl99/2TJWZbl79E8AQbLWrLKZI2rfrYKCsfZ2VkWFlqWl56H/tLyoJlGp6n0M8JkyMo3Wi05/W1EBAOgfGiZgs3lvW2jGxgN/2JqzepXch5oy866R8dC62pynT86MGPWTpqYRq+nRvTtbrawnGpm4E2pOLqEWlgEIghoYDerdZ4D4r7mZmS/qqlOo2giCUtXfz8CY9fVqb39dWXqGkBiKjzASYBgJJU21seZjAjs9PQqKPpgzU4dKZXE4Wa9ejaab07V65dU3ZFa/GGJkpKeuTkIAPm4166VFIZGXODtx+PyKZpaFjk7s2NHjLl7BE2ezS+pqHhoYDlJUcSiK7tu3z9zcPDk52dbWNiQkJD8//+zZs5MnT1m/fn1DQ8OWLVsQBElISPDz87O0tIyJiSkuLq6oqPDw8PDy8iovL4+JiWloeJ2cnDxs2LAVK1awWKy0tDRzc/PBgwfzeLyMjAxPT09dXd25c+f6+/sDAEaPHo1PiO/atYtom2QJCQnBRwmampqE0MTEZMaMGfjn1NRUJW8TW1tb45pNTU3EsRdFRUUhISHJycmy+nw+//nz5/QPlxIHDhyoq6t7+/ZtExOThobXuL8UCoVlZWV2dnaKTCs31Cn37t2bNGkSAGDdunWS8rVr15aVlWVlZY0dO3bmzJl0Ol22PJ2cnGSFAADZirt27Rp+cpWWlla/fv2k8hAREaGurr5t2zZFQoFAoMSPThWTnp6en59PCGfMmOHv7793797iYvnT/sbGxg0NDc3NzdHR0erq6iUlJcSE3idrKsmn7LN7eHgsXbqUwWCgKPrTTz9xuVy5hlpaWmSFigwpqs2vnA9WAksL7tVWPpv2dPa8tiXz2pb4n50AEICSEL9T4z8uVQy7ONYPANBeW1t2LgFB0QGbNqvp6r5+9OiUvm6Grw+ulWxnc4yEIADU3Mo8RkKOkZCskHlEGsyjR1PsbbgsFqfxNevB/WsBY+sfPpQ0wi4qygxbDgBw374DIZPvrF7FLirCv+K/fXtSXzfJ2goBgNqnj5KccpqaTupoJbo4IwBYBE6Sq6NBszIyHlpeeinp1JCCR7swTDS+5vHePn64ra4mV1IzI23qkX0aGMYvKjxxZJ/GkX0auPsBADx+uO33fZpXUycQytra/d40PpWyJRQKfc8liXru7NaE4qcIAra4OOtQqXn1dTr/iR2TcglXu9/4xi81/aG8SYwWLlfnP0ctj58GQGhM+2DJ+sXzWySyOlXdTO5jOjo62tjYMJnM27dvX7x4EQDg5ubW2toaGbmRRCLt27dPX19/8+bNra2tuH5OTs7ChQtXrVqFYZiXl9fkyZNRFI2O3nrgwIElS5ZgGBYYGEgkvnPnzujo6PLy8jdv3mzdutXDw0MoFB47dsxDjBL3AwAYOnToSDH4XBmOqanp3Lnz8H+yKyuS2Nra4mpTp04lhAcPHuzVq9eKFSvk3tLW1qajo4N/Hjt27F4xDQ0N8fHxmpqaHM77yaj29nZJvyiLckM4+/btu/4hly5dUqJfVFQ0b96877//ns/n6+rqyi1PJYUsVXG4MC4ubu7cud9//72kIVdX17Fjx8bFxUnOGskVyiUgIGDdunUPHjz48ccfCaGFhUV4ePjdu3dPnz6t6EZNTc3W1lYfHx9fX98DBw7U1dUpKmTVNZUg++wNDQ2PHz++f/++QCAICwszNjaWa6hbrH/lvB8DsVve7N84k1X/vPq6nqGrqB+BoEjvgaLmu62e/VGJvnn2rKWYKfIiccc0xVtETEePBgAUHjqoYgq8lpa7ixcBAE7p6ypRKz9yZOj/RGmYmvLa2koOvl+Lvhe5EcNPKRUK0Q838EhxY9FCoXiYDQAgKdC0GyDyizn3flYx84qQ3K0gLl4Sl9sqpfOs6Q2z+S0A4FjBE9NeWqI+bF9zAMBvufkqWtl46zZH0HFaJfnDvUbtbY0AAHWaMaddzrSqg4MDAGDw4MG//PIL3grje64KCwuzs7O9vLwqKioeP35M6O/atQvDMPwcTAqFYmJiIhQKc3Nz8XurqqoMDQ1xzaKiorS0NADA0qWfsuUyPDy8vr4eAJCenk4IHz58uHLlSlVuv3z5MrH8QJCTk3P37t3AwEA+ny87JaKvr080r3p6ei9evDh27FhqamqTGMlNENra2nI3EahoCCc2NtbAwEBSovyAqD179uDF3tbWRlb65y0XqYrDhRiGlZSUSKqRyeT169eXl5fHxcUpF8plxIgRmzdvfvjw4bfffkscn4oP4Hg8XlRUlJJ7m5qaHBwcvvvuu9zc3JSUlG+++UZRIauuqQTZZy8Wg8/yfffdd7NmzZJrqFusf+W8//M9tiOMVf9c1L9efgeX2Kx08PjZq7WmNdH61EclKhSvwolGGG+acA9E1qABAFqrXwIA/K7/Jd6q8n6fIrmXlnQSEn+ySui/eTOtI30Nv2vXidEVj/1BW49xRfnRMBA1iL5X/pTcKNNeVydqdn+IFgLwukB+K0+jidzwW1YpAMDKZh6Z3OkGbqFa5zqgjV3bu4+DlJAn7HjwpnYO7oFoaqI6qm5pAQD8d8oE8S5fZcm28hQejUy38MEwfnPjQ7nfVlZWAgC2bduGD4AIhg0bNnLkSB6PZ21t/e9///v333/H5S3iLPn4+KAoWlhYaGpqiiCInZ3dkydP1NTUTExMiMMAJTdQEQgEAlrXtsJ3kf379x8/fpxoggkYDIaFhQWx7HFWDPFtdXW1h4cHg8EoLS1VU1NzcHCQ3ZamoiGC+fPn4+6foL29/b///a8ifcI/SW76kluecoVSFafICr6kt3TpUkn/IVcoi7q6+pYtW8rLy9etWydZ+6ampq6urmfOnFG+7FddXY0giLq6enR0NIVCMTU1ffToURc1Pw18SU9TU1Ouoc9t/WugwwNlph17kHFG6rvSQ8VkdXLxnkKMq5I/UELpubN9XFy8jx5tr6vTd3QUAqDep6PTx+dw9R0dbVevNnIf9raq6tH6COVJOW2Nbnn+vGT/Pm1Xt6Hf/yDg8RJdh0y9e89s9BiHyEi5t7BrXgEAXKOiXDZt6u3kJOmBHJYs0f35VyN3NyGG5UjMFUiSe/9nJ5cwv/HxVRXX3Ef+j9QvX+TbNCwEgrdc7mv8ksNpNjX3AAAxMvOnkGnPK5MAAEZmAeMnncy5vzvvYYeVutrcfpYBFIqh8s1pZ4ufDTY0ig3wq2trG9BbHwBgoPFBm/Kjq8vzlrd7CzvfSW9k4try9qWib3NychobG8PCwlpaWhAE8fDwKCoq+uuvv6Kiompra+fPn3/gwIEFCxYQE/oeHh50On3y5MnNzc3p6elWVlZTpkxZt25dcnKyh4cHlUrNzMxUkpkXL154eHjcuHGDwWA0NDQoX5PvClZWVsT8219//UXIS0tLr1+/7ufnR0hGjhw5Qgyfz4+Pj5fr6lNSUoKDgzds2LB9+/Y5c+bQaLT09HQ1NbVhw4bhCwk2Njaenp6Sg0VZQ1KcOHGid+/ekpJPOCRXbnlKCXFNqYrDhT4+Pvg2d3z/hYWFxezZsy9evFhQUECYkCtEUXT48OEoimpqahobG3t6ejY2Nurr6+vq6hYVFU2Y0DHtXFRUxGQyGQwGPqqWyryUdbyQa2pqqFRqZGQkiqKK8qm6ptx84jmR0uzfv/+4cePu3LnDZrMXLVoEAHjw4AGTyZQ1VFBQICtUYggiS4cHit+xXPY7jIsV7VR18odAKMCEGAa4XESUAhcffzz99Vfn1d/SjIxohobVGRkm3t4UHW1cv/jo7wOWhw7/ZTcCwMvMm6LBkYL+FcYXJeW87jsBh1uyf9+41FQERTP/vYhdkH918qSJV/90+yE6N2a7qOvX1jFTLxQKMT6fuWun85pvdR0ccBOmXqMwPl/AEf3IbeeFIADwWG/vbdqo6MWgtraKupoHZn1HmfUdVVKcZGk9QSh8n0EEQUOWlXPam2J/61iN+Dt7h9foncvWiHSKCuNxD8SwDqSq6zNsJxMeKDNjzZzFY4PmPjjxR9+OBxQKMUzIFbk3IRfDeOKd8b88fvLtkMGGNJoBTeNaZZU33Vz7XW+aJ55tW+fmyhMI9hYWc8TzKu3vupxCIeBLFGTA5EtUqu7NDPlbLfBWb82aNTExMdu2bRMKhc+fP8/Jydm0aRONRlu1ahWLxYqIiDh+/HhUVBQ+DNqwYQOGYUVFRfgaSUNDw+HDhxctWhQZGSkUCq9cuRIfH493w+Wai42N3bhx48GDB/G1CrkeCN/SSnSiBQIB/pmYRCLw9PQkJv1wOBxOWloahmFOYnBhXV0dj8cj/r4OHDjg7e3N5/NxQ4GBgS0tLQUFBRcuXKioqMA3I0gZqqio2L9/f2hoKL7D++rVq5cvX7a0tNz57nWCyWJiYmLkGsIv8Q/E5f379xVVitwC4b2bYyAKRFF5SgnrxCN+qYrDb+/Xr5+GhgaxGr9x48bm5uY9e/ZIWpcr1NDQ2LVrF+6tncUwmczjx48DAIaLwdUSExOZTCa+ktfY2Cj1XFLWiULG14rwQu6iptx8LliwQFazT58+06dPx3evCASCU6dOXblyBR/LShnCMExWqMQQRA5f8mxs/VHeau8WeCVBqVSzGUE0x4Gf1bqBfwBFaskaQdQtLSkfTsHLgk+7aWr1J5NpVKrxsjXC4V7/q/wWFCWb9g2k0awlhQy7BSjpgwMdrGzmLVsj/GZhuY3DEuUJehv20aGoKddRhJGp39RZ95etEY4cc0QVfWNjY21tbSUKU6ZMyc7OZjAYsjNLZDLZxMREXV1dFUMkEsnKykq5LRXZvXv3lQ9JSkrqerKKoNFoTk5OfZRucvnyyC1PSaGSikNRdMCAAYTc1dXVwsJCSkeusFuQso5Do9Hc3d2lCrmLmipa19HRcXd3d3FxkdpZINeQXCFEVWB0hk7x9j+5KIw1fuo13/GJ/17RujD0DYJ023l69o7hsxeULFuNeXj/1l1pSmJkFrBsjXDeklfunru7K028IYM/uf/vgBUH+ccBPVCnaGhYjBx9ODikeM6iKp9xCRo0q243gaIURZuku5oyiUqhGHZvmra2tmvXrlWldwn5RwErDvKPA3ogCAQCgfQIMDoDBAKBQHoG6IEgEAgE0jNADwSBQCCQngFGqIMR6iAQCKRn6Gi8Zq/eaUBXdsTv52P8vb/tIzo5BwGHiFCnRAePUGfs49tpaniEul7OCs+KVgUiQp2kMHD6DWv7RarcjiDojDmPF4Wz+jstQEkf/boPEaFOuVpfC995S4vnLXmlrtG5Y4ZAIJAvB74X7uhdQdSJXJT00accdp35AqH/DZXCBVLp9H4hnQwUnH/aFiIQ9pvf+RvII+KOhwiEusNHqJxTVVmyiufld0wFRWTm/GdLV/NN6B959Pg76BrU+TYqbQ2n0awXhrEWhbG6fWc2BAKBfDIwQh2MUAeBQCA9wwdLCGOmL3PynExcEhHqTKaaq54izdBQz9GRREJJVKrFtGkCPp+IUNd70CABh/PyViaZSsEj1OGB6fAIdYM3bHReswYAoGFkpGNjYzLSk1VZJRr6vItQJ2htRdXVwbvAt1mrV5LI5P7LltlHRHju3Ucik7PWdBx6ZhUUTKJSWRUVlN69PQ/9R9PcHI9Qh3+LR6jDP/cPDUPIJCJCnaKHmjLzbl8Ln6bGp3dubEQQhIhQV1+bQ6G8PwTFySXUbXgkHqHObXik2/DIvoyO8Y2KEeoG9O6NIgiFRBZHqMOICHXOBoYcgSDzxQs1EkpEqGvl8alkMund+ZlEhDoqiVTRzOqtoRE7djSROB6hTk/PRvWqhEAgkM8KjFAHI9RBIBBIz/DeA+ER6mqrnlRfr2wsaGgsaGh60hFuoCsR6oTiCaKuRKi76j1KkVr5kSPsly9F+v8XI9RhQuQzRajr2nNAIBBI9/C+hZKMUJc2NCltaBIz7gkAoIsR6vAPnzVCHfIuQh0h/78Roa7D+peKUAeBQCBfmA4PpChC3b3NWYnWp7olQh0AwPvo0Ul5+WbeovGQbIQ6z9NnBu3Y2WlSTlujrcNXAACICHUJzk78tjZVItQF5uaZfxgozGHJknF37josWgSURqgTOc7x8c5DNvuO/11uhDoK5X2EMYkIdQH0ftNwoZFZwILlr52HbCbU6mpzaZomnW5OO1v8DAAQG+BXMPtf3nRzuRHqVg5QaSe98gh1EAgE8oXp8EBKItR9rPtRFKGuvbaWZmio7+hYnZGB8XiSEepIamrDf9nNCAo2dHPriFAnL6wZEaFu2E6RS+iIULd0CR6hDgiFbj9E45N+shHq+G/f6jo49HZyepl5U5QxiQh1RsPceW9b7qxa2WmEuuFe0WXPLgoEHNkIdd8sekZI/s7eoalpsmwNNjUo3cpuCi4kItQRapkZa4RCLGjug/cPqCBCXR271YCmMaC3/rXKKj4mkI1Q9/MoL5HnUyFC3d1bWz6qNiEQCOQzAiPUwQh1EAgE0iMgCQkJQUFBPZ2NfzTe/icZ1oGvXt7ncpotGAGYgBf7m77kMKgr2DuGuwxdra1jlf/o8J0bckaiXcTILGBqUDq7taa46My9W2u6PX0IBAL5dGB8oE6BEeogEAjkswA9EAQCgUB6BBidAQKBQCA9A/RAEAgEAukZoAeCQCAQSM/w/wIAAP//EAFKy8F7ycUAAAAASUVORK5CYII=" width="555" height="41" class="img_ev3q">
本质上是在做这几件事：</p>
<ul>
<li class="">劫持客户端请求</li>
<li class="">首次请求回源 Hugging Face</li>
<li class="">自动缓存到本地</li>
<li class="">后续请求全部走内网</li>
</ul>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-6下载模型">Step 6：下载模型<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#step-6%E4%B8%8B%E8%BD%BD%E6%A8%A1%E5%9E%8B" class="hash-link" aria-label="Step 6：下载模型的直接链接" title="Step 6：下载模型的直接链接" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">hf download deepseek-ai/DeepSeek-V4-Pro</span><br></span></code></pre></div></div>
<p><img decoding="async" loading="lazy" alt="客户端2" src="https://matrixhub.ai/zh-CN/assets/images/client2-838fb1ca9e21f71866da3e6776d93b2b.png" width="924" height="76" class="img_ev3q"></p>
<p>下载完成后，进入‘deepseek-ai' 项目可以看到 DeepSeek-V4-Pro 模型在页面上出现.
<img decoding="async" loading="lazy" alt="下载" src="https://matrixhub.ai/zh-CN/assets/images/download-53cf0791790ee5a0a6d2abb9981894d7.png" width="1123" height="333" class="img_ev3q"></p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="验证缓存是否生效">验证缓存是否生效<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E9%AA%8C%E8%AF%81%E7%BC%93%E5%AD%98%E6%98%AF%E5%90%A6%E7%94%9F%E6%95%88" class="hash-link" aria-label="验证缓存是否生效的直接链接" title="验证缓存是否生效的直接链接" translate="no">​</a></h2>
<p>用 <code>curl</code> 看请求行为。</p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="第一次请求回源">第一次请求：回源<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E7%AC%AC%E4%B8%80%E6%AC%A1%E8%AF%B7%E6%B1%82%E5%9B%9E%E6%BA%90" class="hash-link" aria-label="第一次请求：回源的直接链接" title="第一次请求：回源的直接链接" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">curl -I http://127.0.0.1:3001/deepseek-ai/DeepSeek-V4-Pro/resolve/main/config.json</span><br></span></code></pre></div></div>
<p>特征：</p>
<ul>
<li class="">请求时间较长</li>
<li class="">会带有上游响应头</li>
</ul>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="第二次请求命中缓存">第二次请求：命中缓存<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E7%AC%AC%E4%BA%8C%E6%AC%A1%E8%AF%B7%E6%B1%82%E5%91%BD%E4%B8%AD%E7%BC%93%E5%AD%98" class="hash-link" aria-label="第二次请求：命中缓存的直接链接" title="第二次请求：命中缓存的直接链接" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">curl -I http://127.0.0.1:3001/deepseek-ai/DeepSeek-V4-Pro/resolve/main/config.json</span><br></span></code></pre></div></div>
<p>特征：</p>
<ul>
<li class="">响应很快</li>
<li class="">不再访问 Hugging Face</li>
</ul>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="写在最后">写在最后<a href="https://matrixhub.ai/zh-CN/blog/deepseek-v4-distribution#%E5%86%99%E5%9C%A8%E6%9C%80%E5%90%8E" class="hash-link" aria-label="写在最后的直接链接" title="写在最后的直接链接" translate="no">​</a></h2>
<p>如果你也在企业内网落地大模型，一定会遇到这些问题：</p>
<ul>
<li class="">下载慢</li>
<li class="">带宽炸</li>
<li class="">节点重复拉取</li>
<li class="">权限不可控</li>
</ul>
<p>这些都不是偶发问题，而是架构缺失。</p>
<p>MatrixHub 只是把这件事补上了。</p>
<p>如果你正在做类似事情，欢迎交流：</p>
<p><a href="https://github.com/matrixhub-ai/matrixhub" target="_blank" rel="noopener noreferrer" class="">https://github.com/matrixhub-ai/matrixhub</a></p>]]></content>
    </entry>
    <entry>
        <title type="html"><![CDATA[示例]]></title>
        <id>https://matrixhub.ai/zh-CN/blog/examples</id>
        <link href="https://matrixhub.ai/zh-CN/blog/examples"/>
        <updated>2026-04-27T00:00:00.000Z</updated>
        <summary type="html"><![CDATA[MatrixHub 在企业内网里的实际使用示例，重点展示模型缓存和分发加速效果。]]></summary>
        <content type="html"><![CDATA[<p>这里放一个 MatrixHub 的真实使用示例。</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="常用场景">常用场景<a href="https://matrixhub.ai/zh-CN/blog/examples#%E5%B8%B8%E7%94%A8%E5%9C%BA%E6%99%AF" class="hash-link" aria-label="常用场景的直接链接" title="常用场景的直接链接" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="内网-vllm-集群的大规模分发">内网 vLLM 集群的大规模分发<a href="https://matrixhub.ai/zh-CN/blog/examples#%E5%86%85%E7%BD%91-vllm-%E9%9B%86%E7%BE%A4%E7%9A%84%E5%A4%A7%E8%A7%84%E6%A8%A1%E5%88%86%E5%8F%91" class="hash-link" aria-label="内网 vLLM 集群的大规模分发的直接链接" title="内网 vLLM 集群的大规模分发的直接链接" translate="no">​</a></h3>
<ul>
<li class=""><strong>场景描述</strong>：内网生产环境部署了一个由 100 台 GPU 服务器组成的 vLLM 推理集群。由于模型文件很大，例如 70B 模型可能超过 130GB，如果每台机器都去公网 Hugging Face 拉取，不仅耗时很长，还可能触发公网带宽限流。</li>
<li class=""><strong>流程概览</strong>：<!-- -->
<ol>
<li class=""><strong>统一接入点</strong>：将所有 vLLM 节点的 <code>HF_ENDPOINT</code> 环境变量统一指向内网 MatrixHub 地址。</li>
<li class=""><strong>拉取即缓存</strong>：首台机器请求模型时，MatrixHub 自动从公网拉取并持久化到本地；后续节点请求将直接命中内网缓存。</li>
</ol>
</li>
</ul>
<blockquote>
<p>作为用户，我希望把 <code>hf download</code> 的 Endpoint 指向 MatrixHub，这样当同一内网里的其他节点再次拉取同一模型时，可以直接享受缓存带来的速度提升。</p>
</blockquote>
<h4 class="anchor anchorTargetStickyNavbar_Vzrq" id="操作步骤">操作步骤<a href="https://matrixhub.ai/zh-CN/blog/examples#%E6%93%8D%E4%BD%9C%E6%AD%A5%E9%AA%A4" class="hash-link" aria-label="操作步骤的直接链接" title="操作步骤的直接链接" translate="no">​</a></h4>
<ol>
<li class="">访问 MatrixHub 地址 <code>http://x.x.x.x:3001</code>，进入登录页面。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-6-95364bc816f5f19cb796f3ca60b57d49.png" width="1280" height="451" class="img_ev3q"></p>
<ol start="2">
<li class="">使用 admin 用户登录平台，进入模型仓库列表。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-9-58d132f074f9c5194077788185b76b1d.png" width="1280" height="434" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-5-7b89a1bfe2828b5f7fbe5768097dfc3f.png" width="1280" height="290" class="img_ev3q"></p>
<ol start="3">
<li class="">点击右上角用户菜单，进入平台设置和仓库管理。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-7-5d6d69592209a465863f1d431f2a954e.png" width="1280" height="316" class="img_ev3q"></p>
<ol start="4">
<li class="">创建目标仓库：选择 Hugging Face 作为提供者，填写仓库名称 <code>hf</code>，输入目标 URL <code>https://hf-mirror.com</code>，勾选验证远程证书，然后点击“确定”。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-4-c02ce25629381e66b74e86e42c1e13f5.png" width="1280" height="665" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-3-a1b6b3f0385d39828c448945d21c47f5.png" width="1533" height="256" class="img_ev3q"></p>
<ol start="5">
<li class="">进入项目管理，打开项目列表页面。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-10-f2ed65372d92ddb2bcb48c5b999a364e.png" width="1664" height="256" class="img_ev3q"></p>
<ol start="6">
<li class="">点击“创建项目”：输入项目名称 <code>qwen</code>，设为公开，开启代理，选择仓库，填写代理组织 <code>Qwen</code>，然后点击“确定”。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-8-433051b8dde40426ca98820fe5d099be.png" width="1280" height="498" class="img_ev3q"></p>
<ol start="7">
<li class="">
<p>拉取模型。</p>
<ul>
<li class=""><strong>第一个节点</strong>：约 <code>3m37.318s</code></li>
</ul>
</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-cc4252c5c3f8e9ffd9964427bf748a94.png" width="1904" height="178" class="img_ev3q"></p>
<ul>
<li class=""><strong>第二个节点</strong>：约 <code>0m8.500s</code></li>
</ul>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-1-cc4252c5c3f8e9ffd9964427bf748a94.png" width="1904" height="178" class="img_ev3q"></p>
<ol start="8">
<li class="">在 MatrixHub 中查看模型信息。</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/zh-CN/assets/images/scenario-test-cn-2-b4e841a1f4d7df1762961de6305ec76f.png" width="1280" height="517" class="img_ev3q"></p>]]></content>
    </entry>
</feed>