<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
    <channel>
        <title>MatrixHub Blog</title>
        <link>https://matrixhub.ai/blog</link>
        <description>MatrixHub Blog</description>
        <lastBuildDate>Mon, 27 Apr 2026 00:00:00 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>en</language>
        <item>
            <title><![CDATA[DeepSeek v4 won't run? 99% of people get stuck at the distribution stage]]></title>
            <link>https://matrixhub.ai/blog/deepseek-v4-distribution</link>
            <guid>https://matrixhub.ai/blog/deepseek-v4-distribution</guid>
            <pubDate>Mon, 27 Apr 2026 00:00:00 GMT</pubDate>
            <description><![CDATA[Why enterprise DeepSeek rollouts fail at distribution, not model serving, and how MatrixHub fits in.]]></description>
            <content:encoded><![CDATA[<p>Recently, DeepSeek released DeepSeek v4, and many teams rushed to integrate it.</p>
<p>But if you're operating in an enterprise environment, especially air-gapped or private deployments, you'll quickly realize one thing:</p>
<blockquote>
<p>The model is not the biggest problem. Distribution is.</p>
</blockquote>
<p>During our attempt to deploy DeepSeek v4 in an internal network, we ran into a lot of issues. In the end, they can all be boiled down to three fundamental problems.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="1-you-think-its-a-download-problem-but-its-actually-an-architecture-problem">1. You think it's a download problem, but it's actually an architecture problem<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#1-you-think-its-a-download-problem-but-its-actually-an-architecture-problem" class="hash-link" aria-label="Direct link to 1. You think it's a download problem, but it's actually an architecture problem" title="Direct link to 1. You think it's a download problem, but it's actually an architecture problem" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="hugging-face-doesnt-work-well-in-enterprise-environments">Hugging Face doesn't work well in enterprise environments<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#hugging-face-doesnt-work-well-in-enterprise-environments" class="hash-link" aria-label="Direct link to Hugging Face doesn't work well in enterprise environments" title="Direct link to Hugging Face doesn't work well in enterprise environments" translate="no">​</a></h3>
<ul>
<li class="">Unstable or completely unavailable network</li>
<li class="">Slow downloads and large-file interruptions</li>
<li class="">Lack of access control</li>
</ul>
<p>It looks like a slow-download issue, but in reality:</p>
<blockquote>
<p>Hugging Face is built for research collaboration, not controlled enterprise distribution.</p>
</blockquote>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="2-you-try-to-fix-it-yourself-but-make-it-worse">2. You try to fix it yourself, but make it worse<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#2-you-try-to-fix-it-yourself-but-make-it-worse" class="hash-link" aria-label="Direct link to 2. You try to fix it yourself, but make it worse" title="Direct link to 2. You try to fix it yourself, but make it worse" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="common-workarounds-all-break-down">Common workarounds all break down<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#common-workarounds-all-break-down" class="hash-link" aria-label="Direct link to Common workarounds all break down" title="Direct link to Common workarounds all break down" translate="no">​</a></h3>
<ul>
<li class="">Manual file transfer leads to version chaos and no auditability</li>
<li class="">NFS and NAS hit IO bottlenecks and still have no caching</li>
<li class="">Each node downloading independently exhausts bandwidth and slows cold starts</li>
</ul>
<p>Especially in vLLM and SGLang scenarios:</p>
<blockquote>
<p>Every node downloading the same model multiplies bandwidth pressure by N.</p>
</blockquote>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="3-the-real-problem-is-actually-just-one-thing">3. The real problem is actually just one thing<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#3-the-real-problem-is-actually-just-one-thing" class="hash-link" aria-label="Direct link to 3. The real problem is actually just one thing" title="Direct link to 3. The real problem is actually just one thing" translate="no">​</a></h2>
<p>All these issues can be summarized in one sentence:</p>
<blockquote>
<p>You're missing a model distribution infrastructure layer, like a container registry for model artifacts.</p>
</blockquote>
<p>Just like you wouldn't use Docker Hub directly in production, you'd use a private registry instead. But in the model world, this layer has been missing for a long time.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="4-our-solution">4. Our solution<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#4-our-solution" class="hash-link" aria-label="Direct link to 4. Our solution" title="Direct link to 4. Our solution" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="core-idea">Core idea<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#core-idea" class="hash-link" aria-label="Direct link to Core idea" title="Direct link to Core idea" translate="no">​</a></h3>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">Public Model Source (Hugging Face)</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">        ↓</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">Proxy / Caching Layer</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">        ↓</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">Unified Internal Distribution</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">        ↓</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">vLLM / Inference Services</span><br></span></code></pre></div></div>
<p>This follows a pattern that has already been proven elsewhere:</p>
<ul>
<li class="">Docker -&gt; Docker Hub -&gt; Harbor</li>
<li class="">Maven -&gt; Central -&gt; Nexus</li>
<li class="">PyPI -&gt; pip -&gt; Private Registry</li>
</ul>
<p>Model distribution is fundamentally the same kind of problem.</p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="key-capabilities">Key capabilities<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#key-capabilities" class="hash-link" aria-label="Direct link to Key capabilities" title="Direct link to Key capabilities" translate="no">​</a></h3>
<p>This distribution layer should provide:</p>
<ol>
<li class="">Proxy access to Hugging Face, not a replacement</li>
<li class="">Automatic model caching</li>
<li class="">Resume support for interrupted transfers</li>
<li class="">Access control and permissions</li>
<li class="">Internal network distribution</li>
<li class="">Compatibility with vLLM and SGLang</li>
</ol>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="5-we-built-it-into-a-project">5. We built it into a project<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#5-we-built-it-into-a-project" class="hash-link" aria-label="Direct link to 5. We built it into a project" title="Direct link to 5. We built it into a project" translate="no">​</a></h2>
<p><a href="https://github.com/matrixhub-ai/matrixhub" target="_blank" rel="noopener noreferrer" class="">MatrixHub</a> is essentially:</p>
<blockquote>
<p>An enterprise-grade Hugging Face proxy and model distribution acceleration layer.</p>
</blockquote>
<p>It provides:</p>
<ul>
<li class="">A Hugging Face proxy for public-network constraints</li>
<li class="">A model cache layer to eliminate repeated downloads</li>
<li class="">A unified enterprise access entry for permissions and governance</li>
</ul>
<p>You can think of it as:</p>
<ul>
<li class="">Harbor for models</li>
<li class="">The container registry of the AI era</li>
</ul>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="6-quick-start">6. Quick start<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#6-quick-start" class="hash-link" aria-label="Direct link to 6. Quick start" title="Direct link to 6. Quick start" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-1-start-the-service">Step 1: Start the service<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#step-1-start-the-service" class="hash-link" aria-label="Direct link to Step 1: Start the service" title="Direct link to Step 1: Start the service" translate="no">​</a></h3>
<p>Download <a href="https://matrixhub.ai/deploy/docker/docker-compose.yaml" download="docker-compose.yaml"><code>docker-compose.yaml</code></a> and <a href="https://matrixhub.ai/deploy/docker/config.yaml" download="config.yaml"><code>config.yaml</code></a>, and make sure the two files are in the same folder.</p>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">docker compose -f docker-compose.yaml up -d</span><br></span></code></pre></div></div>
<p>Default service endpoint:</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">http://127.0.0.1:3001</span><br></span></code></pre></div></div>
<p>Verify:</p>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">curl http://127.0.0.1:3001</span><br></span></code></pre></div></div>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-2-login">Step 2: Login<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#step-2-login" class="hash-link" aria-label="Direct link to Step 2: Login" title="Direct link to Step 2: Login" translate="no">​</a></h3>
<ul>
<li class="">Username: <code>admin</code></li>
<li class="">Password: <code>changeme</code></li>
</ul>
<p>Change the password immediately.</p>
<p><img decoding="async" loading="lazy" alt="login" src="https://matrixhub.ai/assets/images/login-cbb09f3ddeec0c99068836ac24eedf92.png" width="842" height="980" class="img_ev3q"></p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-3-create-a-remote-registry-to-proxy-hugging-face">Step 3: Create a remote registry to proxy Hugging Face<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#step-3-create-a-remote-registry-to-proxy-hugging-face" class="hash-link" aria-label="Direct link to Step 3: Create a remote registry to proxy Hugging Face" title="Direct link to Step 3: Create a remote registry to proxy Hugging Face" translate="no">​</a></h3>
<p>Key configuration:</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">Remote URL: https://hf-mirror.com ( or https://huggingface.co )</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">Type: HuggingFace</span><br></span><span class="token-line" style="color:#F8F8F2"><span class="token plain">Recommended name: huggingface</span><br></span></code></pre></div></div>
<p>How it works:</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">Request -&gt; MatrixHub -&gt; Hugging Face -&gt; Response</span><br></span></code></pre></div></div>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/registry1-ff256b8adaad3cdc3caf42deda818efc.PNG" width="1280" height="572" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/registry2-9f6f466e7682ca18fe28f4c7ced13214.PNG" width="1280" height="440" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/registry3-20f5e50695b14e8ded28bf6c25fe937b.PNG" width="900" height="807" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/registry4-cf7279f2be90bf88df4b8aae4eb683c4.PNG" width="917" height="304" class="img_ev3q"></p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-4-create-a-proxy-project">Step 4: Create a proxy project<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#step-4-create-a-proxy-project" class="hash-link" aria-label="Direct link to Step 4: Create a proxy project" title="Direct link to Step 4: Create a proxy project" translate="no">​</a></h3>
<p>Purpose:</p>
<div class="language-text codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-text codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">User -&gt; Proxy Project -&gt; Remote Repo (HF) -&gt; Cache</span><br></span></code></pre></div></div>
<p>When creating the project:</p>
<ul>
<li class="">Select the <code>huggingface</code> remote registry</li>
<li class="">Specify the model organization: <code>deepseek-ai</code></li>
</ul>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/project1-c9d03f0dc077a772ef0c932f313e6548.PNG" width="1175" height="280" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/project2-80487645e674ae2b8e569aa29586fa8c.png" width="881" height="625" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/project3-1e0c19989c39c9d5412ef149594a4156.png" width="1147" height="290" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/project4-e396889bcdf6eb59313dc3225d98eba0.png" width="1168" height="301" class="img_ev3q"></p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-5-client-integration">Step 5: Client integration<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#step-5-client-integration" class="hash-link" aria-label="Direct link to Step 5: Client integration" title="Direct link to Step 5: Client integration" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">export HF_ENDPOINT="http://127.0.0.1:3001"</span><br></span></code></pre></div></div>
<p>What this does:</p>
<ul>
<li class="">Redirects client requests</li>
<li class="">Lets the first request fetch from Hugging Face</li>
<li class="">Automatically caches locally</li>
<li class="">Keeps all later requests inside the intranet</li>
</ul>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="step-6-download-the-model">Step 6: Download the model<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#step-6-download-the-model" class="hash-link" aria-label="Direct link to Step 6: Download the model" title="Direct link to Step 6: Download the model" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">hf download deepseek-ai/DeepSeek-V4-Pro</span><br></span></code></pre></div></div>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/download1-1e0348fac1f42ef95230d7858ae41321.png" width="1574" height="98" class="img_ev3q"></p>
<p>You can see 'DeepSeek-V4-Pro' model under 'deepseek-ai' Project in UI</p>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/download-abe95b47a5c9f6a4b7399e889bbc5572.png" width="1125" height="336" class="img_ev3q"></p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="verify-cache-effectiveness">Verify cache effectiveness<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#verify-cache-effectiveness" class="hash-link" aria-label="Direct link to Verify cache effectiveness" title="Direct link to Verify cache effectiveness" translate="no">​</a></h2>
<p>Use <code>curl</code> to observe request behavior.</p>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="first-request-cache-miss">First request: cache miss<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#first-request-cache-miss" class="hash-link" aria-label="Direct link to First request: cache miss" title="Direct link to First request: cache miss" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">curl -I http://127.0.0.1:3001/deepseek-ai/DeepSeek-V4-Pro/resolve/main/config.json</span><br></span></code></pre></div></div>
<p>Characteristics:</p>
<ul>
<li class="">Longer response time</li>
<li class="">Contains upstream headers</li>
</ul>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="second-request-cache-hit">Second request: cache hit<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#second-request-cache-hit" class="hash-link" aria-label="Direct link to Second request: cache hit" title="Direct link to Second request: cache hit" translate="no">​</a></h3>
<div class="language-bash codeBlockContainer_Ckt0 theme-code-block" style="--prism-color:#F8F8F2;--prism-background-color:#282A36"><div class="codeBlockContent_QJqH"><pre tabindex="0" class="prism-code language-bash codeBlock_bY9V thin-scrollbar" style="color:#F8F8F2;background-color:#282A36"><code class="codeBlockLines_e6Vv"><span class="token-line" style="color:#F8F8F2"><span class="token plain">curl -I http://127.0.0.1:3001/deepseek-ai/DeepSeek-V4-Pro/resolve/main/config.json</span><br></span></code></pre></div></div>
<p>Characteristics:</p>
<ul>
<li class="">Very fast response</li>
<li class="">No longer hits Hugging Face</li>
</ul>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="final-thoughts">Final thoughts<a href="https://matrixhub.ai/blog/deepseek-v4-distribution#final-thoughts" class="hash-link" aria-label="Direct link to Final thoughts" title="Direct link to Final thoughts" translate="no">​</a></h2>
<p>If you're deploying large models in an enterprise environment, you will inevitably face:</p>
<ul>
<li class="">Slow downloads</li>
<li class="">Bandwidth exhaustion</li>
<li class="">Repeated downloads across nodes</li>
<li class="">Lack of access control</li>
</ul>
<p>These are not edge cases. They are architectural gaps.</p>
<p>MatrixHub simply fills that missing layer.</p>
<p>If you're working on similar problems, feel free to connect:</p>
<p><a href="https://github.com/matrixhub-ai/matrixhub" target="_blank" rel="noopener noreferrer" class="">https://github.com/matrixhub-ai/matrixhub</a></p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Examples]]></title>
            <link>https://matrixhub.ai/blog/examples</link>
            <guid>https://matrixhub.ai/blog/examples</guid>
            <pubDate>Mon, 27 Apr 2026 00:00:00 GMT</pubDate>
            <description><![CDATA[Real-world MatrixHub usage examples for internal model distribution and caching.]]></description>
            <content:encoded><![CDATA[<p>Real-world examples of using MatrixHub.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="common-use-cases">Common use cases<a href="https://matrixhub.ai/blog/examples#common-use-cases" class="hash-link" aria-label="Direct link to Common use cases" title="Direct link to Common use cases" translate="no">​</a></h2>
<h3 class="anchor anchorTargetStickyNavbar_Vzrq" id="intranet-vllm-cluster-distribution">Intranet vLLM cluster distribution<a href="https://matrixhub.ai/blog/examples#intranet-vllm-cluster-distribution" class="hash-link" aria-label="Direct link to Intranet vLLM cluster distribution" title="Direct link to Intranet vLLM cluster distribution" translate="no">​</a></h3>
<ul>
<li class=""><strong>Scenario</strong>: A production intranet runs a vLLM inference cluster with 100 GPU servers. Because model files can be huge, such as a 70B model exceeding 130GB, having every machine pull from public Hugging Face is slow and may trigger outbound bandwidth throttling.</li>
<li class=""><strong>Flow overview</strong>:<!-- -->
<ol>
<li class=""><strong>Single access point</strong>: Set the <code>HF_ENDPOINT</code> environment variable of all vLLM nodes to the internal MatrixHub endpoint.</li>
<li class=""><strong>Pull once, cache for all</strong>: When the first node requests a model, MatrixHub pulls it from the public network and persists it locally; subsequent nodes hit the intranet cache directly.</li>
</ol>
</li>
</ul>
<blockquote>
<p>As a user, I want to point the <code>hf download</code> endpoint to MatrixHub so that later downloads inside the same network become much faster after the first request has already cached the model.</p>
</blockquote>
<h4 class="anchor anchorTargetStickyNavbar_Vzrq" id="steps">Steps<a href="https://matrixhub.ai/blog/examples#steps" class="hash-link" aria-label="Direct link to Steps" title="Direct link to Steps" translate="no">​</a></h4>
<ol>
<li class="">Visit the MatrixHub address <code>http://x.x.x.x:3001</code> and open the login page.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-9-11a9d9b71042d89118e669d7a1a9f046.png" width="1280" height="493" class="img_ev3q"></p>
<ol start="2">
<li class="">Log in as the admin user and open the model repository list.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-7-d7b8efac8a7379cb52072c6dc70ccb7f.png" width="1280" height="427" class="img_ev3q"></p>
<ol start="3">
<li class="">Click the top-right user menu, then go to Platform Settings and Repository Management.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-6-e1653e50fb45defa2f561d3a36a94932.png" width="1364" height="256" class="img_ev3q"></p>
<ol start="4">
<li class="">Create a target repository: select Hugging Face as the provider, set the repository name to <code>hf</code>, enter the target URL <code>https://hf-mirror.com</code>, enable remote certificate verification, and click <code>OK</code>.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-4-3d38e0bf35f4393bdcf136f4ffb0aa0d.png" width="1280" height="629" class="img_ev3q">
<img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-8-c3987fcab808e5e8c95be5f153c8a6cf.png" width="1479" height="256" class="img_ev3q"></p>
<ol start="5">
<li class="">Go to Project Management and open the project list page.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-5-ea4d67ab467e6f2368869220b38b43b6.png" width="1320" height="256" class="img_ev3q"></p>
<ol start="6">
<li class="">Click <code>Create Project</code>: set the project name to <code>qwen</code>, set it to <code>Public</code>, enable <code>Proxy</code>, select the repository, set the proxy organization to <code>Qwen</code>, and click <code>OK</code>.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-3-0972ccc28408ff87a516b47246e146b1.png" width="1280" height="504" class="img_ev3q"></p>
<ol start="7">
<li class="">
<p>Pull the model.</p>
<ul>
<li class=""><strong>First node</strong>: about <code>3m37.318s</code></li>
</ul>
</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-1-5aeb41c1ce1affca6ca6a91d7980747c.png" width="1785" height="256" class="img_ev3q"></p>
<ul>
<li class=""><strong>Second node</strong>: about <code>0m8.500s</code></li>
</ul>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-5aeb41c1ce1affca6ca6a91d7980747c.png" width="1785" height="256" class="img_ev3q"></p>
<ol start="8">
<li class="">View the model information in MatrixHub.</li>
</ol>
<p><img decoding="async" loading="lazy" src="https://matrixhub.ai/assets/images/scenario-test-en-2-1460f61f3109b83c2052d174a926dacb.png" width="1280" height="532" class="img_ev3q"></p>]]></content:encoded>
        </item>
    </channel>
</rss>