{"id":122,"date":"2025-01-29T13:35:00","date_gmt":"2025-01-29T05:35:00","guid":{"rendered":"https:\/\/blog.liu-qi.cn\/?p=122"},"modified":"2026-04-18T21:37:32","modified_gmt":"2026-04-18T13:37:32","slug":"%e9%86%92%e9%86%92%ef%bc%81%e4%bd%a0%e6%9c%ac%e5%9c%b0%e9%83%a8%e7%bd%b2%e7%9a%84deepseek-r1%ef%bc%8c%e5%ae%83%e4%b8%8d%e6%98%afr1","status":"publish","type":"post","link":"https:\/\/en.blog.liu-qi.cn\/2025\/01\/29\/%e9%86%92%e9%86%92%ef%bc%81%e4%bd%a0%e6%9c%ac%e5%9c%b0%e9%83%a8%e7%bd%b2%e7%9a%84deepseek-r1%ef%bc%8c%e5%ae%83%e4%b8%8d%e6%98%afr1\/","title":{"rendered":"Wake Up! Your Locally Deployed DeepSeek-R1 Isn't the Real R1"},"content":{"rendered":"<p>So, as of late, I&#8217;ve been seeing a lot of content about locally deploying DeepSeek-R1.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/001-fb7a5c954fff.png\" \/><\/p>\n<p>Many netizens have posted articles or videos claiming they&#8217;ve successfully deployed and used the DeepSeek-R1 model locally, which is said to rival OpenAI-o1. While it still feels a bit off, since DeepSeek-R1 is an open-source model, it can indeed be deployed locally.<\/p>\n<p>As an RTX4090 user, I&#8217;m considered to have a high-end setup among home PC enthusiasts. I figured since so many people have successfully deployed it locally, my setup should be able to handle it too. So, I went to check it out, and well&#8230; then I discovered:<\/p>\n<p>Translation is truly a wonderful thing!<\/p>\n<p>If you don&#8217;t want to use Google&#8217;s web translation, you should really try a translation plugin\u2014<a href=\"https:\/\/blog.liu-qi.cn\/2025\/01\/15\/%E5%B0%8F%E7%BA%A2%E4%B9%A6%E8%8B%B1%E6%96%87%E8%AF%84%E8%AE%BA%E5%A4%AA%E5%A4%9A%E7%9C%8B%E4%B8%8D%E6%87%82%E4%BA%86%EF%BC%9F%E6%8E%A8%E8%8D%90%E4%BD%A0%E8%AF%95%E8%AF%95%E6%B2%89%E6%B5%B8%E5%BC%8F\/\">Are you struggling to read the flood of English comments on Xiaohongshu? Try Immersive Translate!<\/a>\u3002<\/p>\n<p>Indeed, DeepSeek released several smaller parameter models along with R1, including 1.5B, 7B, and 8B. But looking at the names, even if you&#8217;re not familiar with Llama, doesn&#8217;t that Qwen look somewhat familiar? I was about to spell it out in Pinyin.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/002-b5dd86c40b45.png\" \/><\/p>\n<p>The GitHub and Hugging Face pages state it clearly. If I had to pick one to blame, I think it might be Ollama.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/003-135e282f6dd8.png\" \/><\/p>\n<p>It says deepseek-r1 at the top, and below it lists 1.5b\/7b\/8b\/14b\/32b\/70b\/671b.<\/p>\n<p>But, friends, if you enable translation and scroll down further.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/004-9d69ce52ea0a.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/005-14b239d4dfd3.png\" \/><\/p>\n<p>These are models created by fine-tuning several dense models widely used in the research community with reasoning data generated by DeepSeek-R1.<\/p>\n<p>The GitHub page also includes relevant information:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/006-663e783364bc.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/007-63b5173bc4cd.png\" \/><\/p>\n<p>DeepSeek-R1-Distill models are fine-tuned from open-source models using samples generated by DeepSeek-R1. We have slightly modified their configurations and tokenizers. Please use our settings to run these models.<\/p>\n<p>So, the DeepSeek-R1-Distill-Qwen-1.5B model is essentially still Qwen2.5-Math-1.5B; the DeepSeek-R1-Distill-Llama-8B model is essentially still Llama-3.1-8B.<\/p>\n<p>DeepSeek released these distilled models to demonstrate that &#8216;the reasoning patterns of larger models can be distilled into smaller models, achieving better performance compared to reasoning patterns discovered through RL on small models.&#8217;<\/p>\n<p>The true R1 models officially released are only DeepSeek-R1 and DeepSeek-R1-Zero, both trained based on DeepSeek-V3-Base with a parameter scale of 671B:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2026\/04\/008-ff646d468bc3.png\" \/><\/p>\n<p>If you absolutely must locally deploy the real R1, you might only try the community-released 1.58-bit precision quantized version of DeepSeek-R1-GGUF. Using a GPU with 24GB VRAM like the RTX 4090 can achieve output speeds of up to 1-3 tokens per second.<\/p>\n<p>So, stop messing around.<\/p>\n<p>The web version is free, and the API isn&#8217;t expensive either\u2014why bother?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Many online posts about locally deploying DeepSeek-R1 are actually referring to smaller distilled models, not the true 671B parameter R1.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24],"tags":[],"class_list":["post-122","post","type-post","status-publish","format-standard","hentry","category-articles"],"_links":{"self":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts\/122","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/comments?post=122"}],"version-history":[{"count":0,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts\/122\/revisions"}],"wp:attachment":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/media?parent=122"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/categories?post=122"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/tags?post=122"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}