huggingface/open-r1: Fully open reproduction of DeepSeek-R1
A fully open reproduction of DeepSeek-R1. This repo is a work in progress, let's build it together!
The goal of this repo is to build the missing pieces of the R1 pipeline so that everyone can reproduce it and build on top of it. The project is simple by design and mostly consists of:

Caution: Libraries rely on CUDA 12.4. If you see errors related to segmentation faults, double-check the CUDA version your system is running with `nvcc --version`. To run the code in this project, first create a Python virtual environment, e.g. using uv. To install uv, follow the UV Installation Guide.
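The setup above can be sketched as shell commands. The environment name and Python version here are illustrative choices, not prescribed by the repo:

```shell
# Confirm which CUDA toolkit the system provides (should report release 12.4)
nvcc --version

# Create and activate a virtual environment with uv
# ("openr1" and the Python version are arbitrary examples)
uv venv openr1 --python 3.11
source openr1/bin/activate
```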
Tip: For Hugging Face cluster users, add `export UV_LINK_MODE=copy` to your `.bashrc` to suppress cache warnings from uv.
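One way to apply this tip is to append the export line to your shell profile:

```shell
# Persist the uv link-mode setting so cache-link warnings are suppressed
# in every new shell session
echo 'export UV_LINK_MODE=copy' >> ~/.bashrc
```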
Note: As a shortcut, run make install to set up the development libraries (spelled out below). Afterwards, if everything is set up correctly, you can try out the Open-R1 models.
Tip: If you scale up/down the number of GPUs, we recommend also scaling up the per-device batch size or number of gradient accumulation steps to keep the global batch size constant.
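To illustrate the tip above: the global batch size is the per-device batch size times the number of GPUs times the gradient accumulation steps, so a change in one factor can be offset in another. The numbers below are hypothetical examples, not the repo's defaults:

```shell
# Global batch = per-device batch * num GPUs * gradient accumulation steps
# 8 GPUs, per-device batch 16, accumulation 4:
echo $((16 * 8 * 4))   # prints 512

# Halving the GPUs to 4 while doubling accumulation to 8
# keeps the global batch size unchanged:
echo $((16 * 4 * 8))   # prints 512
```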
We provide support for filtering datasets by generating completions and computing pass rates on verifiable tasks; see this README.
🚨 WARNING 🚨: Most base models like meta-llama/Llama-3.2-1B do not have a chat template, so we set ChatML as the default during training. However, for Qwen base models like Qwen/Qwen2.5-1.5B, a chat template is pre-defined in the tokenizer, so the EOS token must be set accordingly.