deepseek-r1:7b-qwen-distill-q4_K_M - Ollama

deepseek-r1

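DeepSeek's first-generation reasoning models, achieving performance comparable to OpenAI-o1 across math, code, and reasoning tasks, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen.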

1.5b 7b 8b 14b 32b 70b 671b

7.7M downloads · Updated 13 days ago

Updated 2 weeks ago

0a8c26691023 · 4.7GB

{ "stop": [ "<｜begin of sentence｜>", "<｜end of sentence｜>",

{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice

MIT License Copyright (c) 2023 DeepSeek Permission is hereby granted, free of charge, to any person


<img src="/assets/library/deepseek-v3/069ccc94-63b0-41e6-b2b3-e8e56068ab1a" width="320" />

DeepSeek's first-generation reasoning models, achieving performance comparable to OpenAI-o1 across math, code, and reasoning tasks.

## Models

**DeepSeek-R1**

```
ollama run deepseek-r1:671b
```

### Distilled models

The DeepSeek team has demonstrated that the reasoning patterns of larger models can be distilled into smaller models, yielding better performance than the reasoning patterns discovered through reinforcement learning (RL) on small models.

The models below were created by fine-tuning several dense models widely used in the research community on reasoning data generated by DeepSeek-R1. The evaluation results show that the distilled smaller dense models perform exceptionally well on benchmarks.

**DeepSeek-R1-Distill-Qwen-1.5B**

```
ollama run deepseek-r1:1.5b
```

**DeepSeek-R1-Distill-Qwen-7B**

```
ollama run deepseek-r1:7b
```

**DeepSeek-R1-Distill-Llama-8B**

```
ollama run deepseek-r1:8b
```

**DeepSeek-R1-Distill-Qwen-14B**

```
ollama run deepseek-r1:14b
```

**DeepSeek-R1-Distill-Qwen-32B**

```
ollama run deepseek-r1:32b
```

**DeepSeek-R1-Distill-Llama-70B**

```
ollama run deepseek-r1:70b
```
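When scripting against several of these checkpoints, it can help to keep the name-to-tag mapping in one place. The following is a minimal sketch, not part of the Ollama CLI: `r1_tag` is a hypothetical helper name, and the mapping simply restates the commands listed above.

```shell
#!/bin/sh
# Hypothetical helper (not an Ollama command): map a DeepSeek-R1 checkpoint
# name to the Ollama tag listed on this page.
r1_tag() {
  case "$1" in
    DeepSeek-R1)                   echo "deepseek-r1:671b" ;;
    DeepSeek-R1-Distill-Qwen-1.5B) echo "deepseek-r1:1.5b" ;;
    DeepSeek-R1-Distill-Qwen-7B)   echo "deepseek-r1:7b" ;;
    DeepSeek-R1-Distill-Llama-8B)  echo "deepseek-r1:8b" ;;
    DeepSeek-R1-Distill-Qwen-14B)  echo "deepseek-r1:14b" ;;
    DeepSeek-R1-Distill-Qwen-32B)  echo "deepseek-r1:32b" ;;
    DeepSeek-R1-Distill-Llama-70B) echo "deepseek-r1:70b" ;;
    *) echo "unknown model: $1" >&2; return 1 ;;
  esac
}

# Example usage (assumes Ollama is installed locally):
#   ollama run "$(r1_tag DeepSeek-R1-Distill-Qwen-7B)"
```

The helper only prints a tag; it never contacts Ollama itself, so an unknown name fails fast with a non-zero exit status before any download is attempted.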

![deepseek](/assets/library/deepseek-r1/e44d096e-fa46-4cae-b2f2-53991e8c8da0)

### License

The model weights are licensed under the MIT License. The DeepSeek-R1 series supports commercial use and allows any modifications and derivative works, including, but not limited to, distillation for training other LLMs. Please note that:

The Qwen distilled models are derived from the Qwen-2.5 series, which was originally licensed under the Apache 2.0 License, and are now fine-tuned with 800k samples curated with DeepSeek-R1.

The Llama 8B distilled model is derived from Llama3.1-8B-Base, which was originally licensed under the llama3.1 license.

The Llama 70B distilled model is derived from Llama3.3-70B-Instruct, which was originally licensed under the llama3.3 license.
