qwen2.5

Qwen2.5 模型是在阿里巴巴最新的超大规模数据集上预训练的，包含高达 18 万亿个词元。该模型支持高达 128K 个词元，并具有多语言支持。

工具 0.5b 1.5b 3b 7b 14b 32b 72b

1.9M 拉取更新于 7 周前

133 个标签

更新于 7 周前

7 周前

2b0f4078dcc1 · 355MB

Apache 许可证 2.0 版，200 年 1 月

11kB

自述文件

Qwen2.5 是最新的 Qwen 大语言模型系列。对于 Qwen2.5，我们发布了一系列基础语言模型和指令调整模型，其大小从 0.5 亿到 720 亿个参数不等。与 Qwen2 相比，Qwen2.5 引入了以下改进

它拥有 **显著更多知识**，并在 **编码** 和 **数学** 方面显著提升了能力，这得益于这些领域中专门的专家模型。
它在 **指令遵循**、**长文本生成**（超过 8K 个词元）、**理解结构化数据**（例如表格）和 **生成结构化输出**，特别是在 JSON 格式方面，展现出显著的进步。它也 **对不同的系统提示更具弹性**，提高了聊天机器人的角色扮演和条件设置。
它支持高达 128K 个词元的 **长上下文**，并且可以生成高达 8K 个词元的文本。
它提供 **多语言支持**，涵盖超过 29 种语言，包括中文、英文、法文、西班牙文、葡萄牙文、德文、意大利文、俄文、日文、韩文、越南文、泰文、阿拉伯文等等。

请注意：除了 3B 和 72B 模型外，所有模型都在 Apache 2.0 许可证下发布，而 3B 和 72B 模型则在 Qwen 许可证下发布。

参考文献

GitHub

博客文章

HuggingFace

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, a range of base language models and instruction-tuned models are released, with sizes ranging from 0.5 to 72 billion parameters. Qwen2.5 introduces the following improvements over Qwen2:

- It possesses **significantly more knowledge** and has greatly enhanced capabilities in **coding** and **mathematics**, due to specialized expert models in these domains.
- It demonstrates significant advancements in **instruction following**, **long-text generation** (over 8K tokens), **understanding structured data** (e.g., tables), and **generating structured outputs**, especially in JSON format. It is also **more resilient to diverse system prompts**, improving role-play and condition-setting for chatbots.
- It supports **long contexts** of up to 128K tokens and can generate up to 8K tokens.
- It offers **multilingual support** for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Please note: all models except the 3B and 72B are released under the Apache 2.0 license, while the 3B and 72B models are under the Qwen license.

## References

[GitHub](https://github.com/QwenLM/Qwen2.5)

[Blog post](https://qwenlm.github.io/blog/qwen2.5/)

[HuggingFace](https://hugging-face.cn/collections/Qwen/qwen25-66e81a666513e518adb90d9e)

粘贴、拖放或点击上传图像（.png、.jpeg、.jpg、.svg、.gif）