sailor2:1b

Sailor2 是一项社区驱动的计划，旨在为东南亚 (SEA) 地区带来最先进的多语言模型。我们的研究强调了对生产使用的 8B 和 20B 参数范围模型以及用于特定应用（如推测解码和研究目的）的 1B 模型的强烈需求。这些模型以 Apache 2.0 许可证发布，为该地区提供了更广泛地访问先进语言技术的途径。

Sailor2 建立在优秀的多语言模型 Qwen 2.5 的基础上，并在 500B 标记上持续进行预训练，以更好地支持 15 种语言的统一模型。这些语言包括英语、中文、缅甸语、宿务语、伊洛卡诺语、印度尼西亚语、爪哇语、高棉语、老挝语、马来语、巽他语、他加禄语、泰语、越南语和瓦瑞语。通过解决对多样化、强大且可访问的语言模型日益增长的需求，Sailor2 寻求通过开放、包容和可访问的多语言 LLM 为东南亚地区服务不足的地区提供服务。Sailor2 模型有三种大小，分别为 1B、8B 和 20B，它们分别从 Qwen2.5 的 0.5B、7B 和 14B 基础模型扩展而来。

![logo](/assets/mchiang0610/sailor2/a76a9182-cc11-47e1-bb50-478ad4ccb157)

Sailor2 is a community-driven initiative that brings cutting-edge multilingual language models to South-East Asia (SEA). Our research highlights a strong demand for models in the **8B and 20B** parameter range for production use, alongside **1B models** for specialized applications, such as speculative decoding and research purposes. These models, released under the **Apache 2.0 license**, provide enhanced accessibility to advanced language technologies across the region.

Sailor2 builds upon the foundation of the awesome multilingual model Qwen 2.5 and is continuously pre-trained on 500B tokens to support 15 languages better with a unified model. These languages include English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray. By addressing the growing demand for diverse, robust, and accessible language models, Sailor2 seeks to serve the underserved in SEA areas with open, inclusive, and accessible multilingual LLMs. The Sailor2 model comes in three sizes, 1B, 8B, and 20B, which are expanded from the Qwen2.5 base models of 0.5B, 7B, and 14B, respectively.

粘贴、拖放或单击以上传图片（.png、.jpeg、.jpg、.svg、.gif）