sailor2:1b-chat-q8_0

Sailor2是一个社区驱动的项目，旨在将最先进的多语言语言模型引入东南亚（SEA）。我们的研究表明，在生产环境中对**80亿和200亿**参数规模的模型有很强的需求，同时**10亿参数规模的模型**也适用于专业应用，例如推测解码和研究目的。这些模型在**Apache 2.0许可证**下发布，提高了该地区对先进语言技术的可访问性。

Sailor2基于强大的多语言模型Qwen 2.5，并持续在5000亿个token上进行预训练，以更好地支持15种语言的统一模型。这些语言包括英语、汉语、缅甸语、宿务语、伊洛卡诺语、印尼语、爪哇语、高棉语、老挝语、马来语、巽他语、塔加拉语、泰语、越南语和瓦拉伊语。通过满足对多样化、强大和易访问的语言模型日益增长的需求，Sailor2旨在为东南亚地区服务不足的地区提供开放、包容和易访问的多语言大型语言模型。Sailor2模型有三种尺寸：10亿、80亿和200亿参数，分别基于Qwen2.5的0.5亿、70亿和140亿参数的基准模型进行扩展。

![logo](/assets/mchiang0610/sailor2/a76a9182-cc11-47e1-bb50-478ad4ccb157)

Sailor2 is a community-driven initiative that brings cutting-edge multilingual language models to South-East Asia (SEA). Our research highlights a strong demand for models in the **8B and 20B** parameter range for production use, alongside **1B models** for specialized applications, such as speculative decoding and research purposes. These models, released under the **Apache 2.0 license**, provide enhanced accessibility to advanced language technologies across the region.

Sailor2 builds upon the foundation of the awesome multilingual model Qwen 2.5 and is continuously pre-trained on 500B tokens to support 15 languages better with a unified model. These languages include English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray. By addressing the growing demand for diverse, robust, and accessible language models, Sailor2 seeks to serve the underserved in SEA areas with open, inclusive, and accessible multilingual LLMs. The Sailor2 model comes in three sizes, 1B, 8B, and 20B, which are expanded from the Qwen2.5 base models of 0.5B, 7B, and 14B, respectively.

粘贴、拖放或点击上传图像（.png、.jpeg、.jpg、.svg、.gif）