granite3.1-moe:1b-instruct-q4_1 - Ollama 框架

granite3.1-moe

IBM Granite 1B 和 3B 模型是 IBM 推出的长上下文专家混合 (MoE) Granite 模型，专为低延迟使用而设计。

工具 1b 3b

20.6K 拉取次数更新于 2 周前

更新于 2 周前

2 周前

d62c69d05bc9 · 861MB

架构granitemoe

知识截止日期：2024 年 4 月。您是 Granite，由 IBM 开发。

<|start_of_role|>system<|end_of_role|> {{- if and (gt (len .Messages) 0) (eq (index .Messages 0).Rol

Apache License Version 2.0, January 2004

自述

Granite 专家混合模型

IBM Granite 1B 和 3B 模型是 IBM 推出的长上下文专家混合 (MoE) Granite 模型，专为低延迟使用而设计。

这些模型使用超过 10 万亿个 tokens 的数据进行训练，Granite MoE 模型是在设备端应用程序或需要即时推理的情况下部署的理想选择。

参数大小

1B

ollama run granite3.1-moe:1b

3B

ollama run granite3.1-moe:3b

支持的语言

英语、德语、西班牙语、法语、日语、葡萄牙语、阿拉伯语、捷克语、意大利语、韩语、荷兰语、中文（简体）

功能

摘要
文本分类
文本提取
问答
检索增强生成 (RAG)
代码相关任务
函数调用任务
多语言对话用例
长上下文任务，包括长文档/会议摘要、长文档问答等。

Granite 稠密模型

Granite 稠密模型提供 2B 和 8B 参数大小，旨在支持基于工具的用例和检索增强生成 (RAG)，从而简化代码生成、翻译和错误修复。

查看模型页面

了解更多

开发者： IBM Research
GitHub 仓库： ibm-granite/granite-language-models
网站: Granite 文档
发布日期: 2024 年 12 月 18 日
许可证： Apache 2.0。

## Granite mixture of experts models

The IBM Granite **1B and 3B models** are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.

The models are trained on over 10 trillion tokens of data, the Granite MoE models are ideal for deployment in on-device applications or situations requiring instantaneous inference.

### Parameter Sizes

**1B:**
  
`ollama run granite3.1-moe:1b`

**3B:**

`ollama run granite3.1-moe:3b`

### Supported Languages
English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)

### Capabilities
* Summarization
* Text classification
* Text extraction
* Question-answering
* Retrieval Augmented Generation (RAG)
* Code related tasks
* Function-calling tasks
* Multilingual dialog use cases
* Long-context tasks including long document/meeting summarization, long document QA, etc.

## Granite dense models

The Granite dense models are available in **2B and 8B** parameter sizes designed to support tool-based use cases and for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

[See model page](https://ollama.org.cn/library/granite3-dense)

## Learn more

- **Developers:** IBM Research
- **GitHub Repository:** [ibm-granite/granite-language-models](https://github.com/ibm-granite/granite-3.1-language-models)
- **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
- **Release Date**: December 18th, 2024
- **License:** [Apache 2.0](https://apache.ac.cn/licenses/LICENSE-2.0).

粘贴、拖放或单击以上传图像（.png、.jpeg、.jpg、.svg、.gif）