3 个月前更新
3 个月前
d1c82fd1af09 · 2.1GB
模型
架构phi3
·
参数3.82B
·
量化Q3_K_L
2.1GB
参数
{"stop":["\u003c|end|\u003e","\u003c|user|\u003e","\u003c|assistant|\u003e"]}
78B
模板
{{- range .Messages }} {{- if eq .Role "user" }}<|user|> {{ .Content }}<|end|> <|assistant|> {{- els
172B
许可证
Apache License Version 2.0, January 2004
11kB
自述文件
NuMind 结构提取模型 🔥
NuExtract 是 phi-3-mini 的一个版本,在私有的高质量合成数据集上微调,用于信息提取。要使用该模型,请提供一段输入文本(不超过 2000 个 token)和一个描述要提取信息的 JSON 模板。
注意:此模型纯粹是提取型的,因此模型输出的所有文本都原样存在于原始文本中。你还可以提供一个输出格式示例,帮助模型更准确地理解你的任务。
用法
提示格式
此模型在使用特定的提示格式提取文本时效果最佳
### Template:
{
"Model": {
"Name": "",
"Number of parameters": "",
},
"Usage": {
"Use case": [],
"Licence": ""
}
}
### Example:
{
"Model": {
"Name": "Llama3",
"Number of parameters": "8 billion",
},
"Usage": {
"Use case":[
"chat",
"code completion"
],
"Licence": "Meta Llama3"
}
}
### Text:
We introduce Mistral 7B, a 7–billion-parameter language model engineered for superior performance and efficiency. Mistral 7B outperforms the best open 13B model (Llama 2) across all evaluated benchmarks, and the best released 34B model (Llama 1) in reasoning, mathematics, and code generation. Our model leverages grouped-query attention (GQA) for faster inference, coupled with sliding window attention (SWA) to effectively handle sequences of arbitrary length with a reduced inference cost. We also provide a model fine-tuned to follow instructions, Mistral 7B – Instruct, that surpasses Llama 2 13B – chat model both on human and automated benchmarks. Our models are released under the Apache 2.0 license.
Code: https://github.com/mistralai/mistral-src
Webpage: https://mistral.ai/news/announcing-mistral-7b/