# bigstral

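I reached for SOLAR but realized it only has a 4k context… so I decided to make my own 32k version.

This is mistralai/Mistral-7B-Instruct-v0.2 with its layers interleaved to create a larger 12b model.

This is a merge of pre-trained language models created using mergekit.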

## Prompt Format

<s>[INST] What is your favourite condiment? [/INST]
Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!</s>
[INST] Do you have dangerously spicy mayonnaise recipes? [/INST]
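
If you want to assemble this format by hand, here is a minimal Python sketch. The helper name `build_prompt` and the `(user, assistant)` tuple layout are made up for illustration, and the newline separators simply mirror how the example above is displayed. The upstream template may space turns differently, so treat this as an approximation rather than the reference implementation.

```python
def build_prompt(turns):
    """Render (user, assistant) pairs in the [INST] format shown above.

    `turns` is a list of (user_message, assistant_reply) tuples. Use None as
    the reply of the last turn to leave the prompt open for generation.
    This helper is hypothetical and not part of any library.
    """
    parts = []
    for index, (user, assistant) in enumerate(turns):
        # The beginning-of-sequence marker appears once, before the first turn.
        prefix = "<s>" if index == 0 else ""
        parts.append(f"{prefix}[INST] {user} [/INST]")
        if assistant is not None:
            # Completed assistant replies are closed with the end-of-sequence marker.
            parts.append(f"{assistant}</s>")
    return "\n".join(parts)


if __name__ == "__main__":
    print(build_prompt([
        ("What is your favourite condiment?", "Lemon juice, probably!"),
        ("Do you have dangerously spicy mayonnaise recipes?", None),
    ]))
```

If you load the model with the `transformers` tokenizer, `tokenizer.apply_chat_template` is probably the safer route, since it uses whatever template ships with the tokenizer instead of a hand-rolled string.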

## Example Output

Give me some code that is very simple in JavaScript but would be very difficult in ANSI-C

> In JavaScript, functions are first-class values, which means you can assign them to variables, pass them as arguments to other functions, and even nest them inside other functions. This results in concise and expressive code.
>
> Here's an example of a simple recursive function that calculates the factorial of a given number in JavaScript:
>
> ```javascript
> const factorial = n => n === 0 ? 1 : n * factorial(n - 1);
>
> console.log(factorial(5)); // Output: 120
> ```
>
> Now, let's try to write an equivalent code snippet in ANSI-C with similar functionality:
>
> ```c
> #include <stdio.h>
> #define MAX_RECURSION_DEPTH 1000
> int factorial(int n) {
>     if (n == 0)
>         return 1;
>     else {
>         int result;
>         result = factorial(--n);
>         if (++depth > MAX_RECURSION_DEPTH)
>             while(1){} // Recursion stack overflow
>         return n * result;
>     }
> }
> int main() {
>     int i, result;
>     int n = 5;
>     int depth = 0;
>
>     result = factorial(n);
>     printf("Factorial of %d is: %d\n", n, result);
>     return 0;
> }
> ```
>
> As you can see, the ANSI-C code becomes more verbose and has several disadvantages compared to JavaScript:
>
> 1. Need to define a maximum recursion depth using a constant or an array index, which can lead to hardcoded limits and potential stack overflow issues.
> 2. The use of preprocessor directives (#define) for defining functions and macros can make the code less readable and harder to understand.
> 3. The lack of closures in ANSI-C requires using static variables or passing arguments as arguments to nested functions, making the code more verbose and less concise.

## Merge Details
### Merge Method

This model was merged using the passthrough merge method.
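
As I understand it, a passthrough merge copies the listed layer ranges unchanged and stacks them in order, so overlapping ranges duplicate layers. The sketch below is only back-of-the-envelope arithmetic, not mergekit code. It takes the layer ranges from this card's configuration, assumes each range is half-open (end index excluded), and assumes the usual Mistral-7B dimensions (hidden size 4096, MLP size 14336, 8 KV heads of dimension 128, vocabulary 32000). Check those against the base model's `config.json` before relying on the numbers.

```python
# Layer ranges copied from this card's YAML configuration.
# Assumption: each range is half-open, so [0, 8] means layers 0 through 7.
SLICES = [(0, 8), (4, 12), (8, 16), (12, 20), (16, 24), (20, 28), (24, 32)]

# Assumed Mistral-7B dimensions (verify against the base model's config.json).
HIDDEN = 4096
INTERMEDIATE = 14336
NUM_KV_HEADS = 8
HEAD_DIM = 128
VOCAB = 32000


def stacked_layers(slices):
    """Return the source-layer index of every layer in the merged stack."""
    order = []
    for start, end in slices:
        order.extend(range(start, end))
    return order


def params_per_layer():
    """Weights in one decoder layer: attention, gated MLP, and two norms."""
    kv_dim = NUM_KV_HEADS * HEAD_DIM
    attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * kv_dim  # q, o and k, v
    mlp = 3 * HIDDEN * INTERMEDIATE  # gate, up, down projections
    norms = 2 * HIDDEN
    return attention + mlp + norms


def total_params(num_layers):
    """Decoder layers plus input embeddings, output head, and final norm."""
    embeddings = 2 * VOCAB * HIDDEN  # assumes untied input and output matrices
    return num_layers * params_per_layer() + embeddings + HIDDEN


layers = stacked_layers(SLICES)
print(len(layers))   # 56 layers, up from 32 in the base model
print(layers[:12])   # [0, 1, 2, 3, 4, 5, 6, 7, 4, 5, 6, 7]
print(f"{total_params(32) / 1e9:.2f}B -> {total_params(len(layers)) / 1e9:.2f}B")
```

Under those assumptions the stack grows from 32 to 56 layers and from roughly 7.24B to roughly 12.48B parameters, which lines up with the "12b" in the description.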

### Models Merged

The following models were included in the merge:
* [mistralai/Mistral-7B-Instruct-v0.2](https://hugging-face.cn/mistralai/Mistral-7B-Instruct-v0.2)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
dtype: float16
merge_method: passthrough
slices:
- sources:
  - layer_range: [0, 8]
    model: mistralai/Mistral-7B-Instruct-v0.2
- sources:
  - layer_range: [4, 12]
    model: mistralai/Mistral-7B-Instruct-v0.2
- sources:
  - layer_range: [8, 16]
    model: mistralai/Mistral-7B-Instruct-v0.2
- sources:
  - layer_range: [12, 20]
    model: mistralai/Mistral-7B-Instruct-v0.2
- sources:
  - layer_range: [16, 24]
    model: mistralai/Mistral-7B-Instruct-v0.2
- sources:
  - layer_range: [20, 28]
    model: mistralai/Mistral-7B-Instruct-v0.2
- sources:
  - layer_range: [24, 32]
    model: mistralai/Mistral-7B-Instruct-v0.2