license: apache-2.0 | |
tags: | |
- merge | |
- mergekit | |
- lazymergekit | |
- nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | |
- tokyotech-llm/Llama-3.1-Swallow-70B-v0.1 | |
# Llama-3.1-Swallow-70B-v0.1-Nemotron-Insturct | |
Llama-3.1-Swallow-70B-v0.1-Nemotron-Insturct is a merge of the following models using [mergekit](https://github.com/cg123/mergekit): | |
* [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) | |
* [tokyotech-llm/Llama-3.1-Swallow-70B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1) | |
## 🧩 Configuration | |
```yaml | |
merge_method: task_arithmetic | |
base_model: meta-llama/Llama-3.1-70B | |
models: | |
- model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | |
parameters: | |
weight: 1.0 | |
- model: tokyotech-llm/Llama-3.1-Swallow-70B-v0.1 | |
parameters: | |
weight: 1.0 | |
dtype: bfloat16 | |
``` |