misdelivery's picture
Add files using upload-large-folder tool
bf6d9e2 verified
|
raw
history blame
866 Bytes
---
license: apache-2.0
tags:
- merge
- mergekit
- lazymergekit
- nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
- tokyotech-llm/Llama-3.1-Swallow-70B-v0.1
---
# Llama-3.1-Swallow-70B-v0.1-Nemotron-Insturct
Llama-3.1-Swallow-70B-v0.1-Nemotron-Insturct is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
* [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF)
* [tokyotech-llm/Llama-3.1-Swallow-70B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1)
## 🧩 Configuration
```yaml
merge_method: task_arithmetic
base_model: meta-llama/Llama-3.1-70B
models:
- model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
parameters:
weight: 1.0
- model: tokyotech-llm/Llama-3.1-Swallow-70B-v0.1
parameters:
weight: 1.0
dtype: bfloat16
```