misdelivery
/

Llama-3.1-Swallow-70B-Nemotron-Instruct-v0.1

Model card Files Files and versions Community

Llama-3.1-Swallow-70B-Nemotron-Instruct-v0.1 / README.md

misdelivery's picture

Add files using upload-large-folder tool

bf6d9e2 verified 3 months ago

|

866 Bytes

	---
	license: apache-2.0
	tags:
	- merge
	- mergekit
	- lazymergekit
	- nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
	- tokyotech-llm/Llama-3.1-Swallow-70B-v0.1
	---

	# Llama-3.1-Swallow-70B-v0.1-Nemotron-Insturct

	Llama-3.1-Swallow-70B-v0.1-Nemotron-Insturct is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
	* [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF)
	* [tokyotech-llm/Llama-3.1-Swallow-70B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1)

	## 🧩 Configuration

	```yaml
	merge_method: task_arithmetic
	base_model: meta-llama/Llama-3.1-70B
	models:
	- model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
	parameters:
	weight: 1.0
	- model: tokyotech-llm/Llama-3.1-Swallow-70B-v0.1
	parameters:
	weight: 1.0
	dtype: bfloat16
	```