DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT

Text Generation

critical-thinking

argument-mapping

text-generation-inference

Inference Endpoints


ggbetz committed 9 days ago

Commit 16954e7 · verified · 1 Parent(s): f3e7417

Update README.md

Files changed (1)

README.md +4 -5


@@ -37,11 +37,7 @@ print(output["generated_text"])
 ## Evals
-## Training procedure
-SFT dataset mixture:
 |dataset|weight (examples)| weight (tokens)|
 |:------|:----:|:----:|
@@ -49,6 +45,9 @@ SFT dataset mixture:
 |DebateLabKIT/deep-argmap-conversations|25%|18%|
 |allenai/tulu-3-sft-mixture|50%|33%|
 Trained with SFT on **1M examples** and for 1 epoch with
 * context length 8196

 ## Evals
+## SFT dataset mixture
 |dataset|weight (examples)| weight (tokens)|
 |:------|:----:|:----:|
 |DebateLabKIT/deep-argmap-conversations|25%|18%|
 |allenai/tulu-3-sft-mixture|50%|33%|
+## Training procedure
 Trained with SFT on **1M examples** and for 1 epoch with
 * context length 8196