DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT

Tags: Text Generation, critical-thinking, argument-mapping, text-generation-inference


ggbetz committed 9 days ago

Commit 73286d4 · verified · 1 parent: 3484ab6

Update README.md

Files changed (1)

README.md +18 -2


@@ -34,6 +34,11 @@ output = generator([{"role": "user", "content": question}], max_new_tokens=128,
 print(output["generated_text"])
 ```
+## Evals
 ## Training procedure
 SFT dataset mixture:
@@ -61,8 +66,9 @@ warmup_ratio: 0.1
 Hardware: 2 x H100 GPUs.
+_This work was performed on the HoreKa supercomputer funded by the
+Ministry of Science, Research and the Arts Baden-Württemberg and by
+the Federal Ministry of Education and Research._
 ### Framework versions
@@ -71,3 +77,13 @@ Hardware: 2 x H100 GPUs.
 - Pytorch: 2.4.1
 - Datasets: 3.1.0
 - Tokenizers: 0.20.3
+## Credits
+This work wouldn't be possible without all the **great contributions from the open LLM community**. Thank you! Special kudos go to
+- @philschmid for his latest [fine-tuning boilerplate](https://www.philschmid.de/fine-tune-llms-in-2025)
+- @lvwerra, @lewtun et al for building and maintaining [trl](https://github.com/huggingface/trl)
+- @cognitivecomputations for sharing [spectrum](https://github.com/cognitivecomputations/spectrum/tree/main)