Text Generation · llama · pretrained · llama-3 · openllm-france

Jeronymous committed (verified) · Commit 93da255 · 1 Parent(s): a511b4f

Update README.md

Files changed (1):
  1. README.md +17 -0
README.md CHANGED
@@ -1,6 +1,7 @@
 ---
 license: apache-2.0
 pipeline_tag: text-generation
+base_model: OpenLLM-France/Lucie-7B
 language:
 - fr
 - en
@@ -26,3 +27,19 @@ widget:
 # top_k: null
 # max_new_tokens: null
 ---
+
+# Model Card
+
+This repository contains checkpoints (split for 512 GPUs) in DeepSpeed format for the [Lucie-7B model](https://huggingface.co/OpenLLM-France/Lucie-7B),
+which was trained using [this code repository](https://github.com/OpenLLM-France/Lucie-Training)
+based on [a fork of `Megatron-DeepSpeed`](https://github.com/OpenLLM-France/Megatron-DeepSpeed).
+
+Each checkpoint is stored in its own branch (revision), whose name specifies the number of training steps.
+For instance, `step0400000` corresponds to the checkpoint after 400,000 training steps.
+
+These checkpoints are provided so that training can be resumed from a given point.
+
+## Contact
+
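The revision naming described in the diff above can be sketched as follows. This is a minimal illustration, assuming step counts are zero-padded to seven digits to match the `step0400000` example in the card; the helper name `revision_name` is hypothetical, not part of the repository's code.

```python
# Hypothetical helper: build a checkpoint branch (revision) name from a step
# count, assuming the zero-padded 7-digit pattern seen in the model card.
def revision_name(steps: int) -> str:
    return f"step{steps:07d}"

print(revision_name(400000))  # step0400000
```

Under this assumption, such a name could be passed as the `revision` argument when fetching a specific checkpoint from the Hub.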