Update README.md
README.md CHANGED

@@ -4,7 +4,7 @@ language:
 - en
 pipeline_tag: text-generation
 base_model:
-- allenai/
+- allenai/OLMo-2-13B-1124
 library_name: transformers
 ---
 
@@ -36,7 +36,7 @@ The core models released in this batch include the following:
 - **Model type:** A model trained on a mix of publicly available, synthetic and human-created datasets.
 - **Language(s) (NLP):** Primarily English
 - **License:** Apache 2.0
-- **Finetuned from model:** allenai/
+- **Finetuned from model:** allenai/OLMo-2-13B-1124
 
 ### Model Sources
 
@@ -84,7 +84,7 @@ The model has not been trained with a specific system prompt in mind.
 
 ### Bias, Risks, and Limitations
 
-The
+The OLMo-2 models have limited safety training and are not deployed with automatic in-the-loop filtering of responses the way ChatGPT is, so they can produce problematic outputs (especially when prompted to do so).
 See the Falcon 180B model card for an example of this.
 
 
@@ -92,7 +92,7 @@ See the Falcon 180B model card for an example of this.
 
 TODO
 
-##
+## Hyperparameters
 
 SFT:
 - **Learning Rate**: 1E-5 (7B), 7.5E-6 (13B)
@@ -105,13 +105,13 @@ SFT:
 
 ## License and use
 
-
-
+OLMo-2 is licensed under the Apache 2.0 license.
+OLMo-2 is intended for research and educational use.
 For more information, please see our [Responsible Use Guidelines](https://allenai.org/responsible-use).
 
 ## Citation
 
-If
+If OLMo-2 or any of the related materials were helpful to your work, please cite:
 ```
 TODO
 ```
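For reference, a minimal sketch of loading a checkpoint with the `transformers` text-generation pipeline, matching the `library_name: transformers` and `pipeline_tag: text-generation` declarations in the YAML header above. The repo id shown is the `base_model` value from the diff, standing in for this card's own checkpoint id (which the diff does not show); the prompt and generation settings are arbitrary assumptions.

```python
# Minimal sketch, not the card's official usage snippet.
# Assumes a recent transformers release with OLMo-2 support.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="allenai/OLMo-2-13B-1124",  # base_model from the YAML; swap in this card's repo id
)

out = generator("Language models are", max_new_tokens=32)
print(out[0]["generated_text"])
```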
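The hyperparameters hunk quotes only the SFT learning rates and does not name the training stack used. As a hedged illustration of where those values would sit, here is a standard `transformers` `TrainingArguments`; the output directory is a made-up placeholder, not a path from the card.

```python
# Illustration only: the card lists SFT learning rates of 1E-5 (7B)
# and 7.5E-6 (13B) but does not say which trainer was used.
from transformers import TrainingArguments

sft_args = TrainingArguments(
    output_dir="olmo2-13b-sft",  # hypothetical placeholder
    learning_rate=7.5e-6,        # 13B value from the card; use 1e-5 for 7B
)
```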
|