Safetensors · English · olmo2

amanrangapur committed · Commit 94c5532 · verified · 1 Parent(s): 2d59a16

Update README.md

Files changed (1)
  1. README.md +21 -17
README.md CHANGED
@@ -9,10 +9,12 @@ language:
 
 ## Model Details
 
-<img src="https://allenai.org/olmo/olmo-7b-animation.gif" alt="OLMo Logo" width="800" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
+<img alt="OLMo Logo" src="https://huggingface.co/datasets/allenai/blog-images/resolve/main/olmo2/olmo.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
 
 
-# Model Card for OLMo2 13B
+# Model Card for OLMo 2 13B
+We introduce OLMo 2, a new family of 7B and 13B models featuring a 9-point increase in MMLU, among other evaluation improvements, compared to the original [OLMo 7B](https://huggingface.co/allenai/OLMo-7B) model. These gains come from training on the [OLMo-mix-1124](https://huggingface.co/datasets/allenai/olmo-mix-1124) and [Dolmino-mix-1124](https://huggingface.co/datasets/allenai/dolmino-mix-1124) datasets and a staged training approach.
+
 
 OLMo is a series of **O**pen **L**anguage **Mo**dels designed to enable the science of language models.
 These models are trained on the Dolma dataset. We are releasing all code, checkpoints, logs (coming soon), and associated training details.
@@ -23,6 +25,17 @@ The core models released in this batch include the following:
 | [OLMo2-7B](https://huggingface.co/allenai/OLMo-1124-7B) | 4 Trillion | 32 | 4096 | 32 | 4096 |
 | [OLMo2-13B](https://huggingface.co/allenai/OLMo2-1124-13B) | 5 Trillion | 40 | 5120 | 42 | 4096 |
 
+The core models released in this batch include the following:
+
+| **Stage** | **OLMo 2 7B** | **OLMo 2 13B** |
+|----------------------|------------------------------------------------|------------------------------------------------|
+| **Base Model** | [allenai/OLMo2-7B-1124](https://huggingface.co/allenai/OLMo2-7B-1124) | [allenai/OLMo-2-13B-1124](https://huggingface.co/allenai/OLMo-2-13B-1124) |
+| **SFT** | [allenai/OLMo-2-1124-7B-SFT](https://huggingface.co/allenai/OLMo-2-1124-7B-SFT) | [allenai/OLMo-2-1124-13B-SFT](https://huggingface.co/allenai/OLMo-2-1124-13B-SFT) |
+| **DPO** | [allenai/OLMo-2-1124-7B-DPO](https://huggingface.co/allenai/OLMo-2-1124-7B-DPO) | [allenai/OLMo-2-1124-13B-DPO](https://huggingface.co/allenai/OLMo-2-1124-13B-DPO) |
+| **Final Models (RLVR)** | [allenai/OLMo-2-1124-7B-Instruct](https://huggingface.co/allenai/OLMo-2-1124-7B-Instruct) | [allenai/OLMo-2-1124-13B-Instruct](https://huggingface.co/allenai/OLMo-2-1124-13B-Instruct) |
+| **Reward Model (RM)** | [allenai/OLMo-2-1124-7B-RM](https://huggingface.co/allenai/OLMo-2-1124-7B-RM) | (Same as 8B) |
+
+
 ## Inference
 
 You can use OLMo with the standard HuggingFace transformers library:
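
The usage snippet itself lies outside the changed lines, so it does not appear in this diff. A minimal generation sketch, assuming the 13B base repository id `allenai/OLMo-2-1124-13B` and a recent transformers release with OLMo 2 support (both assumptions, not taken from this commit):

```python
# Minimal sketch: load the 13B base model and generate a continuation.
# The repo id and decoding settings are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMo-2-1124-13B"  # assumed 13B base checkpoint id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Language modeling is ", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=64, do_sample=True, top_p=0.95)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```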
@@ -81,30 +94,26 @@ For more documentation, see the [GitHub readme](https://github.com/allenai/OLMo?
 2. Further fine-tuning support is being developed in AI2's Open Instruct repository. Details are [here](https://github.com/allenai/open-instruct).
 
 ### Model Description
-
 - **Developed by:** Allen Institute for AI (Ai2)
-- **Supported by:** Databricks, Kempner Institute for the Study of Natural and Artificial Intelligence at Harvard University, AMD, CSC (Lumi Supercomputer), UW
 - **Model type:** a Transformer-style autoregressive language model.
 - **Language(s) (NLP):** English
 - **License:** The code and model are released under Apache 2.0.
-- **Contact:** Technical inquiries: `olmo at allenai dot org`. Press: `press at allenai dot org`
-- **Date cutoff:** Oct. 2023, with most data from Feb./March 2023 based on Dolma dataset version.
-
+- **Contact:** Technical inquiries: `olmo@allenai.org`. Press: `press@allenai.org`
+- **Date cutoff:** Dec. 2023.
 
 ### Model Sources
-
 - **Project Page:** https://allenai.org/olmo
 - **Repositories:**
   - Core repo (training, inference, fine-tuning etc.): https://github.com/allenai/OLMo
   - Evaluation code: https://github.com/allenai/OLMo-Eval
   - Further fine-tuning code: https://github.com/allenai/open-instruct
-<!-- - **Paper:** [Link](https://arxiv.org/abs/2402.00838) -->
+- **Paper:** Coming soon
 <!-- - **Technical blog post:** https://blog.allenai.org/olmo-1-7-7b-a-24-point-improvement-on-mmlu-92b43f7d269d -->
 <!-- - **W&B Logs:** [pretraining](https://wandb.ai/ai2-llm/OLMo-7B/groups/OLMo-1.7-7B), [annealing](https://wandb.ai/ai2-llm/OLMo-7B/groups/OLMo-1.7-7B-anneal) -->
 
 
 ## Evaluation
-Core model results for OLMo2 7B and 13B models are found below.
+Core model results for OLMo 2 7B and 13B models are found below.
 
 | Model | Train FLOPs | Average | ARC/C | HSwag | WinoG | MMLU | DROP | NQ | AGIEval | GSM8k | MMLUPro | TriviaQA |
 |-------------------|------------|---------|--------|--------|--------|-------|-------|-----|----------|--------|---------|-----------|
@@ -152,17 +161,12 @@ Core model results for OLMo2 7B and 13B models are found below.
 - 7B Model: 3 versions trained on 50B mix, merged via model souping
 - 13B Model: 3 versions on 100B mix + 1 version on 300B mix, merged for final checkpoint
 
-
 ## Bias, Risks, and Limitations
-
 Like any base language model or fine-tuned model without safety filtering, these models can easily be prompted by users to generate harmful and sensitive content. Such content may also be produced unintentionally, especially in cases involving bias, so we recommend that users consider the risks when applying this technology. Additionally, many statements from OLMo or any LLM are often inaccurate, so facts should be verified.
 
 
-
 ## Citation
-`TODO`
+A technical manuscript is forthcoming!
 
 ## Model Card Contact
-
-
-For errors in this model card, contact Aman, `{amanr} at allenai dot org`.
+For errors in this model card, contact `[email protected]`.
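
The "model souping" mentioned in the annealing notes above refers to averaging the weights of several checkpoints trained on the same data mix. A schematic sketch, with hypothetical checkpoint paths and not the actual merging code used for these models:

```python
# Schematic weight averaging ("souping") over several checkpoints that share
# one architecture; checkpoint directories below are hypothetical placeholders.
import torch
from transformers import AutoModelForCausalLM

checkpoint_dirs = ["ckpt_seed0", "ckpt_seed1", "ckpt_seed2"]  # e.g. 3 annealed versions
state_dicts = [AutoModelForCausalLM.from_pretrained(d).state_dict() for d in checkpoint_dirs]

averaged = {}
for name, param in state_dicts[0].items():
    if param.is_floating_point():
        # Mean of the corresponding tensor across all checkpoints.
        averaged[name] = torch.stack([sd[name].float() for sd in state_dicts]).mean(dim=0)
    else:
        averaged[name] = param  # keep non-float buffers unchanged

souped = AutoModelForCausalLM.from_pretrained(checkpoint_dirs[0])
souped.load_state_dict(averaged)
souped.save_pretrained("olmo2-souped")
```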
 