Commit 131e941 (verified) · amanrangapur committed · Parent: bdfd591

Update README.md

Files changed (1): README.md (+8 -3)

README.md CHANGED
@@ -21,14 +21,14 @@ The core models released in this batch include the following:
 
 | Size | Training Tokens | Layers | Hidden Size | Attention Heads | Context Length |
 |------|--------|---------|-------------|-----------------|----------------|
-| [OLMo2-7B](https://huggingface.co/allenai/OLMo-1124-7B) | 4 Trillion | 32 | 4096 | 32 | 4096 |
-| [OLMo2- 13B](https://huggingface.co/allenai/OLMo2-1124-13B) | 5 Trillion | 40 | 5120 | 42 | 4096 |
+| [OLMo 2-7B](https://huggingface.co/allenai/OLMo-1124-7B) | 4 Trillion | 32 | 4096 | 32 | 4096 |
+| [OLMo 2- 13B](https://huggingface.co/allenai/OLMo2-1124-13B) | 5 Trillion | 40 | 5120 | 42 | 4096 |
 
 The core models released in this batch include the following:
 
 | **Stage** | **OLMo 2 7B** | **OLMo 2 13B** |
 |----------------------|----------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------|
-| **Base Model** | [allenai/OLMo2-7B-1124](https://huggingface.co/allenai/OLMo2-7B-1124) | [allenai/OLMo-2-13B-1124](https://huggingface.co/allenai/OLMo-2-13B-1124) |
+| **Base Model** | [allenai/OLMo-2-7B-1124](https://huggingface.co/allenai/OLMo2-7B-1124) | [allenai/OLMo-2-13B-1124](https://huggingface.co/allenai/OLMo-2-13B-1124) |
 | **SFT** | [allenai/OLMo-2-1124-7B-SFT](https://huggingface.co/allenai/OLMo-2-1124-7B-SFT) | [allenai/OLMo-2-1124-13B-SFT](https://huggingface.co/allenai/OLMo-2-1124-13B-SFT) |
 | **DPO** | [allenai/OLMo-2-1124-7B-DPO](https://huggingface.co/allenai/OLMo-2-1124-7B-DPO) | [allenai/OLMo-2-1124-13B-DPO](https://huggingface.co/allenai/OLMo-2-1124-13B-DPO) |
 | **Final Models (RLVR)** | [allenai/OLMo-2-1124-7B-Instruct](https://huggingface.co/allenai/OLMo-2-1124-7B-Instruct) | [allenai/OLMo-2-1124-13B-Instruct](https://huggingface.co/allenai/OLMo-2-1124-13B-Instruct) |
@@ -163,6 +163,11 @@ Core model results for OLMo 2 7B and 13B models are found below.
 ## Bias, Risks, and Limitations
 Like any base language model or fine-tuned model without safety filtering, these models can easily be prompted by users to generate harmful and sensitive content. Such content may also be produced unintentionally, especially in cases involving bias, so we recommend that users consider the risks when applying this technology. Additionally, many statements from OLMo or any LLM are often inaccurate, so facts should be verified.
 
+## License and use
+
+OLMo 2 is licensed under the Apache 2.0 license.
+OLMo 2 is intended for research and educational use.
+For more information, please see our [Responsible Use Guidelines](https://allenai.org/responsible-use).
 
 ## Citation
 A technical manuscript is forthcoming!
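
For readers who want to try one of the checkpoints linked in the stage table, the sketch below shows one way to load the 7B Instruct model with the Hugging Face transformers library. This is a minimal sketch, assuming a transformers release recent enough to include OLMo 2 support, that the Instruct checkpoint ships a chat template, and enough memory for a 7B-parameter model; any other repository ID from the table could be substituted.

```python
# Minimal sketch: load an OLMo 2 checkpoint with Hugging Face transformers.
# Assumes a recent transformers release with OLMo 2 support.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "allenai/OLMo-2-1124-7B-Instruct"  # any ID from the table works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Instruct checkpoints are chat-tuned, so format the prompt with the
# checkpoint's chat template before generating.
messages = [{"role": "user", "content": "What is language modeling?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```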