Update README.md
README.md
@@ -2,8 +2,20 @@
 license: other
 license_name: nvidia-open-model-license
 license_link: LICENSE
+library_name: nemo
+language:
+- en
+inference: false
+fine-tuning: false
+tags:
+- nvidia
+- steerlm
+- reward model
+datasets:
+- nvidia/HelpSteer2
 ---
 
+
 ## Nemotron-4-340B-Reward
 
 [![Model architecture](https://img.shields.io/badge/Model%20Arch-Transformer%20Decoder-green)](#model-architecture)[![Model size](https://img.shields.io/badge/Params-340B-green)](#model-architecture)[![Language](https://img.shields.io/badge/Language-Multilingual-green)](#datasets)
@@ -55,8 +67,8 @@ Nemotron-4 340B-Reward can be used in the alignment stage to align pretrained mo
 ### Required Hardware
 
 BF16 Inference:
--
--
+- 16x H100 (2x H100 Nodes)
+- 16x A100 (2x A100 80GB Nodes)
 
 ### Usage:
 
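A rough sanity check of the BF16 hardware figures added in this change, as a minimal sketch rather than anything the README itself states. It assumes 2 bytes per parameter for BF16, the 340B parameter count from the badge, 8 GPUs per node (inferred from "16x" across "2x" nodes), and 80 GB per GPU (given for A100, assumed here for H100). It counts weights only; activations and other runtime overhead are ignored.

```python
# Back-of-the-envelope check of the BF16 inference hardware listed in the diff.
PARAMS = 340e9             # parameter count, from the "Params-340B" badge
BYTES_PER_PARAM_BF16 = 2   # BF16 stores each value in 16 bits
GPU_MEMORY_GB = 80         # assumption: 80 GB per GPU (stated for A100, assumed for H100)
GPUS_PER_NODE = 8          # inferred: 16 GPUs spread over 2 nodes
NUM_NODES = 2


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Memory needed for the weights alone, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


weights_gb = weight_memory_gb(PARAMS, BYTES_PER_PARAM_BF16)
one_node_gb = GPU_MEMORY_GB * GPUS_PER_NODE
total_gb = one_node_gb * NUM_NODES
per_gpu_gb = weights_gb / (GPUS_PER_NODE * NUM_NODES)

print(f"weights: {weights_gb:.0f} GB")
print(f"one node: {one_node_gb} GB, two nodes: {total_gb} GB")
print(f"weights per GPU if sharded evenly: {per_gpu_gb:.1f} GB")
```

Under these assumptions the weights alone exceed the memory of a single 8-GPU node, which is consistent with the README listing two nodes for each GPU type.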