Text Classification
Safetensors
llama
Ray2333 commited on
Commit
e7eafd9
1 Parent(s): 2bc4498

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -19,7 +19,7 @@ We evaluate GRM_Llama3.1_8B_rewardmodel-ft on the [reward model benchmark](https
19
 
20
  | Model | Average | Chat | Chat Hard | Safety | Reasoning |
21
  |:-------------------------:|:-------------:|:---------:|:---------:|:--------:|:-----------:|
22
- |GRM_Llama3.1_8B_rewardmodel-ft| 92.6|95.0 |87.7|91.4|96.4|
23
  |[GRM-Llama3-8B-rewardmodel-ft](https://huggingface.co/Ray2333/GRM-Llama3-8B-rewardmodel-ft)**(8B)**|91.5|95.5|86.2|90.8|93.6|
24
  |[GRM-Llama3.2-3B-rewardmodel-ft](https://huggingface.co/Ray2333/GRM-Llama3.2-3B-rewardmodel-ft)**(ours, 3B)**|90.9|91.6|84.9|92.7|94.6|
25
  | [GRM-gemma2-2B-rewardmodel-ft](https://huggingface.co/Ray2333/GRM-gemma2-2B-rewardmodel-ft) **(Ours, 2B)**| 88.4 | 93.0 | 77.2 | 92.2 | 91.2 |
 
19
 
20
  | Model | Average | Chat | Chat Hard | Safety | Reasoning |
21
  |:-------------------------:|:-------------:|:---------:|:---------:|:--------:|:-----------:|
22
+ |[GRM_Llama3.1_8B_rewardmodel-ft](https://huggingface.co/Ray2333/GRM_Llama3.1_8B_rewardmodel-ft)| 92.6|95.0 |87.7|91.4|96.4|
23
  |[GRM-Llama3-8B-rewardmodel-ft](https://huggingface.co/Ray2333/GRM-Llama3-8B-rewardmodel-ft)**(8B)**|91.5|95.5|86.2|90.8|93.6|
24
  |[GRM-Llama3.2-3B-rewardmodel-ft](https://huggingface.co/Ray2333/GRM-Llama3.2-3B-rewardmodel-ft)**(ours, 3B)**|90.9|91.6|84.9|92.7|94.6|
25
  | [GRM-gemma2-2B-rewardmodel-ft](https://huggingface.co/Ray2333/GRM-gemma2-2B-rewardmodel-ft) **(Ours, 2B)**| 88.4 | 93.0 | 77.2 | 92.2 | 91.2 |