HaoyeZhang
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,11 @@ RLHF-V is trained on [RLHF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang
|
|
18 |
|
19 |
For more details, please refer to our [paper](https://arxiv.org/abs/2312.00849).
|
20 |
|
21 |
-
![Illustration of the RLHF-V
|
|
|
|
|
|
|
|
|
22 |
|
23 |
## Model Details
|
24 |
|
|
|
18 |
|
19 |
For more details, please refer to our [paper](https://arxiv.org/abs/2312.00849).
|
20 |
|
21 |
+
![Illustration of the RLHF-V framework](https://rlhf-v.github.io/images/rlhf-v_framework.jpg)
|
22 |
+
|
23 |
+
## News
|
24 |
+
* [2024.05.20] 🎉 We introduce [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), our new alignment framework that utilize open-source models for feedback generation and reach **super GPT-4V trustworthiness**. You can download the corresponding [🤗 dataset](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset) now!
|
25 |
+
* [2024.04.11] 🔥 Our data is used in [MiniCPM-V 2.0](https://huggingface.co/openbmb/MiniCPM-V-2), an **end-side** multimodal large language model that exhibits **comparable trustworthiness with GPT-4V**!
|
26 |
|
27 |
## Model Details
|
28 |
|