Update README.md
README.md
CHANGED
@@ -1,3 +1,20 @@
|
|
1 |
# Overview
|
2 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6550c4f27bbfce1878f5f280/AnqbCNf6pRiQ_5uNX0r4d.png)
|
3 |
-
Volcano employs a single LMM to generate initial responses, feedback, and revisions, as well as decisions to accept revisions. It follows a sequential procedure of an iterative critique-revision-decide loop.
1 |
# Overview
|
2 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6550c4f27bbfce1878f5f280/AnqbCNf6pRiQ_5uNX0r4d.png)
|
3 |
+
Volcano employs a single LMM to generate initial responses, feedback, and revisions, as well as decisions to accept revisions. It follows a sequential procedure of an iterative critique-revision-decide loop.
|
4 |
+
|
5 |
+
# Model details
|
6 |
+
|
7 |
+
**Model type:**
|
8 |
+
Volcano is a multimodal model that revises its own responses using self-generated feedback. It was trained from the Vicuna model on visual instruction tuning data and on multimodal feedback and revision data collected with gpt-3.5-turbo, following the LLaVA methodology.
|
9 |
+
|
10 |
+
**Model date:**
|
11 |
+
Volcano-7b was trained in October 2023.
|
12 |
+
|
13 |
+
**Paper or resources for more information:**
|
14 |
+
|
15 |
+
## Training dataset
|
16 |
+
- 274K Volcano-train data.
|
17 |
+
- 558K filtered image-text pairs from LAION/CC/SBU, captioned by BLIP.
|
18 |
+
- 158K GPT-generated multimodal instruction-following data.
|
19 |
+
- 450K academic-task-oriented VQA data mixture.
|
20 |
+
- 40K ShareGPT data.
|
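
The critique-revision-decide loop described in the Overview can be sketched roughly as follows. This is a minimal illustration, not Volcano's actual inference code: the `generate` callable, the prompt wording, the `max_iters` cap, and the choice to stop as soon as a revision is rejected are all assumptions made for the example.

```python
from typing import Callable

# Hypothetical interface: (prompt, image) -> generated text.
# This stands in for a single LMM; it is not Volcano's real API.
Generate = Callable[[str, object], str]


def self_revise(generate: Generate, image: object, question: str,
                max_iters: int = 3) -> str:
    """Run an iterative critique-revision-decide loop with one model."""
    # Initial response from the model.
    best = generate(f"Question: {question}\nAnswer:", image)

    for _ in range(max_iters):  # max_iters is an assumed cap
        # Critique: the same model gives feedback on the current answer.
        feedback = generate(
            f"Question: {question}\nAnswer: {best}\n"
            "Give feedback on this answer:", image)

        # Revise: the same model rewrites the answer using the feedback.
        revised = generate(
            f"Question: {question}\nAnswer: {best}\n"
            f"Feedback: {feedback}\nRevised answer:", image)

        # Decide: the same model chooses between the current answer (A)
        # and the revision (B).
        verdict = generate(
            f"Question: {question}\nA: {best}\nB: {revised}\n"
            "Which answer is better, A or B?", image)

        if verdict.strip().upper().startswith("B"):
            best = revised  # accept the revision and keep iterating
        else:
            break  # assumed stopping rule: keep the current answer

    return best
```

Passing the model in as a plain callable keeps the control flow visible and lets the loop be exercised with a stub before a real model is wired in.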