Update README.md
Browse files
README.md
CHANGED
@@ -52,8 +52,12 @@ we employ the model's output to predict preferences and use pairwise accuracy as
|
|
52 |
| Idefics2 | 73.0 | 6.5 | 0.3 | 34.6 | 31.7 |
|
53 |
| SSIM-dyn | 42.5 | -5.5 | -17.0 | 28.4 | 36.5 |
|
54 |
| MES-dyn | 36.7 | -12.9 | -26.4 | 31.4 | 44.5 |
|
55 |
-
|
56 |
-
|
|
|
|
|
|
|
|
|
57 |
|
58 |
## Usage
|
59 |
### Installation
|
|
|
52 |
| Idefics2 | 73.0 | 6.5 | 0.3 | 34.6 | 31.7 |
|
53 |
| SSIM-dyn | 42.5 | -5.5 | -17.0 | 28.4 | 36.5 |
|
54 |
| MES-dyn | 36.7 | -12.9 | -26.4 | 31.4 | 44.5 |
|
55 |
+
| Fuyu | - | - | - | - | - |
|
56 |
+
| Kosmos-2 | - | - | - | - | - |
|
57 |
+
| CogVLM | - | - | - | - | - |
|
58 |
+
| OpenFlamingo | - | - | - | - | - |
|
59 |
+
The best in MantisScore series is in bold and the best in baselines is underlined.
|
60 |
+
"-" means the answer of MLLM is meaningless or in wrong format.
|
61 |
|
62 |
## Usage
|
63 |
### Installation
|