hexuan21 committed
Commit 41cd5c9 · verified · 1 Parent(s): c659068

Update README.md

Files changed (1)
  1. README.md +6 -2
README.md CHANGED
@@ -17,9 +17,13 @@ pipeline_tag: visual-question-answering
  ![VideoScore](https://tiger-ai-lab.github.io/VideoScore/static/images/teaser.png)
  
  ## Introduction
- - VideoScore is a video quality evaluation model, taking [Mantis-8B-Idefics2](https://huggingface.co/TIGER-Lab/Mantis-8B-Idefics2) as base-model
+ 
+ - 🤯🤯Try the new version, [VideoScore-v1.1](https://huggingface.co/TIGER-Lab/VideoScore-v1.1), a variant of [VideoScore](https://huggingface.co/TIGER-Lab/VideoScore) with better performance on the "text-to-video alignment" subscore!
+ See more details about this new version [here](https://huggingface.co/TIGER-Lab/VideoScore-v1.1).
+ 
+ - The [VideoScore](https://huggingface.co/TIGER-Lab/VideoScore) series is a family of video quality evaluation models, taking [Mantis-8B-Idefics2](https://huggingface.co/TIGER-Lab/Mantis-8B-Idefics2) as the base model
  and trained on [VideoFeedback](https://huggingface.co/datasets/TIGER-Lab/VideoFeedback),
- a large video evaluation dataset with multi-aspect human scores.
+ a large video evaluation dataset with multi-aspect human scores.
  
  - VideoScore can reach a 75+ Spearman correlation with humans on VideoFeedback-test, surpassing all MLLM-prompting methods and feature-based metrics.
  VideoScore also beats the best baselines on three other benchmarks, EvalCrafter, GenAI-Bench and VBench, showing high alignment with human evaluations.
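
The README above reports a "75+ Spearman correlation with humans" on VideoFeedback-test. As a minimal sketch of what that figure means (this is not code from the VideoScore repo, and the score arrays below are invented placeholders), Spearman correlation compares the model's ranking of videos against the human ranking and can be computed with `scipy.stats.spearmanr`:

```python
# Minimal sketch, not from the VideoScore repo: how a "75+ Spearman
# correlation with humans" figure is computed once per-video scores exist.
from scipy.stats import spearmanr

# Hypothetical scores for the same six test videos (placeholder values,
# not real VideoFeedback-test data).
model_scores = [3.2, 1.8, 2.5, 3.9, 1.1, 2.9]  # e.g. VideoScore outputs
human_scores = [3.0, 2.0, 2.5, 4.0, 1.0, 3.5]  # e.g. human ratings

rho, p_value = spearmanr(model_scores, human_scores)
# Results are often reported as rho * 100, so "75+" corresponds to rho > 0.75.
print(f"Spearman rho = {rho:.3f} ({rho * 100:.1f} when scaled by 100), p = {p_value:.3g}")
```

Because Spearman correlation is rank-based, a high score rewards putting videos in the same order as the human raters rather than matching their absolute score values.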