Respository for ACL 2024 paper "Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI feedback"
SNUMPR
SNUMPR
AI & ML interests
CV, Multimodal
Recent Activity
updated
a dataset
about 1 month ago
SNUMPR/VCG-bench
updated
a model
about 1 month ago
SNUMPR/vlm_rlaif_awq_w4_g128
Organizations
None yet
spaces
1
models
12
SNUMPR/vlm_rlaif_awq_w4_g128
Updated
SNUMPR/vlm_policy_init_7b_lora
Updated
SNUMPR/vlm_rm_13b_lora
Updated
SNUMPR/hlsm_alfred
Updated
SNUMPR/vlm_sft_video_llava_13b
Updated
•
3
SNUMPR/vlm_sft_video_llava_7b
Updated
SNUMPR/realfred_film_BERT_pretrained
Updated
SNUMPR/realfred_film_bert
Updated
SNUMPR/realfred_film_BERT_data
Updated
SNUMPR/hlsm_realfred_models
Updated