ShareGPTVideo

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

ruohongz updated a dataset about 1 month ago

ShareGPTVideo/train_video_and_instruction

ruohongz updated a dataset 3 months ago

ShareGPTVideo/train_raw_video

ruohongz updated a model 3 months ago

ShareGPTVideo/LLaVA-Hound-DPO

View all activity

ShareGPTVideo's activity

ruohongz

updated a dataset about 1 month ago

ShareGPTVideo/train_video_and_instruction

Updated Dec 14, 2024 • 1.14k • 20

ruohongz

updated a dataset 3 months ago

ShareGPTVideo/train_raw_video

Viewer • Updated Oct 31, 2024 • 64.1k • 118 • 1

ruohongz

updated 3 models 3 months ago

ruohongz

authored 4 papers 3 months ago

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21, 2024 • 22

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Paper • 2310.11667 • Published Oct 18, 2023 • 3

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Paper • 2311.10614 • Published Nov 17, 2023

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1, 2024 • 11

ZhangYuanhan

authored a paper 4 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 39

ZhangYuanhan

authored 10 papers 6 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 60

FunQA: Towards Surprising Video Comprehension

Paper • 2306.14899 • Published Jun 26, 2023 • 1

MMBench: Is Your Multi-modal Model an All-around Player?

Paper • 2307.06281 • Published Jul 12, 2023 • 5

Neural Prompt Search

Paper • 2206.04673 • Published Jun 9, 2022

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 7

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1, 2024 • 11

Learning without Forgetting for Vision-Language Models

Paper • 2305.19270 • Published May 30, 2023

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Paper • 2203.07845 • Published Mar 15, 2022

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 34

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10, 2024 • 40

AI & ML interests

Recent Activity

Team members 4

ShareGPTVideo's activity