Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
Paper
•
2501.08326
•
Published
•
31
Peacefully Open Source Post-Processing Speech and Language Resources Toward Research Community.