Describing short clips
#3
by
fishcakeday
- opened
When I try to ask the model to describe a short clip with very few frames, it always fails to identify any actions or movements, only talking about the overall description. Trying it with and without do_sample makes no difference. Any way I can use this setup to describe 2-5 second clips?