NJU-LINK/OmniCaptioner-IF-3B
Image-Text-to-Text • 6B • Updated • 48
None defined yet.
OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning
CoVEBench: Can Video Editing Models Handle Complex Instructions?