Models (21)

Active filters: audio-visual

Memories-ai/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Oct 5, 2025 • 29 • 2

bpiyush/sound-of-water-models

Audio Classification • Updated Jan 13, 2025 • 3

bolinlai/CSTS

Updated Mar 18, 2025 • 4

openinterx/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Jul 19, 2025 • 12 • 4

JusperLee/Dolphin

Audio-to-Audio • 7.04M • Updated Apr 13 • 11.2k • 13

matbee/sam-audio-small-onnx

Updated Dec 24, 2025 • 9

matbee/sam-audio-large-onnx

Updated Dec 23, 2025 • 8

square-zero-labs/sam-audio-small-onnx

lopho/ltx2-artist-loras

Updated Apr 2 • 3

dnamodel/tsam-viewer-emotions

Video Classification • Updated Mar 27 • 2

elix3r/LTX-2.3-22b-AV-LoRA-talking-head

Image-to-Video • Updated Mar 24 • 8.78k • 53

oonepieceeyewear/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Apr 15 • 2

ckoutlis/auvire-lavdf

Updated Apr 21 • 6

ckoutlis/auvire-avdeepfake1m

Updated Apr 21 • 5

Vegetabot/AVSQwen-Omni-7B

Image-Text-to-Text • 11B • Updated 29 days ago • 22

Vegetabot/AVSQwen-Omni-3B

Image-Text-to-Text • 6B • Updated 29 days ago • 20

vsro200/models-vsro200

Video-Text-to-Text • Updated 18 days ago

mhussainahmad/averformer-ravdess

Updated 10 days ago • 24

mhussainahmad/averformer-cremad-v4

Updated 2 days ago • 40

mhussainahmad/averformer-ravdess-v4

Updated 2 days ago • 80

mhussainahmad/averformer-meld-v4

Updated 2 days ago • 52