7 15

haiyimei

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

upvoted a collection 13 days ago

SenseNova-U1

liked a model 29 days ago

google/gemma-4-31B-it

View all activity

Organizations

upvoted a paper 9 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published 12 days ago • 81

upvoted a collection 13 days ago

SenseNova-U1

Collection

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 7 items • Updated about 2 hours ago • 53

liked 2 models 29 days ago

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 6 days ago • 9.12M • • 2.61k

dealignai/Gemma-4-31B-JANG_4M-CRACK

Image-Text-to-Text • 6B • Updated 17 days ago • 124k • 1.5k

upvoted 2 papers about 2 months ago

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

Paper • 2603.19227 • Published Mar 19 • 42

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

upvoted a paper 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

liked a Space 4 months ago

Qwen3-TTS Demo

🎙

1.92k

Generate speech audio from text with custom or cloned voices

upvoted a paper 6 months ago

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 27

liked a model 8 months ago

openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated Mar 10 • 135k • 1.09k

liked a Space 8 months ago

FastVLM WebGPU

🍎

446

Real-time video captioning powered by FastVLM

liked a model about 1 year ago

sand-ai/MAGI-1

Image-to-Video • Updated Jun 3, 2025 • 609

liked a dataset about 1 year ago

caizhongang/SynBody

Updated Nov 4, 2024 • 361 • 6

authored a paper about 1 year ago

WHAC: World-grounded Humans and Cameras

Paper • 2403.12959 • Published Mar 19, 2024 • 4

upvoted an article about 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

liked 4 models over 1 year ago

liked a model almost 2 years ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 5.65k • • 4.95k