13 15

Dong Zihan

yangmuyu6

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

bandtor/Qwen3.6-35B-A3B-TQ4_1S-GGUF

upvoted a paper 3 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

liked a model 9 days ago

tencent/Hy-MT2-1.8B

View all activity

Organizations

None yet

liked a model 2 days ago

bandtor/Qwen3.6-35B-A3B-TQ4_1S-GGUF

36B • Updated 1 day ago • 116 • 1

upvoted a paper 3 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 6 days ago • 410

liked a model 9 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 7 days ago • 18.1k • • 1.1k

upvoted a paper 10 days ago

SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation

Paper • 2605.22536 • Published 12 days ago • 28

liked a model 10 days ago

tencent/Hy-MT2-30B-A3B

Translation • 30B • Updated 7 days ago • 4.46k • 444

upvoted a paper 11 days ago

Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment

Paper • 2605.20834 • Published 13 days ago • 5

liked a dataset 12 days ago

eye1patch/opensearch-vl-image-cache

Viewer • Updated 10 days ago • 7.49k • 5.1k • 2

upvoted a paper 12 days ago

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

liked a model 15 days ago

facebook/opt-125m

Text Generation • Updated Sep 15, 2023 • 10.6M • 257

liked a model 19 days ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 20.6M • • 1.28k

upvoted a paper 22 days ago

APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music

Paper • 2605.03395 • Published 28 days ago • 5

liked a model 26 days ago

Bingsu/adetailer

Updated Nov 21, 2024 • 14.6M • 710

liked a dataset about 1 month ago

Demoren/nes-surrogate-architectures

Updated 27 days ago • 8.17k • 1

upvoted 2 papers about 1 month ago

KWBench: Measuring Unprompted Problem Recognition in Knowledge Work

Paper • 2604.15760 • Published Apr 17 • 2

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 242

liked a model about 1 month ago

GaryYang123/Meme-Qwen-7B-Instruct

Updated Apr 28 • 25 • 62

liked a model about 2 months ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 865 • 908

upvoted 2 papers about 2 months ago

Do Audio-Visual Large Language Models Really See and Hear?

Paper • 2604.02605 • Published Apr 3 • 7

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

liked a model about 2 months ago

FacebookAI/xlm-roberta-base

Fill-Mask • 0.3B • Updated Feb 19, 2024 • 22.6M • • 838

Dong Zihan

AI & ML interests

Recent Activity

Organizations

yangmuyu6's activity