3 30 6

Yifei Li

JoeLeelyf

https://joeleelyf.github.io/

JoeLeelyf

AI & ML interests

MLLMs, Deepfake Detection, Computer Vision

Recent Activity

upvoted a paper 15 days ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

upvoted a paper 19 days ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

upvoted a paper about 1 month ago

UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection

View all activity

Organizations

upvoted a paper 15 days ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published 19 days ago • 46

upvoted a paper 19 days ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Paper • 2605.06139 • Published 23 days ago • 69

upvoted a paper about 1 month ago

UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection

Paper • 2604.21904 • Published Apr 23 • 4

upvoted a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 136

upvoted 2 papers 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 12 • 83

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published Feb 2 • 20

upvoted a paper 5 months ago

Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning

Paper • 2512.15693 • Published Dec 17, 2025 • 18

upvoted 3 papers 6 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

upvoted 3 papers 7 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 31

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 68

upvoted 2 papers 8 months ago

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 37

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 43

upvoted 3 papers 10 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6, 2025 • 52

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1, 2025 • 63

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21, 2025 • 38

upvoted a paper 11 months ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24, 2025 • 27

upvoted a paper about 1 year ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10, 2025 • 35

Yifei Li

AI & ML interests

Recent Activity

Organizations

JoeLeelyf's activity