2 82 17

Zikai Zhou

Klayand

https://klayand.github.io/

Klayand

AI & ML interests

Knowledge Distillation, Generated Models

Recent Activity

upvoted a paper about 2 hours ago

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

upvoted a paper 9 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

upvoted a paper 9 days ago

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

View all activity

Organizations

None yet

upvoted a paper about 2 hours ago

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Paper • 2606.25041 • Published 3 days ago • 35

upvoted 3 papers 9 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 16 days ago • 201

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

Paper • 2606.16255 • Published 11 days ago • 14

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Paper • 2606.17030 • Published 11 days ago • 30

upvoted a paper 14 days ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

Paper • 2606.09076 • Published 18 days ago • 61

upvoted 2 papers 15 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 17 days ago • 41

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 17 days ago • 189

upvoted 2 papers 21 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 25 days ago • 135

Qwen-Image-Flash: Beyond Objective Design

Paper • 2606.03746 • Published 24 days ago • 36

upvoted 2 papers 24 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published 29 days ago • 41

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published 28 days ago • 61

upvoted a paper 27 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 29 days ago • 146

upvoted 6 papers about 1 month ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

upvoted 2 papers about 2 months ago

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Paper • 2605.06376 • Published May 7 • 27

D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models

Paper • 2605.05204 • Published May 6 • 28

Zikai Zhou

AI & ML interests

Recent Activity

Organizations

Klayand's activity