In a Training Loop 🔄

3 33 28

Zinan Tang

Word2Li

https://zinantang.works/

AI & ML interests

NLP、LLM、Data4LLM、LLM4Data

Recent Activity

upvoted a paper 3 days ago

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

upvoted a paper 3 days ago

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

upvoted a paper 3 days ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

View all activity

Organizations

None yet

upvoted 4 papers 3 days ago

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Paper • 2605.02290 • Published May 4 • 42

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

Paper • 2605.15565 • Published May 15 • 17

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published 26 days ago • 41

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published 9 days ago • 15

upvoted 2 papers 4 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 6 days ago • 100

Ling and Ring 2.6 Technical Report: Efficient and Instant Agentic Intelligence at Trillion-Parameter Scale

Paper • 2606.15079 • Published 8 days ago • 75

commented a paper 5 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published 23 days ago • 20 •

upvoted 7 papers 5 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published 23 days ago • 20

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 10 days ago • 89

liked 2 models about 2 months ago

inclusionAI/Ling-2.6-1T

Text Generation • 1T • Updated 4 days ago • 576 • • 472

inclusionAI/Ling-2.6-flash

Text Generation • 107B • Updated 4 days ago • 10.6k • 496

upvoted a paper 2 months ago

Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

Paper • 2604.10480 • Published Apr 12 • 20

upvoted a collection 2 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 9 days ago • 158

liked a dataset 2 months ago

nvidia/Nemotron-Pretraining-Specialized-v1.1

Viewer • Updated Mar 11 • 19.8M • 2.55k • 44

upvoted a paper 4 months ago

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Paper • 2603.03202 • Published Mar 3 • 18

Zinan Tang

AI & ML interests

Recent Activity

Organizations

Word2Li's activity