Eric Xian

ericxian1997

AI & ML interests

None yet

Recent Activity

liked a Space 23 days ago

HuggingFaceTB/smol-training-playbook

liked a Space about 1 month ago

AdithyaSK/rl-environments-guide

liked a dataset 3 months ago

nvidia/Nemotron-Terminal-Corpus

Organizations

liked a Space 23 days ago

The Smol Training Playbook

📚

3.21k

The secrets to building world-class LLMs

liked a Space about 1 month ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

188

Building and scaling RL environments for LLM training

liked a dataset 3 months ago

nvidia/Nemotron-Terminal-Corpus

Viewer • Updated Feb 27 • 366k • 6.1k • 133

upvoted a paper 4 months ago

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Paper • 2512.24265 • Published Dec 30, 2025 • 4

upvoted a collection 4 months ago

UltraData

Collection

Ultra Scale, Ultra Quality, Ultra Coverage • 11 items • Updated 22 days ago • 98

updated a dataset 5 months ago

DATA-MASK/FineWeb-Mask

Updated Jan 19 • 32.8k • 6

liked a dataset 5 months ago

DATA-MASK/FineWeb-Mask

Updated Jan 19 • 32.8k • 6

published a dataset 5 months ago

DATA-MASK/FineWeb-Mask

Updated Jan 19 • 32.8k • 6

liked 2 datasets 6 months ago

allenai/signal-and-noise

Viewer • Updated Aug 19, 2025 • 898k • 118 • 5

nvidia/Nemotron-Pretraining-Specialized-v1

Viewer • Updated Dec 22, 2025 • 60.7M • 5.41k • 82

upvoted a paper 7 months ago

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 39

liked a model 8 months ago

briaai/FIBO

Text-to-Image • Updated Mar 30 • 4.08k • 316

upvoted a paper 8 months ago

Does your data spark joy? Performance gains from domain upsampling at the end of training

Paper • 2406.03476 • Published Jun 5, 2024 • 4

liked a dataset 9 months ago

yczhuang/Hephaestus-Forge

Viewer • Updated Sep 8, 2025 • 3.81k • 123 • 1

upvoted a collection 9 months ago

DeepSeek-V3.2

Collection

4 items • Updated Dec 1, 2025 • 544

liked a model 9 months ago

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • 2B • Updated Apr 9, 2025 • 7.26k • 584

liked a dataset 10 months ago

nvidia/Nemotron-CC-v2

Viewer • Updated Dec 23, 2025 • 8.79B • 20.6k • 124

liked a model 10 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated Aug 26, 2025 • 36.2k • 502

liked 2 models 11 months ago

Qwen/Qwen2.5-Math-1.5B

Text Generation • 2B • Updated Sep 23, 2024 • 680k • 109

EssentialAI/eai-distill-0.5b

0.6B • Updated Jun 18, 2025 • 252k • 25
