23 9

吴晨

dibrimatter14

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

upvoted a paper 9 days ago

BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate

upvoted a paper 16 days ago

Micro Language Models Enable Instant Responses

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 3 days ago • 63

upvoted a paper 9 days ago

BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate

Paper • 2604.25203 • Published 12 days ago • 8

upvoted a paper 16 days ago

Micro Language Models Enable Instant Responses

Paper • 2604.19642 • Published 19 days ago • 3

upvoted a paper 17 days ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published 25 days ago • 62

liked a model 17 days ago

WarriorMama777/OrangeMixs

Text-to-Image • Updated Jan 7, 2024 • 2.08k • 3.9k

liked a model 26 days ago

autogluon/chronos-2

Time Series Forecasting • 0.1B • Updated Nov 24, 2025 • 5.99M • 14

upvoted 3 papers 28 days ago

liked a dataset about 1 month ago

dhruvbansalup/dlgenai-nppe-dataset

Viewer • Updated about 1 month ago • 58.2k • 233 • 1

upvoted 2 papers about 1 month ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published Mar 26 • 53

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 341

liked a dataset about 1 month ago

AquaV/genshin-voices-separated

Updated Jul 6, 2024 • 61.4k • 18

upvoted 2 papers about 1 month ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 350

Representation Alignment for Just Image Transformers is not Easier than You Think

Paper • 2603.14366 • Published Mar 15 • 13

upvoted 2 papers about 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted 2 papers 2 months ago

DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Paper • 2603.04743 • Published Mar 5 • 53

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 264

liked a model 2 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 232k • • 1.1k

吴晨

AI & ML interests

Recent Activity

Organizations

dibrimatter14's activity