20 12

J14eea2ylo

j14eea2ylo

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 minutes ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

upvoted a paper 2 days ago

GE-Sim 2.0: A Roadmap Towards Comprehensive Closed-loop Video World Simulators for Robotic Manipulation

upvoted a paper 5 days ago

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

View all activity

Organizations

None yet

upvoted a paper 7 minutes ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 6 days ago • 84

upvoted a paper 2 days ago

GE-Sim 2.0: A Roadmap Towards Comprehensive Closed-loop Video World Simulators for Robotic Manipulation

Paper • 2605.27491 • Published 7 days ago • 16

upvoted a paper 5 days ago

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

Paper • 2605.19769 • Published 14 days ago • 81

liked a model 8 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 7 days ago • 18.9k • • 1.1k

upvoted 2 papers 11 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 13 days ago • 204

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

Paper • 2604.27263 • Published 19 days ago • 11

upvoted a paper 12 days ago

Known By Their Actions: Fingerprinting LLM Browser Agents via UI Traces

Paper • 2605.14786 • Published 19 days ago • 2

upvoted a paper 13 days ago

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Paper • 2605.13641 • Published 20 days ago • 50

upvoted a paper 19 days ago

RemoteZero: Geospatial Reasoning with Zero Human Annotations

Paper • 2605.04451 • Published 27 days ago • 8

upvoted a paper 22 days ago

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

Paper • 2405.13729 • Published Apr 29 • 13

liked a dataset 26 days ago

HennyPr/ps2_hf2

Viewer • Updated Apr 5 • 1 • 783k • 16

liked a dataset about 1 month ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated Mar 31 • 2.33k • 4.03k • 598

upvoted a paper about 1 month ago

The Cognitive Penalty: Ablating System 1 and System 2 Reasoning in Edge-Native SLMs for Decentralized Consensus

Paper • 2604.16913 • Published Apr 18 • 1

liked a model about 1 month ago

tencent/HY-World-2.0

Image-to-3D • Updated 12 days ago • 3.36k • 663

liked a model about 2 months ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 854 • 908

upvoted 2 papers about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Action Images: End-to-End Policy Learning via Multiview Video Generation

Paper • 2604.06168 • Published Apr 7 • 14

liked a model about 2 months ago

crtal/gemma-4-e2b-finance-alpaca-gguf

5B • Updated Apr 12 • 14 • 1

liked a dataset about 2 months ago

franceskoshahinasilogicleaders/shp-ai-dataset

Viewer • Updated Apr 9 • 15.9k • 49 • 1

upvoted a paper about 2 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 176

J14eea2ylo

AI & ML interests

Recent Activity

Organizations

j14eea2ylo's activity