ricky_33's picture

ricky_33

ricky333

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

upvoted a paper 1 day ago

Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do

upvoted a paper 11 days ago

RedAct: Redacting Agent Capability Traces for Procedural Skill Protection

View all activity

Organizations

models 0

None public yet

datasets 0

None public yet