arxiv:2606.02404
Seungone Kim PRO
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
authored a paper 22 minutes ago
Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization authored a paper 22 minutes ago
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts upvoted a paper 4 days ago
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts