arxiv:2605.18827
Prateek Biswas
biswasprateek
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 23 hours ago
Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents authored a paper about 1 month ago
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds upvoted a paper about 1 month ago
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA ScaffoldsOrganizations
None yet