Joachim Baumann's picture

Joachim Baumann

joebaumann

·

https://joe-baumann.com/

AI & ML interests

Postdoc @ Stanford

Recent Activity

liked a dataset about 3 hours ago

SALT-NLP/SWE-chat

commentedon a paper about 3 hours ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

published a dataset about 4 hours ago

SALT-NLP/SWE-chat

View all activity

Organizations

authored a paper about 10 hours ago

SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors

Paper • 2510.17516 • Published Oct 20, 2025 • 2

authored a paper about 11 hours ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 6 days ago • 12

authored 2 papers 7 months ago

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Paper • 2503.05731 • Published Feb 19, 2025 • 3

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

Paper • 2509.08825 • Published Sep 10, 2025 • 3