AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation Paper • 2604.18240 • Published Apr 20 • 16
Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models Paper • 2506.11487 • Published Jun 13, 2025 • 3
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion Paper • 2508.04440 • Published Aug 6, 2025 • 9
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models Article • AI-MO • Jul 10, 2025 • 56