BeamPERL Collection BeamPERL: PE-RLVR-FT for Beam Mechanics Problem-Solving • 6 items • Updated 1 day ago
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks Paper • 2502.13025 • Published Feb 18, 2025 • 2
PRefLexOR Collection PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking • 13 items • Updated 1 day ago • 3