the best collection of RLXF model including RLHF, RLAIF etc.
lil
Amu
AI & ML interests
None yet
Recent Activity
liked a model 10 days ago
BAAI/OpenSeek-Mid-v1 published a model about 1 year ago
Amu/DeepSeek-R1-Distill-Qwen-1.5B-GRPO liked a Space about 1 year ago
OpenEvals/find-a-leaderboardOrganizations
None yet