CEIA-RL/energy-gpt-regulatorio-v2-GRPO-step140-Safety • Text Generation • 4B • Updated about 7 hours ago
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline-GRPO_v3 • Viewer • Updated 2 days ago • 447 • 60
CEIA-RL/energy-eval-filtered_responses_multichoice_Qwen_Qwen3-4B_v3 • Viewer • Updated 3 days ago • 447 • 55
CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio-v2_v3 • Viewer • Updated 3 days ago • 447 • 54
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline_v3 • Viewer • Updated 3 days ago • 447 • 63
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3 • Viewer • Updated 3 days ago • 447 • 51
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3 • Viewer • Updated 3 days ago • 447 • 60
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-GRPO_v3 • Viewer • Updated 3 days ago • 447 • 55
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy_v3 • Viewer • Updated 3 days ago • 447 • 59