CEIA Reinforcement Learning

university

AI & ML interests

None defined yet.

Recent Activity

luanagbmartins updated a dataset 25 minutes ago

CEIA-RL/energy-eval-filtered_responses_multichoice_Qwen_Qwen3-4B_v3

luanagbmartins published a dataset 32 minutes ago

CEIA-RL/energy-eval-filtered_responses_multichoice_Qwen_Qwen3-4B_v3

luanagbmartins updated a dataset 32 minutes ago

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3

View all activity

models 13

CEIA-RL/energyv2-dpo-offline-GRPO

4B • Updated 1 day ago • 42

CEIA-RL/qwen3-4b-dw-lr-SLERP

Text Generation • 4B • Updated 15 days ago • 67

CEIA-RL/qwen3-4b-dw-lr-GRPO-mix-preference

Updated 15 days ago • 6

CEIA-RL/qwen3-4b-dw-lr-GRPO

Updated 15 days ago • 121

CEIA-RL/energy-exp1-dpo-offline

Text Generation • 4B • Updated 18 days ago • 140

CEIA-RL/energyv2-dpo-offline

Text Generation • 4B • Updated 19 days ago • 308

CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy-GRPO

Text Generation • 4B • Updated 25 days ago • 229

CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy

Text Generation • 4B • Updated May 6 • 133

CEIA-RL/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated May 4 • 5

CEIA-RL/qwen3-4b-dw-lr-dpo

Text Generation • 4B • Updated May 1 • 162

datasets 13

CEIA-RL/energy-eval-filtered_evaluations_v3

Updated 25 minutes ago • 26

CEIA-RL/energy-eval-filtered_responses_multichoice_Qwen_Qwen3-4B_v3

Updated 25 minutes ago

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3

Viewer • Updated 32 minutes ago • 447 • 14

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-GRPO_v3

Updated 39 minutes ago • 10

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy_v3

Viewer • Updated about 1 hour ago • 447 • 13

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline_v3

Viewer • Updated about 1 hour ago • 447 • 13

CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio_v3

Viewer • Updated about 1 hour ago • 447 • 15

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3

Updated about 19 hours ago • 9

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline-GRPO_v3

Updated about 21 hours ago • 6

CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio-v2_v3

Updated about 21 hours ago • 7

View 13 datasets