Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
CEIA Reinforcement Learning
university
Activity Feed
Follow
7
AI & ML interests
None defined yet.
Recent Activity
Fazzioni
Â
updated
a model
about 7 hours ago
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B
Fazzioni
Â
published
a model
about 7 hours ago
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B
luanagbmartins
Â
updated
a model
about 20 hours ago
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
View all activity
Team members
5
spaces
1
pinned
Sleeping
Agents
LLMasJudgeEval
🥇
models
3
Sort:Â Recently updated
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
Text Generation
•
4B
•
Updated
about 3 hours ago
•
6.05k
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B
Updated
about 7 hours ago
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
Text Generation
•
4B
•
Updated
13 days ago
•
600
datasets
12
Sort:Â Recently updated
CEIA-RL/questions-GPT-OSS-120B-RL
Viewer
•
Updated
3 days ago
•
4.3k
•
28
CEIA-RL/questions-GPT-OSS-120B
Viewer
•
Updated
3 days ago
•
21.5k
•
41
CEIA-RL/Synthetic-Questions-Energy
Viewer
•
Updated
5 days ago
•
18.2k
•
28
CEIA-RL/Safety-Questions-Energy
Viewer
•
Updated
5 days ago
•
53.1k
•
55
CEIA-RL/synth_regulacao_eng_qa_v0
Viewer
•
Updated
19 days ago
•
2.32k
•
30
CEIA-RL/QA-Energy
Viewer
•
Updated
19 days ago
•
43
•
38
CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned
Viewer
•
Updated
20 days ago
•
45.1k
•
62
CEIA-RL/hh-rlhf-harmless-base-pt-BR
Viewer
•
Updated
21 days ago
•
44.8k
•
36
CEIA-RL/datasets-concat
Viewer
•
Updated
28 days ago
•
172k
•
19
CEIA-RL/energy_prompts
Viewer
•
Updated
Feb 27
•
1.56M
•
85
View 12 datasets