Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
5
6
Alexander
djalexj
Follow
samot-samoe's profile picture
21world's profile picture
rudskoy's profile picture
5 followers
·
1 following
agolubev13
alexander-golubev-ml
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards
authored
a paper
1 day ago
Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards
authored
a paper
1 day ago
Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents
View all activity
Organizations
djalexj
's models
16
Sort: Recently updated
djalexj/gte-large-en-v1.5-rm
Text Classification
•
0.4B
•
Updated
May 9, 2024
•
1
djalexj/gpt-neo-1.3B-rlhf-se-150steps-latest
Text Generation
•
Updated
May 12, 2023
•
1
djalexj/gpt-neo-1.3B-rlhf-se-150steps-lora-latest
Updated
May 12, 2023
djalexj/gpt-neo-1.3B-rlhf-se-250steps-latest
Text Generation
•
Updated
May 12, 2023
•
1
djalexj/gpt-neo-1.3B-rlhf-se-250steps-lora-latest
Updated
May 12, 2023
djalexj/gpt-neo-1.3B-rlhf-se-100steps
Text Generation
•
Updated
May 12, 2023
•
1
djalexj/gpt-neo-1.3B-rlhf-se-100steps-lora
Updated
May 12, 2023
djalexj/bert-base-cased-rm-se-100000steps
Text Classification
•
Updated
May 11, 2023
•
3
djalexj/bert-base-cased-rm-se-100000steps-lora
Updated
May 11, 2023
djalexj/gpt-neo-1.3B-sft-se-4000steps
Text Generation
•
Updated
May 10, 2023
•
3
djalexj/gpt-neo-1.3B-sft-se-4000steps-lora
Updated
May 10, 2023
djalexj/gpt-neo-1.3B-rlhf-se-250steps-lora
Updated
May 9, 2023
djalexj/bert-base-cased-rm-se-50000steps
Text Classification
•
Updated
May 9, 2023
•
1
djalexj/bert-base-cased-rm-se-50000steps-lora
Updated
May 9, 2023
djalexj/gpt-neo-1.3B-sft-se-1500steps
Text Generation
•
Updated
May 8, 2023
•
1
djalexj/gpt-neo-1.3B-sft-se-1500steps-lora
Updated
May 8, 2023