Hudson's picture

1 3 1

Hudson

Hudx111

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

upvoted a paper 2 days ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

new activity 3 days ago

harborframework/parity-experiments:add dabstep parity

View all activity

Organizations

None yet

upvoted a paper about 23 hours ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published Jan 17 • 32

upvoted a paper 2 days ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published 8 days ago • 46

New activity in harborframework/parity-experiments 3 days ago

add dabstep parity

#84 opened 7 days ago by

updated a dataset 4 days ago

Hudx111/dabstep-parity-results

Viewer • Updated 4 days ago • 520 • 10

published a dataset 7 days ago

Hudx111/dabstep-parity-results

Viewer • Updated 4 days ago • 520 • 10

upvoted a collection 3 months ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 834

liked a model over 1 year ago

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • 8B • Updated Jun 18, 2025 • 1.39M • • 4.38k

updated a model over 1 year ago

Hudx111/bloomz-560-m-peft-method

Updated May 21, 2024