-
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Paper • 2501.17161 • Published • 125 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 61 -
Training Compute-Optimal Large Language Models
Paper • 2203.15556 • Published • 11 -
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Paper • 2305.10601 • Published • 15
B
paperboyw11
AI & ML interests
None yet
Recent Activity
updated a collection about 2 months ago
Papers updated a collection about 2 months ago
Papers updated a collection 2 months ago
PapersOrganizations
None yet