Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
paperboyw11 's Collections
Papers

Papers

updated Apr 9
Upvote
-

  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28, 2025 • 125

  • LoRA: Low-Rank Adaptation of Large Language Models

    Paper • 2106.09685 • Published Jun 17, 2021 • 61

  • Training Compute-Optimal Large Language Models

    Paper • 2203.15556 • Published Mar 29, 2022 • 11

  • Tree of Thoughts: Deliberate Problem Solving with Large Language Models

    Paper • 2305.10601 • Published May 17, 2023 • 15

  • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

    Paper • 2402.03300 • Published Feb 5, 2024 • 145
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs