
MoonTide PRO

MoonTideF

AI & ML interests

NLP, CV

Recent Activity

upvoted a paper about 1 month ago
DoPE: Denoising Rotary Position Embedding
commented on a paper about 1 month ago
DoPE: Denoising Rotary Position Embedding
new activity 2 months ago
GSAI-ML/LLaDA-8B-Instruct: Question about the chat template which ignores add_generation_prompt

Organizations

None yet

upvoted a paper about 1 month ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12 • 93
upvoted a collection over 1 year ago

🪐 SmolLM

Collection
A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 241
upvoted 2 papers almost 2 years ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 148

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 60