aiqwe (Jay Lee)

upvoted an article 5 months ago

Article

Introduction to State Space Models (SSM)

lbourdois

•

Jul 19, 2024

• 228

upvoted a paper 5 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 328

upvoted a collection 5 months ago

Deepseek Papers

Collection

Deepseek papers collection • 31 items • Updated about 16 hours ago • 350

upvoted an article 5 months ago

Article

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

NormalUhr

•

Feb 4, 2025

• 23

upvoted 3 articles about 1 year ago

Article

A Dive into Vision-Language Models

adirik, sayakpaul

•

Feb 3, 2023

• 84

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 537

Article

Mixture of Experts Explained

+4

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.14k

upvoted a paper almost 3 years ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 252

Jay Lee

AI & ML interests

Organizations

Introduction to State Space Models (SSM)

mHC: Manifold-Constrained Hyper-Connections

Deepseek Papers

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

A Dive into Vision-Language Models

Vision Language Models Explained

Mixture of Experts Explained

Llama 2: Open Foundation and Fine-Tuned Chat Models

Jay Lee

AI & ML interests

Organizations

aiqwe's activity

Introduction to State Space Models (SSM)

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

A Dive into Vision-Language Models

Vision Language Models Explained

Mixture of Experts Explained