Gabriel Mongaras PRO

gmongaras

https://gmongaras.me/

AI & ML interests

None yet

Recent Activity

liked a Space 8 days ago

microsoft/TRELLIS.2

Space • High-fidelity 3D Generation from images • 665

upvoted a paper 20 days ago

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Paper • 2512.08829 • Published 22 days ago • 18

upvoted a paper 28 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 30 days ago • 242

upvoted 2 papers about 2 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 201

Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer

Paper • 2510.25976 • Published Oct 29, 2025 • 14

upvoted an article 2 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

Article • Oct 30, 2025

upvoted 2 papers 3 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 500

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30, 2025 • 55

upvoted an article 3 months ago

There is no such thing as a tokenizer-free lunch

Article • Sep 25, 2025

updated 2 datasets 4 months ago

gmongaras/CC12M_and_Imagenet21K_Recap

Viewer • Updated Sep 17, 2025 • 22.7M • 6.99k • 7

gmongaras/Imagenet21K_Recaption

Viewer • Updated Sep 17, 2025 • 13.1M • 4.25k • 9

commented a paper 4 months ago

Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling

Paper • 2509.00605 • Published Aug 30, 2025 • 42

authored a paper 5 months ago

On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective

Paper • 2507.23632 • Published Jul 31, 2025 • 6

commented a paper 5 months ago

On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective

Paper • 2507.23632 • Published Jul 31, 2025 • 6

liked a model 5 months ago

ACE-Step/ACE-Step-v1-3.5B

Text-to-Audio • Updated May 22, 2025 • 656

upvoted a paper 6 months ago

A Systematic Analysis of Hybrid Linear Attention

Paper • 2507.06457 • Published Jul 8, 2025 • 25

liked a model 6 months ago

kyutai/tts-1.6b-en_fr

Text-to-Speech • Updated Sep 11, 2025 • 102k • 359

upvoted a paper 6 months ago

Fast and Simplex: 2-Simplicial Attention in Triton

Paper • 2507.02754 • Published Jul 3, 2025 • 25

published a dataset 7 months ago

gmongaras/ReLaion-10TB

Updated May 21, 2025 • 1

updated a model 8 months ago

gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_13batchsize_stage3

Updated May 14, 2025