
O2trg64v3s

o2trg64v3s

AI & ML interests

None yet

Organizations

None yet

Recent Activity
upvoted a paper 3 days ago

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

Paper • 2605.21226 • Published 6 days ago • 9

liked a dataset 4 days ago

wegrthj/l36l5h-qi9l-data

Updated less than a minute ago • 21.6k • 3

upvoted a paper 4 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 12 days ago • 143

liked a dataset 5 days ago

kilicai/turkish-sft-formal-writing-10k

Viewer • Updated 5 days ago • 10k • 59 • 1

liked a model 8 days ago

GAi92/anima-base-1-loras-nsfw

Text-to-Image • Updated 3 days ago • 3

upvoted a paper 12 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 19 days ago • 229

liked a model 15 days ago

LLM-OS-Models/gemma-4-E4B-it-Terminal-SFT-Native-Liquid-1Epoch

Text Generation • 8B • Updated 12 days ago • 4.1k • 3

liked a model 19 days ago

BAAI/bge-small-en-v1.5

Feature Extraction • 33.4M • Updated Feb 22, 2024 • 50.8M • 471

upvoted a paper 21 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 23 days ago • 163

liked 2 models about 1 month ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 944k • 2.78k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 581k • 4.93k

upvoted 2 papers about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

The Geometric Alignment Tax: Tokenization vs. Continuous Geometry in Scientific Foundation Models

Paper • 2604.04155 • Published Apr 5 • 12

upvoted a paper about 2 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

liked a model about 2 months ago

qualcomm/Yolo-v5

Object Detection • Updated 6 days ago • 5 • 1

upvoted a paper about 2 months ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 365

liked a model about 2 months ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.91M • 3.03k

upvoted a paper about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

liked 2 models about 2 months ago

kulibinai/cadreasoner

2B • Updated Apr 1 • 735 • 3

Hebisuke/Qwen2.5-1.5B-Instruct_numbers_0.5B

Updated Apr 1 • 1
