Ramanauskiene Edita

EditaZ

https://github.com/EditaNEmilis

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted 2 papers 5 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 10 days ago • 125

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 10 days ago • 230

upvoted a paper about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 247

upvoted an article about 1 month ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • Dec 1, 2025 • 270

liked a model about 1 month ago

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated 3 days ago • 361k • 3.68k

reacted to danielhanchen's post with 🔥 about 1 month ago

Post

8515

Qwen3-Next can now be run locally! (30GB RAM)

The models come in Thinking and Instruct versions and use a new architecture that gives them ~10x faster inference than Qwen32B.
💜 Step-by-step Guide: https://docs.unsloth.ai/models/qwen3-next

Instruct GGUF: unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF
Thinking GGUF: unsloth/Qwen3-Next-80B-A3B-Thinking-GGUF

upvoted a paper about 2 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

upvoted a paper 2 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 177

upvoted a paper 3 months ago

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26, 2025 • 79

upvoted a paper 4 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 347

commented on a paper 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 661

upvoted a paper 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 661

upvoted an article 4 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 760

liked a Space 4 months ago

UGI Leaderboard

Uncensored General Intelligence Leaderboard

liked a Space 5 months ago

Demo Playground

The first journey begins here

upvoted 3 papers 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 268

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted a paper 6 months ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22, 2025 • 66

updated a Space 7 months ago

test-project