16 19

Haoran Song

charles-martin4

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 minutes ago

Forecasting Downstream Performance of LLMs With Proxy Metrics

liked a dataset about 10 hours ago

vctvct321/pointo

liked a model 6 days ago

tencent/Hy-MT2-1.8B

View all activity

Organizations

None yet

upvoted a paper 3 minutes ago

Forecasting Downstream Performance of LLMs With Proxy Metrics

Paper • 2605.18607 • Published 10 days ago • 14

liked a dataset about 10 hours ago

vctvct321/pointo

Updated 2 minutes ago • 63.5k • 9

liked a model 6 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 2 days ago • 7.47k • • 1.07k

upvoted a paper 6 days ago

MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems

Paper • 2605.18565 • Published 9 days ago • 4

upvoted 2 papers 7 days ago

Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP models

Paper • 2605.15961 • Published 13 days ago • 9

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Paper • 2605.13641 • Published 15 days ago • 49

upvoted a paper 8 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 16 days ago • 191

liked a model 10 days ago

BAAI/bge-small-en-v1.5

Feature Extraction • 33.4M • Updated Feb 22, 2024 • 51.4M • • 472

upvoted a paper 14 days ago

Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training

Paper • 2511.07328 • Published 24 days ago • 16

upvoted a paper 17 days ago

EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions

Paper • 2602.00095 • Published 28 days ago • 3

liked a model 21 days ago

albkue/car-parts-yolov8

Updated 21 days ago • 1

liked a dataset 27 days ago

Brainada/GEPA-M

Updated 27 days ago • 51 • 1

upvoted 2 papers about 1 month ago

Significance and Stability Analysis of Gene-Environment Interaction using RGxEStat

Paper • 2604.03337 • Published Apr 3 • 1

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 242

liked a model about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 816 • 908

liked a dataset about 2 months ago

cais/hle

Benchmark • Updated Jan 20 • 2.5k • 43.9k • 808

upvoted a paper about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

liked a dataset about 2 months ago

chinh02/UIBenchKit

Updated Apr 12 • 4.88k • 2

upvoted a paper about 2 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

liked a dataset about 2 months ago

roneneldan/TinyStories

Viewer • Updated Aug 12, 2024 • 2.14M • 91.4k • 1k

Haoran Song

AI & ML interests

Recent Activity

Organizations

charles-martin4's activity