Cyril Sterling's picture

10 8

Cyril Sterling

Cyril666

·

https://cyrilsterling.github.io/

CyrilSterling

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

Cyril666/whisper-large-v3-encoder

published a model 2 days ago

Cyril666/whisper-large-v3-encoder

upvoted a paper 8 days ago

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

View all activity

Organizations

upvoted 2 papers 8 days ago

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Paper • 2512.16561 • Published 8 days ago • 19

RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing

Paper • 2512.16864 • Published 8 days ago • 10

upvoted a paper 10 days ago

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Paper • 2512.13303 • Published 12 days ago • 16

upvoted a paper 10 months ago

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19 • 27

upvoted a collection 11 months ago

VideoLLaMA3

Frontier Multimodal Foundation Models for Video Understanding • 14 items • Updated Sep 25 • 15

upvoted a paper 11 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 89

upvoted 2 papers 12 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 106

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 46

upvoted a paper about 1 year ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 92

upvoted a paper over 1 year ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 57