Kabir Hamzah Muhammad

marshallhamzah · https://marshall-mk.github.io/

AI & ML interests

Computer Vision | Medical Imaging | VLMs

Recent Activity

upvoted a paper 20 days ago

Latent Diffusion Model without Variational Autoencoder

upvoted a paper 20 days ago

Transformers without Normalization

upvoted a paper 20 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Organizations

None yet

upvoted 8 papers 20 days ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 50

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 172

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 217

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 273

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 254

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Paper • 2509.09680 • Published Sep 11, 2025 • 44

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Paper • 2509.06942 • Published Sep 8, 2025 • 18

upvoted an article 12 months ago

Article

The Annotated Diffusion Model

Jun 7, 2022 • 337