Submitted by akhaliq 98 Design2Code: How Far Are We From Automating Front-End Engineering? · 5 authors 568 2
Submitted by akhaliq 71 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis · 17 authors 4
Submitted by akhaliq 30 OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on · 4 authors 6.52k 2
Submitted by akhaliq 30 MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies · 7 authors 186 6
Submitted by akhaliq 19 DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models HUAWEI Noah's Ark Lab 89 2
Submitted by akhaliq 16 TripoSR: Fast 3D Object Reconstruction from a Single Image · 10 authors 6.22k 3
Submitted by akhaliq 16 InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding · 10 authors 16 1
Submitted by akhaliq 15 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models · 10 authors 768 1
Submitted by akhaliq 9 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation · 7 authors 1
Submitted by akhaliq 9 ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models · 8 authors 379 1
Submitted by akhaliq 6 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos · 6 authors 465