Submitted by akhaliq 141 DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models · DeepSeek 3.18k 6
Submitted by akhaliq 28 OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models · 7 authors 1.66k 4
Submitted by akhaliq 23 Rethinking Interpretability in the Era of Large Language Models · 5 authors 175 1
Submitted by akhaliq 20 LiPO: Listwise Preference Optimization through Learning-to-Rank · 12 authors 6
Submitted by akhaliq 19 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion · 8 authors 1
Submitted by akhaliq 19 InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions · 6 authors 132 1
Submitted by akhaliq 17 Shortened LLaMA: A Simple Depth Pruning for Large Language Models · 7 authors 1
Submitted by akhaliq 17 Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities · 6 authors 5
Submitted by akhaliq 16 Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization · 13 authors 603 2
Submitted by akhaliq 13 Rethinking Optimization and Architecture for Tiny Language Models · 10 authors 127 1
Submitted by akhaliq 8 DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing · 5 authors 1