Papers
updated
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads
to Answers Faster
Paper
• 2311.08263
• Published
• 16
Exponentially Faster Language Modelling
Paper
• 2311.10770
• Published
• 119
Text Generation
• Updated
• 2.9k
• 666
Memory Augmented Language Models through Mixture of Word Experts
Paper
• 2311.10768
• Published
• 19
VMC: Video Motion Customization using Temporal Attention Adaption for
Text-to-Video Diffusion Models
Paper
• 2312.00845
• Published
• 39
DiffiT: Diffusion Vision Transformers for Image Generation
Paper
• 2312.02139
• Published
• 15
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved
Pre-Training
Paper
• 2401.00849
• Published
• 17
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Paper
• 2401.14404
• Published
• 18
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper
• 2401.15024
• Published
• 73
Larimar: Large Language Models with Episodic Memory Control
Paper
• 2403.11901
• Published
• 33
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Paper
• 2403.08764
• Published
• 36
Vid2Robot: End-to-end Video-conditioned Policy Learning with
Cross-Attention Transformers
Paper
• 2403.12943
• Published
• 15