Abstract
ArcDeck is a multi-agent framework that enhances paper-to-slide generation by modeling logical flow through discourse trees and iterative agent refinement, outperforming direct summarization methods.
We introduce ArcDeck, a multi-agent framework that formulates paper-to-slide generation as a structured narrative reconstruction task. Unlike existing methods that directly summarize raw text into slides, ArcDeck explicitly models the source paper's logical flow. It first parses the input to construct a discourse tree and establish a global commitment document, ensuring the high-level intent is preserved. These structural priors then guide an iterative multi-agent refinement process, where specialized agents iteratively critique and revise the presentation outline before rendering the final visual layouts and designs. To evaluate our approach, we also introduce ArcBench, a newly curated benchmark of academic paper-slide pairs. Experimental results demonstrate that explicit discourse modeling, combined with role-specific agent coordination, significantly improves the narrative flow and logical coherence of the generated presentations.
Community
ArcDeck is an end-to-end slide generation framework that converts academic PDF papers into polished .pptx presentation slides. ArcDeck frames slide generation around the paper’s narrative structure instead of simple summarization. By combining narrative-driven outline generation with visually strong slide rendering, ArcDeck produces polished and engaging slide decks.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Story2Proposal: A Scaffold for Structured Scientific Paper Writing (2026)
- Camera Artist: A Multi-Agent Framework for Cinematic Language Storytelling Video Generation (2026)
- coDrawAgents: A Multi-Agent Dialogue Framework for Compositional Image Generation (2026)
- LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization (2026)
- DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation (2026)
- Mind-of-Director: Multi-modal Agent-Driven Film Previsualization via Collaborative Decision-Making (2026)
- EvoDiagram: Agentic Editable Diagram Creation via Design Expertise Evolution (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2604.11969 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper