Does Synthetic Layered Design Data Benefit Layered Design Decomposition?
Abstract
Purely synthetic layered image data improves graphic design decomposition, enabling scalable training and better control over layer-count distributions than scarce proprietary or partially synthetic datasets.
Recent advances in image generation have made it easy to produce high-quality images. However, these outputs are inherently flattened, entangling foreground elements, background, and text within a fixed canvas. As a result, flexible post-generation editing remains challenging, revealing a clear last-mile gap toward practical usability. Existing approaches either rely on scarce proprietary layered assets or construct partially synthetic data from limited structural priors, and both strategies face fundamental scalability challenges. In this work, we investigate whether purely synthetic layered data can improve graphic design decomposition. We assume that, in graphic design, effective decomposition does not require modeling inter-layer dependencies as precisely as in natural-image composition, since design elements are often intentionally arranged as modular and semantically separable components. Concretely, we conduct a data-centric study built on the CLD baseline, a state-of-the-art layer decomposition framework. On top of this baseline, we construct our own synthetic dataset, SynLayers, generate textual supervision using vision-language models (VLMs), and automate inference inputs with VLM-predicted bounding boxes. Our study reveals three key findings: (1) training on purely synthetic data alone can outperform non-scalable alternatives such as the widely used PrismLayersPro dataset, demonstrating its viability as a scalable and effective substitute; (2) performance consistently improves as the training data scale increases, with gains beginning to saturate at around 50K samples; and (3) synthetic data enables balanced control over layer-count distributions, avoiding the layer-count imbalance commonly observed in real-world datasets. We hope this data-centric study encourages broader adoption of synthetic data as a practical foundation for layered design editing systems.
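To make finding (3) and the modularity assumption concrete, the following is a minimal sketch of how a synthetic layered sample with a balanced layer-count distribution could be produced. It is not the SynLayers pipeline: the function names, the solid-rectangle layers, the canvas size, and the 2 to 5 layer range are all assumptions chosen for illustration. Layers are placed independently of each other and flattened with the standard "over" alpha-compositing operator.

```python
import random
from collections import Counter


def balanced_layer_counts(num_samples, min_layers, max_layers, seed=0):
    """Assign a layer count to each sample so counts are as even as possible."""
    rng = random.Random(seed)
    choices = list(range(min_layers, max_layers + 1))
    # Cycle through the allowed counts, then shuffle so order carries no signal.
    counts = [choices[i % len(choices)] for i in range(num_samples)]
    rng.shuffle(counts)
    return counts


def random_layer(canvas_w, canvas_h, rng):
    """Hypothetical layer: a solid RGBA rectangle with a bounding box."""
    w = rng.randint(1, canvas_w)
    h = rng.randint(1, canvas_h)
    x = rng.randint(0, canvas_w - w)
    y = rng.randint(0, canvas_h - h)
    rgba = (rng.random(), rng.random(), rng.random(), rng.uniform(0.3, 1.0))
    return {"bbox": (x, y, w, h), "rgba": rgba}


def composite(layers, canvas_w, canvas_h, background=(1.0, 1.0, 1.0)):
    """Flatten layers back to front onto an opaque background ('over' operator)."""
    canvas = [[background for _ in range(canvas_w)] for _ in range(canvas_h)]
    for layer in layers:
        x, y, w, h = layer["bbox"]
        r, g, b, a = layer["rgba"]
        for row in range(y, y + h):
            for col in range(x, x + w):
                br, bg, bb = canvas[row][col]
                canvas[row][col] = (
                    a * r + (1 - a) * br,
                    a * g + (1 - a) * bg,
                    a * b + (1 - a) * bb,
                )
    return canvas


def make_dataset(num_samples, min_layers=2, max_layers=5, size=(16, 16), seed=0):
    """Build (layers, flattened image) pairs with a balanced layer-count mix."""
    rng = random.Random(seed)
    samples = []
    for n in balanced_layer_counts(num_samples, min_layers, max_layers, seed):
        # Each layer is sampled independently: no inter-layer dependency is modeled.
        layers = [random_layer(size[0], size[1], rng) for _ in range(n)]
        samples.append({"layers": layers, "flat": composite(layers, *size)})
    return samples


dataset = make_dataset(40)
histogram = Counter(len(s["layers"]) for s in dataset)
```

Each sample pairs the flattened canvas (the decomposition input) with its ground-truth layers and bounding boxes (the supervision). Because the generator chooses the layer count explicitly, the histogram is flat by construction, which is the kind of control that collected real-world layered assets do not offer.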
Community
A purely synthetic layered design dataset can indeed benefit the layer decomposition task.