I've built and deployed Panorama FLUX, a Gradio app for creating ultra-wide panoramic images from three different text prompts using the FLUX.1-schnell model.
It uses a custom "Mixture of Diffusers" pipeline to generate and seamlessly blend each section of the image.
Key Features:
- Multi-Prompt Input: Control the left, center, and right of the scene with unique prompts.
- Seamless Blending: Choose between Cosine and Gaussian blending methods to eliminate seams between tiles.
- Optimized for FLUX.1-schnell: Designed for fast, 4-step generation with embedded guidance.
- Multi-Language Support: On-the-fly translation for prompts written in Korean and Chinese.
- Memory Efficient: Supports both custom (mmgp) and standard diffusers offloading for use on consumer GPUs or in Spaces.
This was a fun project that involved deep-diving into the FLUX architecture to get the tiling, guidance, and positional embeddings right.
Try it out!
š Live Demo on Hugging Face Spaces:
elismasilva/flux-1-panorama