Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
Paper • 2606.11025 • Published • 40
How to use Eculid/sd3.5-flowdppo with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("Eculid/sd3.5-flowdppo", dtype=torch.bfloat16, device_map="cuda")
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]This repository hosts a Flow-DPPO checkpoint trained from Stable Diffusion 3.5 Medium using the UniRL Flow-DPPO recipe.
This checkpoint is provided as a community-contributed artifact for research use. It follows the usage and limitations of the Stable Diffusion 3.5 Medium base model.
Base model
stabilityai/stable-diffusion-3.5-medium