Image-Text-to-Video
Safetensors
bernini_renderer
Commit bcede06 (verified) by SsyzeChen · 1 parent: 6976951

Upload README.md with huggingface_hub

  1. README.md +31 -4
README.md CHANGED
@@ -83,14 +83,41 @@ Optional extras:
 
  ### Weights
 
- Bernini-R uses two sets of weights:
 
  1. **Wan2.2 base**: [`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) on Hugging Face. Supplies the
  VAE, UMT5 text encoder, tokenizer, and the transformer architecture/base weights.
  It is downloaded automatically on first run (configured by `wan22_base` in
  `configs/bernini_renderer_wan22/config.json`).
  2. **Bernini-R checkpoint**: the trained high-noise / low-noise transformer weights
- (safetensors) from [Hugging Face](https://huggingface.co/ByteDance/Bernini), passed with
  `--high_noise_ckpt` / `--low_noise_ckpt`. Both a local directory and a Hugging
  Face repo id are accepted.
 
@@ -99,7 +126,7 @@ Download models using huggingface-cli:
  ```bash
  pip install -U "huggingface_hub"
  hf download Wan-AI/Wan2.2-T2V-A14B-Diffusers --local-dir Wan2.2-T2V-A14B-Diffusers
- hf download ByteDance/Bernini --local-dir Bernini
  ```
 
  ## 🚀 Usage
@@ -112,7 +139,7 @@ keeps long prompts out of the command line. Each task has a directory under
  [`assets/testcases/`](assets/testcases/) for the format and the bundled
  `t2i` / `i2i` / `t2v` / `v2v` / `rv2v` /`r2v` examples.
 
- ### Prompt enhancer (recommended)
 
  `--use_pe` enhances the prompt through an OpenAI-compatible endpoint and is
  recommended for best generation quality. The `openai` SDK is installed by
 
 
  ### Weights
 
+ Bernini-R provides two ways to obtain the renderer weights. The **diffusers
+ format is recommended**: it is a self-contained diffusers-format directory whose
+ `transformer` / `transformer_2` already hold the Bernini-R weights, so you point
+ `--config` at it and the weights load directly, with **no** `--high_noise_ckpt` /
+ `--low_noise_ckpt` needed.
+
+ #### Option A: diffusers format (recommended)
+
+ A single ready-to-use diffusers-format model from
+ [`ByteDance/Bernini-R-Diffusers`](https://huggingface.co/ByteDance/Bernini-R-Diffusers).
+ It bundles the Wan2.2 base components (VAE, UMT5 text encoder, tokenizer) together
+ with the Bernini-R transformer weights, so nothing else is downloaded at runtime.
+
+ ```bash
+ pip install -U "huggingface_hub"
+ hf download ByteDance/Bernini-R-Diffusers --local-dir Bernini-R-Diffusers
+ ```
+
+ Then pass it via `--config` and omit the checkpoint flags, e.g.:
+
+ ```bash
+ python infer_single_gpu.py --config Bernini-R-Diffusers \
+ --case assets/testcases/t2i/t2i.json --num_frames 1
+ ```
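
Editor's note (not part of the commit): the Option A text says the downloaded directory carries the Bernini-R weights in `transformer` / `transformer_2`. A minimal sketch of a pre-flight check built on that statement follows. The helper name `is_diffusers_layout` and the `MODEL_DIR` variable are hypothetical, not part of Bernini-R, and the check only looks for the two sub-directories the README names; it does not validate the weights themselves. The command is echoed rather than executed.

```shell
#!/bin/sh
# Sketch only: `is_diffusers_layout` is a hypothetical helper, not part of Bernini-R.

# Succeeds when the directory given as $1 contains the two transformer
# sub-directories that the README names for the diffusers-format layout.
is_diffusers_layout() {
  [ -d "$1/transformer" ] && [ -d "$1/transformer_2" ]
}

# Assumed default: the --local-dir used in the download command above.
model_dir="${MODEL_DIR:-Bernini-R-Diffusers}"

if is_diffusers_layout "$model_dir"; then
  # Dry run: print the Option A command instead of launching inference.
  echo "python infer_single_gpu.py --config $model_dir --case assets/testcases/t2i/t2i.json --num_frames 1"
else
  echo "$model_dir does not look like a diffusers-format directory" >&2
fi
```

If the check fails, the download may be incomplete, or the directory may hold the separate checkpoints described under Option B instead.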
+
+ #### Option B: separate checkpoints
+
+ The original layout, where Bernini-R uses two sets of weights loaded separately:
 
  1. **Wan2.2 base**: [`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) on Hugging Face. Supplies the
  VAE, UMT5 text encoder, tokenizer, and the transformer architecture/base weights.
  It is downloaded automatically on first run (configured by `wan22_base` in
  `configs/bernini_renderer_wan22/config.json`).
  2. **Bernini-R checkpoint**: the trained high-noise / low-noise transformer weights
+ (safetensors) from [ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R), passed with
  `--high_noise_ckpt` / `--low_noise_ckpt`. Both a local directory and a Hugging
  Face repo id are accepted.
 
 
  ```bash
  pip install -U "huggingface_hub"
  hf download Wan-AI/Wan2.2-T2V-A14B-Diffusers --local-dir Wan2.2-T2V-A14B-Diffusers
+ hf download ByteDance/Bernini-R --local-dir Bernini-R
  ```
 
  ## 🚀 Usage
 
  [`assets/testcases/`](assets/testcases/) for the format and the bundled
  `t2i` / `i2i` / `t2v` / `v2v` / `rv2v` /`r2v` examples.
 
+ ### Prompt enhancer (highly recommended)
 
  `--use_pe` enhances the prompt through an OpenAI-compatible endpoint and is
  recommended for best generation quality. The `openai` SDK is installed by