Upload README.md with huggingface_hub
README.md
CHANGED
|
@@ -83,14 +83,41 @@ Optional extras:
|
|
| 83 |
|
| 84 |
### Weights
|
| 85 |
|
| 86 |
-
Bernini-R
|
| 87 |
|
| 88 |
1. **Wan2.2 base**: [`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) on Hugging Face. Supplies the
|
| 89 |
VAE, UMT5 text encoder, tokenizer, and the transformer architecture/base weights.
|
| 90 |
It is downloaded automatically on first run (configured by `wan22_base` in
|
| 91 |
`configs/bernini_renderer_wan22/config.json`).
|
| 92 |
2. **Bernini-R checkpoint**: the trained high-noise / low-noise transformer weights
|
| 93 |
-
(safetensors) from [
|
| 94 |
`--high_noise_ckpt` / `--low_noise_ckpt`. Both a local directory and a Hugging
|
| 95 |
Face repo id are accepted.
|
| 96 |
|
|
@@ -99,7 +126,7 @@ Download models using huggingface-cli:
|
|
| 99 |
```bash
|
| 100 |
pip install -U "huggingface_hub"
|
| 101 |
hf download Wan-AI/Wan2.2-T2V-A14B-Diffusers --local-dir Wan2.2-T2V-A14B-Diffusers
|
| 102 |
-
hf download ByteDance/Bernini --local-dir Bernini
|
| 103 |
```
|
| 104 |
|
| 105 |
## Usage
|
|
@@ -112,7 +139,7 @@ keeps long prompts out of the command line. Each task has a directory under
|
|
| 112 |
[`assets/testcases/`](assets/testcases/) for the format and the bundled
|
| 113 |
`t2i` / `i2i` / `t2v` / `v2v` / `rv2v` / `r2v` examples.
|
| 114 |
|
| 115 |
-
### Prompt enhancer (recommended)
|
| 116 |
|
| 117 |
`--use_pe` enhances the prompt through an OpenAI-compatible endpoint and is
|
| 118 |
recommended for best generation quality. The `openai` SDK is installed by
|
|
|
|
| 83 |
|
| 84 |
### Weights
|
| 85 |
|
| 86 |
+
Bernini-R provides two ways to obtain the renderer weights. The **diffusers
|
| 87 |
+
format is recommended**: it is a self-contained diffusers-format directory whose
|
| 88 |
+
`transformer` / `transformer_2` already hold the Bernini-R weights, so you point
|
| 89 |
+
`--config` at it and the weights load directly, with **no** `--high_noise_ckpt` /
|
| 90 |
+
`--low_noise_ckpt` needed.
|
| 91 |
+
|
| 92 |
+
#### Option A: diffusers format (recommended)
|
| 93 |
+
|
| 94 |
+
A single ready-to-use diffusers-format model from
|
| 95 |
+
[`ByteDance/Bernini-R-Diffusers`](https://huggingface.co/ByteDance/Bernini-R-Diffusers).
|
| 96 |
+
It bundles the Wan2.2 base components (VAE, UMT5 text encoder, tokenizer) together
|
| 97 |
+
with the Bernini-R transformer weights, so nothing else is downloaded at runtime.
|
| 98 |
+
|
| 99 |
+
```bash
|
| 100 |
+
pip install -U "huggingface_hub"
|
| 101 |
+
hf download ByteDance/Bernini-R-Diffusers --local-dir Bernini-R-Diffusers
|
| 102 |
+
```
|
| 103 |
+
|
| 104 |
+
Then pass it via `--config` and omit the checkpoint flags, e.g.:
|
| 105 |
+
|
| 106 |
+
```bash
|
| 107 |
+
python infer_single_gpu.py --config Bernini-R-Diffusers \
|
| 108 |
+
--case assets/testcases/t2i/t2i.json --num_frames 1
|
| 109 |
+
```
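
Before launching inference, it can help to confirm that the downloaded directory has the layout described above. The following is a minimal sketch, not part of Bernini-R: the helper name `missing_components` is hypothetical, and while `transformer` / `transformer_2` are named in this README, the `vae`, `text_encoder`, and `tokenizer` subfolder names are assumed from the usual diffusers layout.

```python
from pathlib import Path

# `transformer` / `transformer_2` are named in the README text; the remaining
# three names follow the usual diffusers layout and are an assumption here.
EXPECTED_SUBFOLDERS = (
    "transformer",
    "transformer_2",
    "vae",
    "text_encoder",
    "tokenizer",
)


def missing_components(model_dir, expected=EXPECTED_SUBFOLDERS):
    """Return the expected subfolders that are absent from model_dir."""
    root = Path(model_dir)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    # Directory name matches the --local-dir used in the download command.
    missing = missing_components("Bernini-R-Diffusers")
    if missing:
        print("Missing components:", ", ".join(missing))
    else:
        print("All expected components found.")
```

An empty list means every expected subfolder is present; it does not verify that the weight files inside them are complete.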
|
| 110 |
+
|
| 111 |
+
#### Option B: separate checkpoints
|
| 112 |
+
|
| 113 |
+
The original layout, where Bernini-R uses two sets of weights loaded separately:
|
| 114 |
|
| 115 |
1. **Wan2.2 base**: [`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) on Hugging Face. Supplies the
|
| 116 |
VAE, UMT5 text encoder, tokenizer, and the transformer architecture/base weights.
|
| 117 |
It is downloaded automatically on first run (configured by `wan22_base` in
|
| 118 |
`configs/bernini_renderer_wan22/config.json`).
|
| 119 |
2. **Bernini-R checkpoint**: the trained high-noise / low-noise transformer weights
|
| 120 |
+
(safetensors) from [ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R), passed with
|
| 121 |
`--high_noise_ckpt` / `--low_noise_ckpt`. Both a local directory and a Hugging
|
| 122 |
Face repo id are accepted.
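
How the inference scripts resolve a value that may be either a local directory or a repo id is not shown in this README. One plausible approach is sketched below, assuming `huggingface_hub.snapshot_download` is used for the remote case; the function name `resolve_checkpoint` and its `download` parameter are hypothetical and exist only for illustration.

```python
import os


def resolve_checkpoint(path_or_repo_id, download=None):
    """Return a local directory for a checkpoint argument.

    An existing local directory is used as-is; anything else is treated
    as a Hugging Face repo id and fetched into the local cache.
    """
    if os.path.isdir(path_or_repo_id):
        return os.path.abspath(path_or_repo_id)
    if download is None:
        # Imported lazily so purely local use does not need huggingface_hub.
        from huggingface_hub import snapshot_download
        download = snapshot_download
    # snapshot_download returns the path of the downloaded snapshot folder.
    return download(repo_id=path_or_repo_id)
```

Checking for a local directory first means a folder named like a repo id takes precedence over the Hub, which keeps offline runs predictable.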
|
| 123 |
|
|
|
|
| 126 |
```bash
|
| 127 |
pip install -U "huggingface_hub"
|
| 128 |
hf download Wan-AI/Wan2.2-T2V-A14B-Diffusers --local-dir Wan2.2-T2V-A14B-Diffusers
|
| 129 |
+
hf download ByteDance/Bernini-R --local-dir Bernini-R
|
| 130 |
```
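
The same downloads can also be scripted from Python. This is a sketch under the assumption that `huggingface_hub.snapshot_download` with `repo_id` and `local_dir` is an acceptable substitute for the `hf download` commands above; the `REPOS` mapping and `download_all` helper are hypothetical names introduced for illustration.

```python
# Repo ids and target directories mirror the `hf download` commands above.
REPOS = {
    "Wan-AI/Wan2.2-T2V-A14B-Diffusers": "Wan2.2-T2V-A14B-Diffusers",
    "ByteDance/Bernini-R": "Bernini-R",
}


def download_all(repos=REPOS, fetch=None):
    """Download each repo into its local directory and return the paths."""
    if fetch is None:
        # Imported lazily so the module can be loaded without the dependency.
        from huggingface_hub import snapshot_download
        fetch = snapshot_download
    return {
        repo_id: fetch(repo_id=repo_id, local_dir=local_dir)
        for repo_id, local_dir in repos.items()
    }
```

Calling `download_all()` with no arguments fetches both repositories; passing a custom `fetch` callable makes the helper easy to dry-run without network access.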
|
| 131 |
|
| 132 |
## Usage
|
|
|
|
| 139 |
[`assets/testcases/`](assets/testcases/) for the format and the bundled
|
| 140 |
`t2i` / `i2i` / `t2v` / `v2v` / `rv2v` / `r2v` examples.
|
| 141 |
|
| 142 |
+
### Prompt enhancer (highly recommended)
|
| 143 |
|
| 144 |
`--use_pe` enhances the prompt through an OpenAI-compatible endpoint and is
|
| 145 |
recommended for best generation quality. The `openai` SDK is installed by
|