File size: 1,816 Bytes
43bdfe8 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 | ---
license: mit
tags:
- vector-quantization
- image-tokenizer
- codebook-regularization
- icml2026
datasets:
- imagenet-1k
---
# DimVQ: Unveiling And Addressing Dimensional Collapse In Vector Quantization Models Via Codebook Regularization
Official pre-trained checkpoints for the **ICML 2026** paper.
## Model Description
DimVQ identifies **dimensional collapse** in vector quantization models and proposes a simple **codebook regularization** to restore suppressed low-variance components. This regularization bridges the spectral gap between discrete codebook spaces and continuous representations.
## Available Checkpoints
| File | Model | Resolution | Codebook Size (K) | Embedding Dim (D) |
|------|-------|-----------|-------------------|-------------------|
| `simvq_K65536/65536.ckpt` | SimVQ + Codebook Reg. | 128x128 | 65,536 | 128 |
| `simvq_K65536/65536.yaml` | Config for above | - | - | - |
| `simvq_K262144/262144.ckpt` | SimVQ + Codebook Reg. | 128x128 | 262,144 | 128 |
| `simvq_K262144/262144.yaml` | Config for above | - | - | - |
## Usage
```python
# Load checkpoint
import torch
checkpoint = torch.load("262144.ckpt", map_location="cpu")
model.load_state_dict(checkpoint["state_dict"])
```
## TODO
- [ ] IBQ checkpoints (K=16384, K=262144, 256x256)
- [ ] Downstream autoregressive generation models (IBQ-B, IBQ-L, IBQ-XXL)
## Citation
```bibtex
@inproceedings{zhang2026dimvq,
title={Unveiling And Addressing Dimensional Collapse In Vector Quantization Models Via Codebook Regularization},
author={Zhang, Fang and Zhu, Yongxin and Liu, Yihao and Fu, Bin and Xu, Linli},
booktitle={International Conference on Machine Learning (ICML)},
year={2026}
}
```
## Links
- [Paper (arXiv)](https://arxiv.org/abs/TODO)
- [Code (GitHub)](https://github.com/ksblk2116/dimvq)
|