EEG-DINO Small โ€” Self-Distillation EEG Foundation Model

EEG-DINO-Small encoder pretrained with DINO-v2 hierarchical self-distillation (Wang et al., MICCAI 2025).

This is the eegdino-small-pretrained checkpoint for braindecode.models.EEGDINO, curated and re-uploaded as part of the OpenEEG-Bench effort.

Quick start

pip install braindecode[hub]
from braindecode.models import EEGDINO

model = EEGDINO.from_pretrained(
    "braindecode/eegdino-small-pretrained",
    n_outputs=2,            # set to your downstream task
    n_chans=19,
    sfreq=200,
)

from_pretrained reads both the architecture configuration (config.json) and the weights (model.safetensors or pytorch_model.bin) and returns a ready-to-fine-tune nn.Module.

Model details

Architecture braindecode.models.EEGDINO
Expected channels 19
Expected sampling frequency 200 Hz
Library braindecode โ‰ฅ 1.5
Loaded via huggingface_hub.PyTorchModelHubMixin (free with braindecode[hub])

For the full architecture description, parameter table, and references, see the rendered docstring at https://braindecode.org/stable/generated/braindecode.models.EEGDINO.html or in the interactive Model Explorer Space.

Training data

Temple University Hospital EEG Corpus (TUEG), 19 common 10-20 channels resampled to 200 Hz (>9000 hours), following CBraMod's preprocessing. Pretrained by hierarchical self-distillation.

Intended use

EEG feature extraction or fine-tuning for downstream classification (e.g., TUEV, TUAB). The classification head is re-initialized on load; fine-tune or linear-probe before use.

Limitations

  • Channel layout matters. Performance degrades when the input montage differs from the pretraining montage. Use the Interpolated* variant (where available) or resample channels with MNE before fine-tuning.
  • Sampling rate matters. Resample your data to 200 Hz before inference; the positional / patch embeddings assume this rate.
  • Inherited license restrictions. Downstream weights derived from this checkpoint inherit the license of the original training corpus (some braindecode pretraining corpora are CC-BY-NC). Verify the upstream dataset licence before commercial use.

Citation

If you use this checkpoint, please cite both the original architecture paper and braindecode.

@inproceedings{wang2025eegdino,
  title     = {{EEG-DINO}: Learning {EEG} Foundation Models via Hierarchical
               Self-Distillation},
  author    = {Wang, Xujia and Liu, Xuhui and Liu, Xi and Si, Qian and Xu,
               Zhaoliang and Li, Yang and Zhen, Xiantong},
  booktitle = {Medical Image Computing and Computer Assisted Intervention (MICCAI)},
  year      = {2025},
}

@article{aristimunha2025braindecode,
  title   = {Braindecode: a deep learning library for raw electrophysiological data},
  author  = {Aristimunha, Bruno and others},
  journal = {Zenodo},
  year    = {2025},
  doi     = {10.5281/zenodo.17699192},
}

License

BSD-3-Clause for the model code (matching braindecode). The pretraining data may impose additional restrictions โ€” see Limitations.

Downloads last month
24
Safetensors
Model size
4.54M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support