Instructions to use parlange/deit-gravit-s3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- timm
How to use parlange/deit-gravit-s3 with timm:
import timm model = timm.create_model("hf_hub:parlange/deit-gravit-s3", pretrained=True) - Notebooks
- Google Colab
- Kaggle
🌌 deit-gravit-s3
🔭 This model is part of GraViT: Transfer Learning with Vision Transformers and MLP-Mixer for Strong Gravitational Lens Discovery
🔗 GitHub Repository: https://github.com/parlange/gravit
🛰️ Model Details
🤖 Model Type: DeiT
🧪 Experiment: S3 - C21-all-blocks-ResNet18-18660
🌌 Dataset: C21
🪐 Fine-tuning Strategy: all-blocks
🎲 Random Seed: 18660
💻 Quick Start
import torch
import timm
# Load the model directly from the Hub
model = timm.create_model(
'hf-hub:parlange/deit-gravit-s3',
pretrained=True
)
model.eval()
# Example inference
dummy_input = torch.randn(1, 3, 224, 224)
with torch.no_grad():
output = model(dummy_input)
predictions = torch.softmax(output, dim=1)
print(f"Lens probability: {predictions[0][1]:.4f}")
⚡️ Training Configuration
Training Dataset: C21 (Cañameras et al. 2021)
Fine-tuning Strategy: all-blocks
| 🔧 Parameter | 📝 Value |
|---|---|
| Batch Size | 192 |
| Learning Rate | AdamW with ReduceLROnPlateau |
| Epochs | 100 |
| Patience | 10 |
| Optimizer | AdamW |
| Scheduler | ReduceLROnPlateau |
| Image Size | 224x224 |
| Fine Tune Mode | all_blocks |
| Stochastic Depth Probability | 0.1 |
📈 Training Curves
🏁 Final Epoch Training Metrics
| Metric | Training | Validation |
|---|---|---|
| 📉 Loss | 0.0077 | 0.0527 |
| 🎯 Accuracy | 0.9971 | 0.9900 |
| 📊 AUC-ROC | 1.0000 | 0.9983 |
| ⚖️ F1 Score | 0.9971 | 0.9900 |
☑️ Evaluation Results
ROC Curves and Confusion Matrices
Performance across all test datasets (a through l) in the Common Test Sample (More et al. 2024):
📋 Performance Summary
Average performance across 12 test datasets from the Common Test Sample (More et al. 2024):
| Metric | Value |
|---|---|
| 🎯 Average Accuracy | 0.8787 |
| 📈 Average AUC-ROC | 0.8777 |
| ⚖️ Average F1-Score | 0.6524 |
📘 Citation
If you use this model in your research, please cite:
@misc{parlange2025gravit,
title={GraViT: Transfer Learning with Vision Transformers and MLP-Mixer for Strong Gravitational Lens Discovery},
author={René Parlange and Juan C. Cuevas-Tello and Octavio Valenzuela and Omar de J. Cabrera-Rosas and Tomás Verdugo and Anupreeta More and Anton T. Jaelani},
year={2025},
eprint={2509.00226},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2509.00226},
}
Model Card Contact
For questions about this model, please contact the author through: https://github.com/parlange/
- Downloads last month
- 9
Collection including parlange/deit-gravit-s3
Paper for parlange/deit-gravit-s3
Evaluation results
- Average Accuracy on Common Test Sample (More et al. 2024)self-reported0.879
- Average AUC-ROC on Common Test Sample (More et al. 2024)self-reported0.878
- Average F1-Score on Common Test Sample (More et al. 2024)self-reported0.652












