DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 469
GasTwinFormer: A Hybrid Vision Transformer for Livestock Methane Emission Segmentation and Dietary Classification in Optical Gas Imaging Paper • 2508.15057 • Published Aug 20, 2025 • 1
WeedSense: Multi-Task Learning for Weed Segmentation, Height Estimation, and Growth Stage Classification Paper • 2508.14486 • Published Aug 20, 2025
Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging Paper • 2404.10841 • Published Apr 16, 2024
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper • 2401.09417 • Published Jan 17, 2024 • 62
FiT: Flexible Vision Transformer for Diffusion Model Paper • 2402.12376 • Published Feb 19, 2024 • 48
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 34
Solar Event Tracking with Deep Regression Networks: A Proof of Concept Evaluation Paper • 1911.08350 • Published Nov 19, 2019