• All detected tissue tiles are encoded (not a sampled subset) • Features can be downloaded per slide instead of large ZIP archives • QC overlay images are provided for visual inspection • UNI2-h 1536-D tile embeddings stored in H5 format • Organized by TCGA project for easier use in MIL / retrieval pipelines
Example layout:
TCGA-HNSC/
features/*.h5
vis/*__overlay.png
Hope this helps others working on computational pathology and TCGA WSI research.
• All detected tissue tiles are encoded (not a sampled subset) • Features can be downloaded per slide instead of large ZIP archives • QC overlay images are provided for visual inspection • UNI2-h 1536-D tile embeddings stored in H5 format • Organized by TCGA project for easier use in MIL / retrieval pipelines
Example layout:
TCGA-HNSC/
features/*.h5
vis/*__overlay.png
Hope this helps others working on computational pathology and TCGA WSI research.