| ---
|
| license: cc-by-nc-3.0
|
| pipeline_tag: image-to-image
|
| tags:
|
| - vision
|
| - document-processing
|
| - binarization
|
| - segmentation
|
| ---
|
|
|
| # Tzefa Binarization Model (mit_b5 HighResMAnet)
|
|
|
| Custom-trained document binarization model for the Tzefa OCR pipeline.
|
|
|
| ## Architecture
|
| - **Encoder:** MiT-B5 (Mix Transformer)
|
| - **Decoder:** MAnet with custom High-Resolution Stem + Fusion Head
|
| - **Framework:** segmentation-models-pytorch
|
| - **Input:** RGB image tiles (640x640)
|
| - **Output:** Binary mask (ink=0, paper=255)
|
|
|
| ## Usage
|
| ```python
|
| from huggingface_hub import hf_hub_download
|
| import torch
|
|
|
| # Download weights
|
| ckpt_path = hf_hub_download("WARAJA/b5_model", "b5_model.pth")
|
|
|
| # Load model (see Tzefa Binarization Space for full architecture code)
|
| checkpoint = torch.load(ckpt_path, map_location="cpu")
|
| ```
|
|
|
| ## Related
|
| - [Binarization Demo](https://huggingface.co/spaces/WARAJA/Tzefa-Binarization)
|
| - [Full Tzefa Pipeline](https://huggingface.co/spaces/WARAJA/Tzefa)
|
| - [Binarization Dataset](https://huggingface.co/datasets/WARAJA/Tzefa-Binarization-Dataset)
|
| |