YOLOv7 Finetuned Figure/Graph Detection Model

This repository provides a YOLOv7 model finetuned for detecting figures and graphs in document images, such as those found in scientific papers. The model and code are open for public use and research.

Model Overview

  • Base Model: YOLOv7
  • Task: Figure/Graph detection in document images
  • Finetuned on: Custom dataset of figures and graphs

Quick Start

1. Clone the repository and install dependencies

# Clone this repository
cd https://huggingface.co/vichetkao/graph_detection_model
python -m venv .venv
.\.venv\Scripts\activate
cd ../..
pip install -r requirements.txt

2. Download the Model Weights

Download the best.pt weights from this Hugging Face model page and place it in run/train/weights/best.pt or specify the path with --weights.

3. Run Detection

You can run detection on your images using the provided script:

python run_detect_testing.py --source "testing" --name testing_graph_bbox --device 0
  • --source: Path to your image folder or file (default: testing)
  • --weights: (Optional) Path to your model weights (best.pt)
  • --device: Set to 0 for GPU or cpu for CPU inference

4. Output

  • Results will be saved in run/detect/testing_graph_bbox
  • YOLO label files will be in run/detect/testing_graph_bbox/labels

Example Detection Results

Below are some example detection results from the model:

Detection Example 92 Detection Example 145 Detection Example 188

Evaluation Results

The following plots show the evaluation metrics and analysis of the model's performance:

Overall Detection Results GT vs Predicted Boxes per Image Distribution of TP, FP, FN per Image Detection Results by Language Group

import subprocess
subprocess.call([
    "python", "run_detect_testing.py",
    "--source", "testing",
    "--weights", "run/train/weights/best.pt",
    "--device", "0"
])

Files

  • run_detect_testing.py: Main script to run detection
  • detect.py: YOLOv7 detection logic
  • requirements.txt: Python dependencies
  • run_detect_testing.bat: Windows batch file for quick testing

Citation

If you use this model, please cite the original YOLOv7 paper and this repository.

License & Credits

This project is released under an open-source license for research and educational use.


Author & Credits

Author: Kao Vichet
Bachelor Student, Cambodia Academy of Digital Technology
AI Full Stack Developer Internship, Techo Startup Center
LinkedIn: https://www.linkedin.com/in/vichet-kao/


Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support