GLiREL Multi-Domain Zero-Shot Relation Extraction

This model is a fine-tuned version of jackboyla/glirel-large-v0 for multi-domain zero-shot relation extraction.

Model Description

GLiREL (Generalist and Lightweight model for Relation Extraction) is a state-of-the-art model for zero-shot relation extraction. This version has been specifically fine-tuned on multi-domain data to improve performance across diverse domains in zero-shot scenarios.

Training Data

The model was trained on a multi-domain dataset with domain-based splits to ensure true zero-shot evaluation:

Training Examples: N/A
Training Domains: N/A
Relation Types: N/A
Entity Types: N/A

Key Features

Zero-shot relation extraction: Can extract relations for unseen relation types
Multi-domain capability: Trained on diverse domains for better generalization
Domain-based splitting: Training and evaluation use different domains for true zero-shot evaluation
Lightweight: Efficient inference while maintaining high performance

Usage

from glirel import GLiREL

# Load the model
model = GLiREL.from_pretrained("skv03/ner-span-glirel")

# Example usage
text = "John works at OpenAI in San Francisco."
labels = ["works_at", "located_in", "founded_by"]

# Extract relations
relations = model.predict_relations(text, labels)
print(relations)

Training Configuration

Base Model: jackboyla/glirel-large-v0
Training Steps: 15,000
Batch Size: 6
Learning Rate (Encoder): 1e-5
Learning Rate (Others): 5e-5
Max Length: 512
Evaluation Strategy: Every 4,000 steps
Zero-shot Setup: Domain-based splits (no domain overlap between train/test)

Model Architecture

Label Embedding Strategy: both (label + entity token)
Loss Function: Binary Cross Entropy
Scheduler: Cosine with Warmup
Dropout: 0.1
Max Types per Batch: 50

Performance

This model is designed for zero-shot relation extraction across multiple domains. Performance metrics will vary depending on the specific domains and relation types in your use case.

Limitations

Performance may vary significantly across different domains
Best suited for English text
Requires entity spans to be provided for relation extraction

Citation

If you use this model, please cite the original GLiREL paper:

@misc{boylan2025glirelgeneralistmodel,
      title={GLiREL -- Generalist Model for Zero-Shot Relation Extraction}, 
      author={Jack Boylan and Chris Hokamp and Demian Gholipour Ghalandari},
      year={2025},
      eprint={2501.03172},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2501.03172},
}

Model Card Authors

Created by the GLiREL fine-tuning team.

Downloads last month: -; Downloads are not tracked for this model. How to track