GLiREL Multi-Domain Zero-Shot Relation Extraction

This model is a fine-tuned version of jackboyla/glirel-large-v0 for multi-domain zero-shot relation extraction.

Model Description

GLiREL (Generalist and Lightweight model for Relation Extraction) is a state-of-the-art model for zero-shot relation extraction. This version has been specifically fine-tuned on multi-domain data to improve performance across diverse domains in zero-shot scenarios.

Training Data

The model was trained on a multi-domain dataset with domain-based splits to ensure true zero-shot evaluation:

  • Training Examples: N/A
  • Training Domains: N/A
  • Relation Types: N/A
  • Entity Types: N/A

Key Features

  • Zero-shot relation extraction: Can extract relations for unseen relation types
  • Multi-domain capability: Trained on diverse domains for better generalization
  • Domain-based splitting: Training and evaluation use different domains for true zero-shot evaluation
  • Lightweight: Efficient inference while maintaining high performance

Usage

from glirel import GLiREL

# Load the model
model = GLiREL.from_pretrained("skv03/ner-span-glirel")

# Example usage
text = "John works at OpenAI in San Francisco."
labels = ["works_at", "located_in", "founded_by"]

# Extract relations
relations = model.predict_relations(text, labels)
print(relations)

Training Configuration

  • Base Model: jackboyla/glirel-large-v0
  • Training Steps: 15,000
  • Batch Size: 6
  • Learning Rate (Encoder): 1e-5
  • Learning Rate (Others): 5e-5
  • Max Length: 512
  • Evaluation Strategy: Every 4,000 steps
  • Zero-shot Setup: Domain-based splits (no domain overlap between train/test)

Model Architecture

  • Label Embedding Strategy: both (label + entity token)
  • Loss Function: Binary Cross Entropy
  • Scheduler: Cosine with Warmup
  • Dropout: 0.1
  • Max Types per Batch: 50

Performance

This model is designed for zero-shot relation extraction across multiple domains. Performance metrics will vary depending on the specific domains and relation types in your use case.

Limitations

  • Performance may vary significantly across different domains
  • Best suited for English text
  • Requires entity spans to be provided for relation extraction

Citation

If you use this model, please cite the original GLiREL paper:

@misc{boylan2025glirelgeneralistmodel,
      title={GLiREL -- Generalist Model for Zero-Shot Relation Extraction}, 
      author={Jack Boylan and Chris Hokamp and Demian Gholipour Ghalandari},
      year={2025},
      eprint={2501.03172},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2501.03172},
}

Model Card Authors

Created by the GLiREL fine-tuning team.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support