Text Generation
PEFT
Safetensors

OPD-Evolver

This repository provides PEFT/LoRA adapters for OPD-Evolver, a self-evolving agent framework introduced in the paper OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation.

OPD-Evolver is designed to help agents not only store experience, but also learn how to select useful experience, use it for task solving, write reusable knowledge, and maintain an evolving memory repository.

Released Adapters

This repository contains three adapter modules:

  • selector: selects useful experiences from memory.
  • executor: uses retrieved experience to solve the current task.
  • writer: writes reusable knowledge back into memory.

Note that the experience management module is not included in this version. We will update subsequently.

Resources

Citation

@misc{zhang2026opdevolver,
  title={OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation},
  author={Guibin Zhang and Xun Xu and Yanwei Yue and Zikun Su and Wangchunshu Zhou and Xiaobin Hu and Shuicheng Yan},
  year={2026},
  eprint={2606.17628},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2606.17628}
}
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for greeky/OPDEvolver

Finetuned
Qwen/Qwen3.5-9B
Adapter
(365)
this model

Paper for greeky/OPDEvolver