best-of-Agent-Harnesses

AgentRL

Multitask, multiturn RL for LLM agents; Ray-based scaling, rollout/actor workers—for teams that want to train agents, not just run them.

trainingpython

Stars: 302
Adoption surface: complex
Autonomy: headless
Recovery: resumable
License: ✅ open-source
Category: Multi-agent and orchestration

Repository ↗ Example: Async GRPO trainer ↗

Related in Multi-agent and orchestration