Agent Lightning
Microsoft's training-oriented harness: optimization loops for agent behavior—when you need to improve policies over rollouts, not only score a fixed prompt.
evalstrainingpython
- Stars
- 17.3k
- Adoption surface
- complex
- Autonomy
- headless
- Recovery
- resumable
- License
- ✅ open-source
Repository ↗ Example: APO room-booking example ↗