TRAIL

Trace reasoning and agentic issue localization; 148 long-context traces, 841 errors, 20+ error types; Hugging Face dataset.

Stars
19
Adoption surface
mostly simple
Autonomy
n/a
Recovery
n/a
License
✅ open-source
Category
Evaluation and benchmarking harnesses

Repository ↗ Example: TRAIL dataset card ↗

Related in Evaluation and benchmarking harnesses