Coding harness configs and SDKs
Skill packs, slash-command libraries, meta-prompting frameworks, and official SDKs that give you the harness (the agent loop, planning, memory, hooks) without bundling a specific IDE or CLI shell.
| # | Project | Stars | Tier | OSS | Description |
|---|---|---|---|---|---|
| 1 | superpowers memoryide | 235k | complex | ✅ | Performance-oriented harness pack for Claude Code, Codex, OpenCode, Cursor: skills, instincts, memory, security, research-first workflows. Treats harness engineering itself as the performance lever. |
| 2 | everything-claude-code multi-agent | 219k | complex | ✅ | The breakout 2026 harness pack for Claude Code: 28 specialized subagents, 119 reusable skills, 60 slash commands, 34 rules, 20+ automated hooks. Ships a full "AI engineering team" as config. |
| 3 | Anthropic Skills | 153k | mostly simple | ✅ | Anthropic's official Agent Skills repository: SKILL.md-based folders (instructions, scripts, resources) Claude dynamically loads on Claude Code, Claude.ai, and the API. The reference for progressive-disclosure skill packs in 2026. |
| 4 | GStack typescript | 112k | slightly complex | ✅ | Garry Tan's Claude Code skill stack: 23 slash-command modes (CEO/eng/design review, QA, ship, browse, retro, …) that structure one assistant as a virtual engineering team. Daily driver while running YC. |
| 5 | get-shit-done clipython | 64.4k | mostly simple | ✅ | Goal-backward planning and wave-based execution over fresh context windows; avoids context rot by design. Python/JS meta-prompting for Claude Code, OpenCode, Gemini CLI. |
| 6 | SWE-agent memoryevalspython | 19.6k | slightly complex | ✅ | LM-driven harness built for SWE-bench: edit state, command execution, and issue-focused loop—the reference agent stack next to the benchmark itself. |
| 7 | Claude Agent SDK mcpmemorypythontypescript | 7.4k | complex | ✅ | Official Anthropic SDK (Python + [TypeScript](https://github.com/anthropics/claude-agent-sdk-typescript), [demos](https://github.com/anthropics/claude-agent-sdk-demos), [quickstarts](https://github.com/anthropics/claude-quickstarts)): built-in tools, MCP, long-running coding agents with session bridging. |
| 8 | RepoMaster workflowpython | 529 | slightly complex | ❓ | Repo-scoped research harness: builds function-call and module-dependency graphs to explore only what's needed; large relative gains on MLE-bench and GitTaskBench with lower token use. |
| 9 | AutoHarness memorymulti-agentprovider-agnosticpython | 326 | super simple | ✅ | Lightweight governance harness: wraps any LLM client in ~2 lines for automated harness engineering—6–14 step pipeline, YAML constitution, risk-pattern matching, session persistence with cost tracking, multi-agent profiles. |
| 10 | pmstack evals | 2 | super simple | ✅ | Claude Code config for AI product managers: CLAUDE.md plus skills for competitive analysis, PRD-from-signal, metric frameworks, stakeholder briefs, and agent eval design. "GStack for PMs." |