OpenClaw vs Hermes: the always-on personal-agent debate

The loudest harness argument of 2026. Both are open-source (MIT), self-hosted, always-on personal agents you talk to from chat apps — and they're two of the most-starred projects on this entire list. The flame war treats them as rivals; the architecture says they're opposite design philosophies that happen to share a category.

OpenClaw Hermes
⭐ Stars 380k 199k
Steward OpenClaw Foundation (community) Nous Research
License MIT MIT
Core language TypeScript Python
The bet Presence — one event loop where messages, heartbeats, crons, and webhooks are a single input queue feeding one conversation; it accumulates unbounded memory of you and feels like a person Discipline — separated execution domains: sandboxed cron, a deliberately bounded (~3k-char) user model, script-gated wake-ups; it feels like a harness, by design
Proactivity Native HEARTBEAT — background events can inject into the main conversation No heartbeat; cron runs are isolated from the main conversation, but a wakeAgent script gate decides whether a check is worth waking the LLM at all
Ecosystem 13,700+ community skills (ClawHub), the most channels and integrations, native multi-agent Lean: skills are mostly self-generated by the learning loop; checkpoint/rollback built in
Security posture Relaxed by default — "free reign," users sandbox it themselves Restrictive by default — permission prompts, confined script locations, per-domain isolation (users report it "works on the third try")
Config surface Fully drivable from chat Several commands are CLI/TUI-only — hard to administer from Telegram alone
Release tempo 82 releases and counting; fast-moving, frequently breaking 6 releases at last count; conservative
Tokens per request ~48k avg (one user's OpenRouter telemetry) ~62k avg, same source — heavier per request, lighter when idle
Autonomy (list axis) headless headless
Recovery (list axis) resumable resumable (checkpoint/rollback is a marquee feature)

Stars as captured for the main list (see README for the capture date).

What the field reports actually say

Primary-source notes from the debate threads (r/openclaw, r/hermesagent, r/AskClaw, r/AI_Agents — links inline), claims attributed as claims:

The billing earthquake (April → June 2026)

What the ops threads add — the cost problem is usage shape, not harness choice:

Our read

Forget the flame war's framing of better/worse. OpenClaw optimizes for presence; Hermes optimizes for discipline — almost every observed difference (heartbeat vs sandboxed cron, unbounded vs bounded memory, chat-everything vs CLI-first admin, relaxed vs restrictive security) falls out of that one fork in the design tree. Pick by which failure you'd rather live with: an exuberant agent you have to contain and patch weekly, or a careful one you have to coax and can't fully drive from your phone. If you genuinely can't choose, the orchestrator/executor + mutual-repair pattern is real, ACP makes it practical, and ~+30% cost is a fair price for redundancy. And whatever you pick, your bill is determined by questions 3–5 of How to pick a harness, not by this page.


Part of best-of-Agent-Harnesses. New to this decision? Start with How to pick a harness. Spot an error or a stale claim? Open an issue.