A Shared Memory for Hermes and Claude Code
Hermes ships with a strong built-in memory system, but it lives inside Hermes. If you drive a second agent (in my case, Claude Code), the memory stays behind...
Hermes ships with a strong built-in memory system, but it lives inside Hermes. If you drive a second agent (in my case, Claude Code), the memory stays behind...
I packed my box for Hermes and it held almost nothing. Two weeks of OpenClaw, and the things worth carrying over fit in a single cardboard box. I spent the f...
The moment I stopped letting gemma4:26b write patch_file calls, my skill stopped breaking. The fix wasn’t a bigger model — it was a three-layer responsibilit...
I ran Hermes Agent for a full day for the first time. The five places I tripped aren’t bugs — they’re forks in the road every adopter walks through on day on...
Yesterday I collected real user stories about Hermes Agent. Today I’m walking through the actual setup — one command, a few prompts, and you’re in.
The setup guide I wrote yesterday was based on research. Today I ran the actual installer, connected it to Telegram, and tested whether the memory loop works...
I’ve spent the last three weeks wiring local LLMs into my daily work. Somewhere along the way, four distinct roles emerged — and the gaps between them told m...
Most OpenClaw guides tell you what files to create. None tell you what to actually write in them. After 6 weeks of daily iteration, here’s the configuration ...
I’ve been running OpenClaw daily for months. Hermes Agent keeps coming up. Instead of blindly switching, I went looking for what people who actually tried it...
This looks risky. This looks like it’s only for engineers. That’s exactly what I thought — and exactly what AI is solving right now.
I built a 6-agent AI system that runs 24/7 on my Mac Studio. Telegram for input, Ollama for inference, Obsidian for memory. Here’s the full architecture — ho...
Hermes ships with a strong built-in memory system, but it lives inside Hermes. If you drive a second agent (in my case, Claude Code), the memory stays behind...
I packed my box for Hermes and it held almost nothing. Two weeks of OpenClaw, and the things worth carrying over fit in a single cardboard box. I spent the f...
The moment I stopped letting gemma4:26b write patch_file calls, my skill stopped breaking. The fix wasn’t a bigger model — it was a three-layer responsibilit...
I ran Hermes Agent for a full day for the first time. The five places I tripped aren’t bugs — they’re forks in the road every adopter walks through on day on...
Yesterday I collected real user stories about Hermes Agent. Today I’m walking through the actual setup — one command, a few prompts, and you’re in.
The setup guide I wrote yesterday was based on research. Today I ran the actual installer, connected it to Telegram, and tested whether the memory loop works...
I’ve spent the last three weeks wiring local LLMs into my daily work. Somewhere along the way, four distinct roles emerged — and the gaps between them told m...
I’ve been running OpenClaw daily for months. Hermes Agent keeps coming up. Instead of blindly switching, I went looking for what people who actually tried it...
Stop manually restarting gateways. Wire up launchd WatchPaths once, and every AGENTS.md edit auto-reloads both OpenClaw gateways in 30 seconds.
Llama 3.3 70B is the most capable open-source model you can run at home — but it demands serious hardware. Here’s exactly what you need, what to expect, and ...
What are local LLMs, why would you run one, and how do you get started? A practical guide — primarily for Mac users — from zero to running your first AI mode...
You understand the hybrid LLM concept. Now build it. This is the complete implementation guide — from installing your local models to deploying a team-ready ...
A practical comparison of Ollama and LM Studio for running local LLMs. Features, performance, API compatibility, and which tool fits your workflow.
A step-by-step LM Studio setup guide for Mac and Windows to run local LLMs. No cloud, no API keys, no monthly bills.
Real benchmark data for running local LLMs on Apple Silicon. Token speeds, memory usage, and quality ratings for every Mac configuration from M2 Air to M4 Max.
If you’ve thought about switching the local LLM server on your Mac from Ollama to llama.cpp, there are two things that don’t show up in the obvious benchmark...
I built a 4-branch falsification looking for the iteration ceiling on a local 27B-class agent. The quality cliff didn’t surface. The wall-clock divergence di...
The community framing is simple: ‘Ollama is a ggml fork, llama.cpp is faster.’ I ran the same model blob through both runtimes on a Mac Studio M2 Max with th...
OpenClaw has been brittle since late March. On my local Ollama setup I can now point at the exact line of code, the exact minute it went wrong, and the three...
Most OpenClaw guides tell you what files to create. None tell you what to actually write in them. After 6 weeks of daily iteration, here’s the configuration ...
This looks risky. This looks like it’s only for engineers. That’s exactly what I thought — and exactly what AI is solving right now.
Stop sending everything to GPT-4. Five factors decide whether a task should run locally or hit a cloud API — here’s the framework to make that call in 30 sec...
A 5-person dev team was spending $2,000/month on LLM APIs. After applying these 7 techniques, they cut it to $400 — without losing output quality. Here’s exa...
Most teams route every AI task to GPT-4 or Claude. That’s like hiring a senior engineer to do data entry. Here’s the hybrid architecture that cuts API bills ...
GPT-4 costs $10-30 per million tokens. Llama 3.3 costs $0. But is the free option actually good enough? Here’s a side-by-side comparison across quality, spee...
You understand the hybrid LLM concept. Now build it. This is the complete implementation guide — from installing your local models to deploying a team-ready ...
Llama 3.3 70B is the most capable open-source model you can run at home — but it demands serious hardware. Here’s exactly what you need, what to expect, and ...
GPT-4 costs $10-30 per million tokens. Llama 3.3 costs $0. But is the free option actually good enough? Here’s a side-by-side comparison across quality, spee...
Real benchmark data for running local LLMs on Apple Silicon. Token speeds, memory usage, and quality ratings for every Mac configuration from M2 Air to M4 Max.
I built a 6-agent AI system that runs 24/7 on my Mac Studio. Telegram for input, Ollama for inference, Obsidian for memory. Here’s the full architecture — ho...
Leaderboard scores don’t tell you which models work for AI agents. I tested 5 local models on my M2 Max for real agent tasks — orchestration, coding, researc...
Theory says hybrid LLM routing saves money. I built a system that actually does it — 6 AI agents, 3 local models, 1 cloud API, running 24/7 on a Mac Studio. ...
Stop sending everything to GPT-4. Five factors decide whether a task should run locally or hit a cloud API — here’s the framework to make that call in 30 sec...
What are local LLMs, why would you run one, and how do you get started? A practical guide — primarily for Mac users — from zero to running your first AI mode...
Theory says hybrid LLM routing saves money. I built a system that actually does it — 6 AI agents, 3 local models, 1 cloud API, running 24/7 on a Mac Studio. ...
Most teams route every AI task to GPT-4 or Claude. That’s like hiring a senior engineer to do data entry. Here’s the hybrid architecture that cuts API bills ...
OpenClaw has been brittle since late March. On my local Ollama setup I can now point at the exact line of code, the exact minute it went wrong, and the three...
Stop manually restarting gateways. Wire up launchd WatchPaths once, and every AGENTS.md edit auto-reloads both OpenClaw gateways in 30 seconds.
Perplexity cut Pro Deep Research to 20 queries a month in February 2026, a 900x downgrade. I rebuilt the loop on Hermes Agent, Exa, x_search, one local model...
xAI shipped x_search as part of their Agent Tools API. The price is $0.005 per query. Twenty dollars of credits covers roughly 4,000 searches. Here is what t...
A step-by-step LM Studio setup guide for Mac and Windows to run local LLMs. No cloud, no API keys, no monthly bills.
A practical comparison of Ollama and LM Studio for running local LLMs. Features, performance, API compatibility, and which tool fits your workflow.
A 5-person dev team was spending $2,000/month on LLM APIs. After applying these 7 techniques, they cut it to $400 — without losing output quality. Here’s exa...
Leaderboard scores don’t tell you which models work for AI agents. I tested 5 local models on my M2 Max for real agent tasks — orchestration, coding, researc...
The moment I stopped letting gemma4:26b write patch_file calls, my skill stopped breaking. The fix wasn’t a bigger model — it was a three-layer responsibilit...
OpenClaw has been brittle since late March. On my local Ollama setup I can now point at the exact line of code, the exact minute it went wrong, and the three...
xAI shipped x_search as part of their Agent Tools API. The price is $0.005 per query. Twenty dollars of credits covers roughly 4,000 searches. Here is what t...
Perplexity cut Pro Deep Research to 20 queries a month in February 2026, a 900x downgrade. I rebuilt the loop on Hermes Agent, Exa, x_search, one local model...