My Always-On AI Agent System: Telegram, Ollama, and an Obsidian Vault on a Mac Studio
I built a 6-agent AI system that runs 24/7 on my Mac Studio. Telegram for input, Ollama for inference, Obsidian for memory. Here’s the full architecture — ho...
This looks risky. This looks like it’s only for engineers. That’s exactly what I thought — and exactly what AI is solving right now.
Read Article →I built a 6-agent AI system that runs 24/7 on my Mac Studio. Telegram for input, Ollama for inference, Obsidian for memory. Here’s the full architecture — ho...
Leaderboard scores don’t tell you which models work for AI agents. I tested 5 local models on my M2 Max for real agent tasks — orchestration, coding, researc...
Theory says hybrid LLM routing saves money. I built a system that actually does it — 6 AI agents, 3 local models, 1 cloud API, running 24/7 on a Mac Studio. ...
Llama 3.3 70B is the most capable open-source model you can run at home — but it demands serious hardware. Here’s exactly what you need, what to expect, and ...
Stop sending everything to GPT-4. Five factors decide whether a task should run locally or hit a cloud API — here’s the framework to make that call in 30 sec...
A 5-person dev team was spending $2,000/month on LLM APIs. After applying these 7 techniques, they cut it to $400 — without losing output quality. Here’s exa...
Most teams route every AI task to GPT-4 or Claude. That’s like hiring a senior engineer to do data entry. Here’s the hybrid architecture that cuts API bills ...