Guides

Hands-on walkthroughs that go deeper than a single tool. Each one answers a real question a working developer has, with worked numbers and concrete steps, and links to the tools here that do the work.

LLM How much VRAM do you need to run Llama 3 or Gemma locally? The real math behind local LLM memory: weights, KV cache, and overhead, with worked numbers for Llama 3 8B and Gemma 2 9B. June 22, 2026
LLM Self-Hosting a Local LLM vs Paying for an API: Where's the Break-Even? When does self-hosting an LLM beat a pay-per-token API? The real break-even math on GPU cost, throughput, and volume, with a calculator that does it for you. June 22, 2026