Programming
January 15, 2025 · 9 min read
Serving SLMs at Scale with vLLM, A Production Guide
January 13, 2025 · 10 min read
llama.cpp Deep Dive, Quantization, GGUF, and Inference Speed
January 8, 2025 · 9 min read
Running SLMs Locally with Ollama, A Step by Step Tutorial
January 6, 2025 · 10 min read
Small Language Models in January 2025, A Practical Survey
December 20, 2024 · 9 min read
Lessons From a Year of Rust, Postgres, and AI Agents
December 18, 2024 · 9 min read
Predictions for 2025, Platform Engineering and Agentic AI
December 16, 2024 · 8 min read
The 2024 Wrap Up, The Agentic Era for Backend Engineers
December 13, 2024 · 8 min read
Roadmaps That Survive Contact with Reality
December 11, 2024 · 9 min read
Cost Justifying Platform Investments, The CFO Friendly Pitch
December 9, 2024 · 9 min read
Communicating Tradeoffs to Non Engineers Without Dumbing Down
December 6, 2024 · 8 min read
Reading the Room, Navigating Politics in Technical Decisions
December 4, 2024 · 8 min read