<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Sre on Hi, I&#39;m Muhammad Amal</title>
    <link>https://muhammadamal.my.id/tags/sre/</link>
    <description>Recent content in Sre on Hi, I&#39;m Muhammad Amal</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Mon, 12 May 2025 09:00:00 +0700</lastBuildDate>
    <atom:link href="https://muhammadamal.my.id/tags/sre/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Building an SRE Copilot for On Call Engineers</title>
      <link>https://muhammadamal.my.id/blog/building-an-sre-copilot-for-on-call-engineers/</link>
      <pubDate>Mon, 12 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/building-an-sre-copilot-for-on-call-engineers/</guid>
      <description>A senior backend engineer&amp;rsquo;s design for an LLM-powered on-call assistant with tool use, context shaping, and a read-only blast radius.</description>
    </item>
    <item>
      <title>AIOps in May 2025, What Actually Works in Production</title>
      <link>https://muhammadamal.my.id/blog/aiops-in-may-2025-what-actually-works-in-production/</link>
      <pubDate>Mon, 05 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/aiops-in-may-2025-what-actually-works-in-production/</guid>
      <description>Field notes on AIOps in production, what to adopt, what to defer, and where LLMs earn their keep on the platform team.</description>
    </item>
    <item>
      <title>Synthetic Monitoring and Canary Deploys, A Practical Pairing</title>
      <link>https://muhammadamal.my.id/blog/synthetic-monitoring-and-canary-deploys/</link>
      <pubDate>Wed, 26 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/synthetic-monitoring-and-canary-deploys/</guid>
      <description>Canaries catch regressions. Synthetics catch silent failures. Wire them together and you get progressive delivery that knows when to roll back without a human.</description>
    </item>
    <item>
      <title>Blameless Postmortems That Actually Change Behavior</title>
      <link>https://muhammadamal.my.id/blog/blameless-postmortems-that-actually-change-behavior/</link>
      <pubDate>Mon, 24 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/blameless-postmortems-that-actually-change-behavior/</guid>
      <description>Blameless doesn&amp;rsquo;t mean toothless. A postmortem template, the five questions that matter, and the followup ritual that closes the loop.</description>
    </item>
    <item>
      <title>Service Mesh Resilience, Istio Ambient vs Linkerd in 2024</title>
      <link>https://muhammadamal.my.id/blog/service-mesh-resilience-istio-ambient-vs-linkerd-2024/</link>
      <pubDate>Wed, 19 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/service-mesh-resilience-istio-ambient-vs-linkerd-2024/</guid>
      <description>Istio Ambient just hit GA. Linkerd 2.15 is still the simplicity champion. Here&amp;rsquo;s how they compare for the resilience patterns that actually matter.</description>
    </item>
    <item>
      <title>eBPF Plus OpenTelemetry, The Observability Pairing for 2024</title>
      <link>https://muhammadamal.my.id/blog/ebpf-plus-opentelemetry-observability-2024/</link>
      <pubDate>Mon, 17 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/ebpf-plus-opentelemetry-observability-2024/</guid>
      <description>eBPF gives you kernel-truth signals without instrumenting code. OpenTelemetry gives you a vendor-neutral pipeline. Together they&amp;rsquo;re the cheapest observability you can stand up in 2024.</description>
    </item>
    <item>
      <title>Auto Remediation on Kubernetes, Argo Events and Policy as Code</title>
      <link>https://muhammadamal.my.id/blog/auto-remediation-kubernetes-argo-events-policy/</link>
      <pubDate>Wed, 12 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/auto-remediation-kubernetes-argo-events-policy/</guid>
      <description>Auto-remediation is high-leverage and high-risk. Argo Events plus Kyverno gives you declarative remediation. Here&amp;rsquo;s the pattern and the guardrails it needs.</description>
    </item>
    <item>
      <title>Chaos Engineering on Kubernetes, Litmus and Chaos Mesh in 2024</title>
      <link>https://muhammadamal.my.id/blog/chaos-engineering-kubernetes-litmus-chaos-mesh-2024/</link>
      <pubDate>Mon, 10 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/chaos-engineering-kubernetes-litmus-chaos-mesh-2024/</guid>
      <description>Litmus and Chaos Mesh have both matured. Here&amp;rsquo;s how to pick between them, the experiments worth running first, and the safety scaffolding nobody talks about.</description>
    </item>
    <item>
      <title>SLOs and Error Budgets That Engineers Actually Use</title>
      <link>https://muhammadamal.my.id/blog/slos-error-budgets-engineers-actually-use/</link>
      <pubDate>Wed, 05 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/slos-error-budgets-engineers-actually-use/</guid>
      <description>SLOs are easy to write and hard to use. Here&amp;rsquo;s how to build budgets your team will reach for during incidents and planning, not just QBRs.</description>
    </item>
    <item>
      <title>Digital Immune Systems for Engineers, What Gartner Got Right</title>
      <link>https://muhammadamal.my.id/blog/digital-immune-systems-for-engineers/</link>
      <pubDate>Mon, 03 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/digital-immune-systems-for-engineers/</guid>
      <description>Cutting through the analyst gloss on Digital Immune Systems. Six concrete pillars, the ones worth your time, and the ones that are just rebrands.</description>
    </item>
    <item>
      <title>Service-Level Objectives in Practice</title>
      <link>https://muhammadamal.my.id/blog/slo-practice/</link>
      <pubDate>Fri, 23 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/slo-practice/</guid>
      <description>SLOs in practice: SLIs, targets, error budgets from Prometheus, org dynamics.</description>
    </item>
  </channel>
</rss>
