<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Aiops on Hi, I&#39;m Muhammad Amal</title>
    <link>https://muhammadamal.my.id/tags/aiops/</link>
    <description>Recent content in Aiops on Hi, I&#39;m Muhammad Amal</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Fri, 23 May 2025 09:00:00 +0700</lastBuildDate>
    <atom:link href="https://muhammadamal.my.id/tags/aiops/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Postmortem Automation with LLMs, Drafts That Don&#39;t Lie</title>
      <link>https://muhammadamal.my.id/blog/postmortem-automation-with-llms-drafts-that-dont-lie/</link>
      <pubDate>Fri, 23 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/postmortem-automation-with-llms-drafts-that-dont-lie/</guid>
      <description>A draft-only postmortem pipeline that respects timestamps, refuses to invent causes, and produces a blameless template a human can finish in 30 minutes.</description>
    </item>
    <item>
      <title>Chaos Engineering with AI Augmented Hypotheses</title>
      <link>https://muhammadamal.my.id/blog/chaos-engineering-with-ai-augmented-hypotheses/</link>
      <pubDate>Wed, 21 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/chaos-engineering-with-ai-augmented-hypotheses/</guid>
      <description>AI-proposed chaos hypotheses, human-approved blast radii, and LitmusChaos execution on Kubernetes 1.32 with rollback on SLO breach.</description>
    </item>
    <item>
      <title>SLOs and Burn Rate Alerting in 2025, A Practical Guide</title>
      <link>https://muhammadamal.my.id/blog/slos-and-burn-rate-alerting-in-2025-a-practical-guide/</link>
      <pubDate>Mon, 19 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/slos-and-burn-rate-alerting-in-2025-a-practical-guide/</guid>
      <description>Practical SLO design, error budget math, and multi-window burn rate alerting rules ready to paste into Prometheus 3.0.</description>
    </item>
    <item>
      <title>Incident Response Automation with LangGraph, A Step by Step Tutorial</title>
      <link>https://muhammadamal.my.id/blog/incident-response-automation-with-langgraph-a-step-by-step-tutorial/</link>
      <pubDate>Fri, 16 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/incident-response-automation-with-langgraph-a-step-by-step-tutorial/</guid>
      <description>Treat incident response as a typed state machine in LangGraph 0.2, with deterministic transitions, audit logging, and bounded LLM use.</description>
    </item>
    <item>
      <title>Anomaly Detection on Prometheus Metrics, A Hands On Guide</title>
      <link>https://muhammadamal.my.id/blog/anomaly-detection-on-prometheus-metrics-a-hands-on-guide/</link>
      <pubDate>Wed, 14 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/anomaly-detection-on-prometheus-metrics-a-hands-on-guide/</guid>
      <description>A working senior SRE&amp;rsquo;s tour through metric anomaly detection, from cheap z-score rules to isolation forest sidecars on Prometheus 3.0.</description>
    </item>
    <item>
      <title>Building an SRE Copilot for On Call Engineers</title>
      <link>https://muhammadamal.my.id/blog/building-an-sre-copilot-for-on-call-engineers/</link>
      <pubDate>Mon, 12 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/building-an-sre-copilot-for-on-call-engineers/</guid>
      <description>A senior backend engineer&amp;rsquo;s design for an LLM-powered on-call assistant with tool use, context shaping, and a read-only blast radius.</description>
    </item>
    <item>
      <title>AI Driven Log Analysis at Scale, A Production Tutorial</title>
      <link>https://muhammadamal.my.id/blog/ai-driven-log-analysis-at-scale-a-production-tutorial/</link>
      <pubDate>Fri, 09 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/ai-driven-log-analysis-at-scale-a-production-tutorial/</guid>
      <description>A production pattern for AI log analysis using template mining, vector retrieval, and bounded LLM summarization on Loki 3.3.</description>
    </item>
    <item>
      <title>Auto Remediation Pipelines with LLM Agents and Argo Events</title>
      <link>https://muhammadamal.my.id/blog/auto-remediation-pipelines-with-llm-agents-and-argo-events/</link>
      <pubDate>Wed, 07 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/auto-remediation-pipelines-with-llm-agents-and-argo-events/</guid>
      <description>A practical walkthrough of LLM-proposed, deterministically-executed remediation using Argo Events and Argo Workflows on Kubernetes 1.32.</description>
    </item>
    <item>
      <title>AIOps in May 2025, What Actually Works in Production</title>
      <link>https://muhammadamal.my.id/blog/aiops-in-may-2025-what-actually-works-in-production/</link>
      <pubDate>Mon, 05 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/aiops-in-may-2025-what-actually-works-in-production/</guid>
      <description>Field notes on AIOps in production, what to adopt, what to defer, and where LLMs earn their keep on the platform team.</description>
    </item>
  </channel>
</rss>
