<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Observability on Hi, I&#39;m Muhammad Amal</title>
    <link>https://muhammadamal.my.id/tags/observability/</link>
    <description>Recent content in Observability on Hi, I&#39;m Muhammad Amal</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Fri, 22 Aug 2025 09:00:00 +0700</lastBuildDate>
    <atom:link href="https://muhammadamal.my.id/tags/observability/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Observability for n8n in 2025, Metrics, Logs, and Traces</title>
      <link>https://muhammadamal.my.id/blog/observability-for-n8n-2025-metrics-logs-traces/</link>
      <pubDate>Fri, 22 Aug 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/observability-for-n8n-2025-metrics-logs-traces/</guid>
      <description>Build production observability for n8n with Prometheus metrics, structured logs, OpenTelemetry traces, and the dashboards that catch incidents early.</description>
    </item>
    <item>
      <title>Reading pg_stat_io and Modern Postgres Internals</title>
      <link>https://muhammadamal.my.id/blog/reading-pg-stat-io-and-modern-postgres-internals/</link>
      <pubDate>Mon, 23 Jun 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/reading-pg-stat-io-and-modern-postgres-internals/</guid>
      <description>A deep dive on pg_stat_io in PostgreSQL 17, explaining buffer cache mechanics, IO contexts, and how to diagnose storage and memory pressure from the view.</description>
    </item>
    <item>
      <title>AIOps in May 2025, What Actually Works in Production</title>
      <link>https://muhammadamal.my.id/blog/aiops-in-may-2025-what-actually-works-in-production/</link>
      <pubDate>Mon, 05 May 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/aiops-in-may-2025-what-actually-works-in-production/</guid>
      <description>Field notes on AIOps in production, what to adopt, what to defer, and where LLMs earn their keep on the platform team.</description>
    </item>
    <item>
      <title>Observability for Edge Fleets at Scale, Patterns That Work</title>
      <link>https://muhammadamal.my.id/blog/observability-edge-fleets-at-scale-patterns-that-work/</link>
      <pubDate>Wed, 30 Apr 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/observability-edge-fleets-at-scale-patterns-that-work/</guid>
      <description>Production observability for edge fleets. OpenTelemetry Collector at the edge, local aggregation, smart sampling, and alerts that survive WAN outages.</description>
    </item>
    <item>
      <title>Observability for Multi Agent Systems, LangSmith and Phoenix in 2025</title>
      <link>https://muhammadamal.my.id/blog/observability-for-multi-agent-systems-langsmith-and-phoenix-in-2025/</link>
      <pubDate>Wed, 26 Mar 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/observability-for-multi-agent-systems-langsmith-and-phoenix-in-2025/</guid>
      <description>Hands-on observability for multi-agent systems with LangSmith, Phoenix, OpenTelemetry, plus what to alert on and what to ignore.</description>
    </item>
    <item>
      <title>SLOs and Error Budgets That Engineers Actually Use</title>
      <link>https://muhammadamal.my.id/blog/slos-error-budgets-engineers-actually-use/</link>
      <pubDate>Wed, 05 Jun 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/slos-error-budgets-engineers-actually-use/</guid>
      <description>SLOs are easy to write and hard to use. Here&#39;s how to build budgets your team will reach for during incidents and planning, not just QBRs.</description>
    </item>
    <item>
      <title>Rust Service Observability in 2024, Metrics, Logs, and Traces That Help</title>
      <link>https://muhammadamal.my.id/blog/rust-service-observability-metrics-logs-traces-2024/</link>
      <pubDate>Mon, 25 Mar 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rust-service-observability-metrics-logs-traces-2024/</guid>
      <description>Production observability for Rust services with the metrics crate, Prometheus, tracing, and OpenTelemetry.</description>
    </item>
    <item>
      <title>Measuring Developer Experience, DORA Metrics in Practice</title>
      <link>https://muhammadamal.my.id/blog/developer-experience-metrics-dora-2024/</link>
      <pubDate>Mon, 22 Jan 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/developer-experience-metrics-dora-2024/</guid>
      <description>DX metrics that survive a year. DORA, SPACE, and the practical instrumentation for a platform team in 2024 without surveillance optics.</description>
    </item>
    <item>
      <title>LLM Observability in Practice, Logs, Traces, and a Useful Dashboard</title>
      <link>https://muhammadamal.my.id/blog/llm-observability-monitoring-dashboard/</link>
      <pubDate>Wed, 22 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-observability-monitoring-dashboard/</guid>
      <description>Practical LLM observability — what to log, what to alert on, and what a useful dashboard for a RAG system looks like.</description>
    </item>
    <item>
      <title>LLM Cost Control and Token Budgets</title>
      <link>https://muhammadamal.my.id/blog/llm-cost-control-token-budgets/</link>
      <pubDate>Tue, 24 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-cost-control-token-budgets/</guid>
      <description>LLM cost control: budgets, compression, caching, model selection, and alerts on runaway spend.</description>
    </item>
    <item>
      <title>September Retro, One Stack to Watch Them All</title>
      <link>https://muhammadamal.my.id/blog/september-retro-observability/</link>
      <pubDate>Fri, 30 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/september-retro-observability/</guid>
      <description>September retro: full obs stack for factory. Wins, costs, cuts.</description>
    </item>
    <item>
      <title>Prometheus Cardinality and Cost Control</title>
      <link>https://muhammadamal.my.id/blog/prometheus-cardinality-cost/</link>
      <pubDate>Wed, 28 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/prometheus-cardinality-cost/</guid>
      <description>Control Prometheus cost: cardinality limits, drop high-card metrics, recording rules, retention math.</description>
    </item>
    <item>
      <title>Service-Level Objectives in Practice</title>
      <link>https://muhammadamal.my.id/blog/slo-practice/</link>
      <pubDate>Fri, 23 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/slo-practice/</guid>
      <description>SLOs in practice: SLIs, targets, error budgets from Prometheus, org dynamics.</description>
    </item>
    <item>
      <title>Tempo for Distributed Tracing</title>
      <link>https://muhammadamal.my.id/blog/tempo-distributed-tracing/</link>
      <pubDate>Wed, 21 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/tempo-distributed-tracing/</guid>
      <description>Tempo 1.5: distributed tracing, OTel SDK for Go/Node, sampling, exploration in Grafana.</description>
    </item>
    <item>
      <title>Loki for Logs, Self-Hosted Aggregation</title>
      <link>https://muhammadamal.my.id/blog/loki-log-aggregation/</link>
      <pubDate>Fri, 16 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/loki-log-aggregation/</guid>
      <description>Loki 2.6: self-hosted logs, label index, LogQL, retention. Cheaper than ELK; where it doesn&#39;t fit.</description>
    </item>
    <item>
      <title>Alerting with Prometheus Alertmanager</title>
      <link>https://muhammadamal.my.id/blog/prometheus-alertmanager/</link>
      <pubDate>Wed, 14 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/prometheus-alertmanager/</guid>
      <description>Alertmanager: routing, grouping, inhibition, silencing, Slack/PD. Useful, not noisy.</description>
    </item>
    <item>
      <title>Grafana Dashboards That Actually Help</title>
      <link>https://muhammadamal.my.id/blog/grafana-dashboards-that-help/</link>
      <pubDate>Mon, 12 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/grafana-dashboards-that-help/</guid>
      <description>Grafana dashboards engineers use, not just look at. RED/USE, hierarchy, 6-panel rule.</description>
    </item>
    <item>
      <title>Instrumenting Node.js Services for Prometheus</title>
      <link>https://muhammadamal.my.id/blog/node-prometheus-instrumentation/</link>
      <pubDate>Fri, 09 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/node-prometheus-instrumentation/</guid>
      <description>Node + Prometheus via prom-client: defaults, HTTP middleware, business metrics, Express/Fastify.</description>
    </item>
    <item>
      <title>Instrumenting Go Services for Prometheus</title>
      <link>https://muhammadamal.my.id/blog/go-prometheus-instrumentation/</link>
      <pubDate>Wed, 07 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/go-prometheus-instrumentation/</guid>
      <description>Go + Prometheus: client_golang, counters, histograms, HTTP middleware, runtime metrics.</description>
    </item>
    <item>
      <title>Prometheus 101, Metrics, Scraping, and PromQL</title>
      <link>https://muhammadamal.my.id/blog/prometheus-101-metrics-promql/</link>
      <pubDate>Mon, 05 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/prometheus-101-metrics-promql/</guid>
      <description>Prometheus 2.37 basics: metric types, scraping, labels, PromQL with backend examples.</description>
    </item>
    <item>
      <title>Building an Observability Stack in 2022</title>
      <link>https://muhammadamal.my.id/blog/observability-stack-2022/</link>
      <pubDate>Fri, 02 Sep 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/observability-stack-2022/</guid>
      <description>Self-host obs stack 2022: Prometheus + Grafana + Loki + Tempo + Alertmanager. Metrics, logs, traces.</description>
    </item>
    <item>
      <title>Monitoring n8n in Production</title>
      <link>https://muhammadamal.my.id/blog/n8n-monitoring-production/</link>
      <pubDate>Fri, 27 May 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/n8n-monitoring-production/</guid>
      <description>Monitor n8n in prod: health checks, execution rates, Prometheus, log aggregation, useful dashboards.</description>
    </item>
    <item>
      <title>Structured Logging in Rust with tracing</title>
      <link>https://muhammadamal.my.id/blog/rust-structured-logging-tracing/</link>
      <pubDate>Wed, 23 Mar 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rust-structured-logging-tracing/</guid>
      <description>Rust structured logging with tracing. Spans, events, request context, JSON output, production patterns.</description>
    </item>
    <item>
      <title>Structured Logging in Go Microservices with Zap</title>
      <link>https://muhammadamal.my.id/blog/go-structured-logging-zap/</link>
      <pubDate>Fri, 21 Jan 2022 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/go-structured-logging-zap/</guid>
      <description>Structured logging in Go with zap. JSON output, request-scoped fields, sampling, and the pattern that survives production.</description>
    </item>
  </channel>
</rss>
