<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Llm on Hi, I&#39;m Muhammad Amal</title>
    <link>https://muhammadamal.my.id/tags/llm/</link>
    <description>Recent content in Llm on Hi, I&#39;m Muhammad Amal</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Wed, 10 Sep 2025 09:00:00 +0700</lastBuildDate>
    <atom:link href="https://muhammadamal.my.id/tags/llm/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Securing RAG Systems Against Data Exfiltration in 2025</title>
      <link>https://muhammadamal.my.id/blog/securing-rag-against-data-exfiltration-2025/</link>
      <pubDate>Wed, 10 Sep 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/securing-rag-against-data-exfiltration-2025/</guid>
      <description>Practical controls that stop the most common RAG exfiltration vectors without breaking retrieval quality.</description>
    </item>
    <item>
      <title>Advanced Prompt Injection Defenses in 2025, A Practical Guide</title>
      <link>https://muhammadamal.my.id/blog/advanced-prompt-injection-defenses-2025-practical-guide/</link>
      <pubDate>Mon, 01 Sep 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/advanced-prompt-injection-defenses-2025-practical-guide/</guid>
      <description>Layered prompt injection defenses that actually hold up in production, with code, diagrams, and the failure modes nobody talks about.</description>
    </item>
    <item>
      <title>LLM Red Teaming, Practical Techniques for 2024</title>
      <link>https://muhammadamal.my.id/blog/llm-red-teaming-practical-techniques-2024/</link>
      <pubDate>Wed, 30 Oct 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-red-teaming-practical-techniques-2024/</guid>
      <description>How to run an LLM red team that produces actionable findings instead of party tricks, with attack inventory and triage flow.</description>
    </item>
    <item>
      <title>Securing RAG Systems Against Data Exfiltration</title>
      <link>https://muhammadamal.my.id/blog/securing-rag-systems-against-data-exfiltration/</link>
      <pubDate>Wed, 23 Oct 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/securing-rag-systems-against-data-exfiltration/</guid>
      <description>How to design RAG systems so that prompt injection and over-eager retrieval don&amp;rsquo;t become an exfiltration channel.</description>
    </item>
    <item>
      <title>Prompt Injection Defenses in LLM Apps, Patterns for 2024</title>
      <link>https://muhammadamal.my.id/blog/prompt-injection-defenses-llm-apps-2024/</link>
      <pubDate>Mon, 07 Oct 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/prompt-injection-defenses-llm-apps-2024/</guid>
      <description>Hardening patterns for prompt injection across system prompts, tools, and retrieval, with code and policy guidance.</description>
    </item>
    <item>
      <title>Evaluating RAG, Beyond Vibes-Based Testing</title>
      <link>https://muhammadamal.my.id/blog/rag-evaluation-ragas-trulens-deepeval/</link>
      <pubDate>Mon, 26 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-evaluation-ragas-trulens-deepeval/</guid>
      <description>Ragas, TruLens, DeepEval — measuring RAG quality. Faithfulness, context precision, answer relevance. CI integration without LLM-as-judge bills.</description>
    </item>
    <item>
      <title>Why Naive RAG Fails in Production, A 2024 Reality Check</title>
      <link>https://muhammadamal.my.id/blog/naive-rag-failures-production-2024/</link>
      <pubDate>Fri, 02 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/naive-rag-failures-production-2024/</guid>
      <description>Naive RAG breaks in prod. Recall gaps, chunk boundaries, stale data. What the 2024 RAG stack changed and where the demo-to-prod gap still hides.</description>
    </item>
    <item>
      <title>The 2023 LLM Tooling Retrospective, What Actually Changed About My Workflow</title>
      <link>https://muhammadamal.my.id/blog/2023-llm-tooling-retrospective/</link>
      <pubDate>Wed, 27 Dec 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/2023-llm-tooling-retrospective/</guid>
      <description>Which 2023 LLM tools actually earned their place in a senior engineer&amp;rsquo;s daily workflow, and which got dropped.</description>
    </item>
    <item>
      <title>LLM Vendor Risk, A Failover Playbook After the OpenAI Weekend</title>
      <link>https://muhammadamal.my.id/blog/llm-vendor-risk-failover-strategy/</link>
      <pubDate>Thu, 30 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-vendor-risk-failover-strategy/</guid>
      <description>A failover playbook for LLM apps after the OpenAI weekend — multi-provider routing, abstraction layers, and what&amp;rsquo;s worth doing.</description>
    </item>
    <item>
      <title>LangChain LCEL vs LlamaIndex, Picking a Framework in Late 2023</title>
      <link>https://muhammadamal.my.id/blog/langchain-lcel-vs-llamaindex-routing/</link>
      <pubDate>Tue, 28 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/langchain-lcel-vs-llamaindex-routing/</guid>
      <description>Picking between LangChain LCEL and LlamaIndex in late 2023 — orchestration vs retrieval, when to use each, and where they overlap.</description>
    </item>
    <item>
      <title>Claude 2.1 vs GPT-4 Turbo, A Side-by-Side at 100K Context</title>
      <link>https://muhammadamal.my.id/blog/claude-2-1-200k-context-comparison/</link>
      <pubDate>Fri, 24 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/claude-2-1-200k-context-comparison/</guid>
      <description>Side-by-side notes on Claude 2.1 200K vs GPT-4 Turbo 128K — long-context recall, document QA, function calling, and production fit.</description>
    </item>
    <item>
      <title>LLM Observability in Practice, Logs, Traces, and a Useful Dashboard</title>
      <link>https://muhammadamal.my.id/blog/llm-observability-monitoring-dashboard/</link>
      <pubDate>Wed, 22 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-observability-monitoring-dashboard/</guid>
      <description>Practical LLM observability — what to log, what to alert on, and what a useful dashboard for a RAG system looks like.</description>
    </item>
    <item>
      <title>Putting a RAG Evaluation Pipeline in CI, The Setup I Actually Use</title>
      <link>https://muhammadamal.my.id/blog/rag-evaluation-pipeline-ci/</link>
      <pubDate>Mon, 20 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-evaluation-pipeline-ci/</guid>
      <description>A practical RAG eval setup wired into CI — retrieval and generation metrics, golden questions, and catching silent regressions.</description>
    </item>
    <item>
      <title>Hybrid Retrieval with pgvector and BM25, A Practical Walkthrough</title>
      <link>https://muhammadamal.my.id/blog/hybrid-retrieval-pgvector-bm25/</link>
      <pubDate>Thu, 16 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/hybrid-retrieval-pgvector-bm25/</guid>
      <description>Building hybrid retrieval on Postgres with pgvector 0.5 and BM25 — schema, query, score fusion, and trade-offs vs managed vector DBs.</description>
    </item>
    <item>
      <title>Securing an Internal LLM Chatbot, Threats, Boundaries, and What I Got Wrong</title>
      <link>https://muhammadamal.my.id/blog/securing-internal-llm-chatbot-data/</link>
      <pubDate>Tue, 14 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/securing-internal-llm-chatbot-data/</guid>
      <description>A practical guide to securing internal LLM chatbots — prompt injection, leakage, access control, and the gaps people miss.</description>
    </item>
    <item>
      <title>The OpenAI Assistants API in Production, A Cautious Take</title>
      <link>https://muhammadamal.my.id/blog/openai-assistants-api-production-review/</link>
      <pubDate>Fri, 10 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/openai-assistants-api-production-review/</guid>
      <description>An honest production review of the OpenAI Assistants API beta — what it solves, where it falls short, and why I&amp;rsquo;m cautious about adoption.</description>
    </item>
    <item>
      <title>Migrating to GPT-4 Turbo, What 128K Context Actually Changes</title>
      <link>https://muhammadamal.my.id/blog/gpt-4-turbo-128k-context-migration/</link>
      <pubDate>Wed, 08 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/gpt-4-turbo-128k-context-migration/</guid>
      <description>Migrating a production RAG chatbot from gpt-3.5-turbo-16k to GPT-4 Turbo 128K — cost, latency, and when context matters.</description>
    </item>
    <item>
      <title>Shipping an Internal RAG Chatbot with LlamaIndex 0.8, What Actually Matters</title>
      <link>https://muhammadamal.my.id/blog/internal-rag-chatbot-llamaindex/</link>
      <pubDate>Thu, 02 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/internal-rag-chatbot-llamaindex/</guid>
      <description>Lessons from building a production internal RAG chatbot with LlamaIndex 0.8 — retrieval design, chunking, and the plumbing that makes it work.</description>
    </item>
    <item>
      <title>LangChain 0.0.13x, The Framework, the Hype, and the Real Engineering Tradeoffs</title>
      <link>https://muhammadamal.my.id/blog/langchain-framework-intro/</link>
      <pubDate>Thu, 27 Apr 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/langchain-framework-intro/</guid>
      <description>A senior engineer&amp;rsquo;s view of LangChain 0.0.13x - what it actually does, where it earns its complexity, and where you should write the code yourself instead.</description>
    </item>
    <item>
      <title>Error Handling and Retries for LLM APIs</title>
      <link>https://muhammadamal.my.id/blog/llm-error-handling-retries/</link>
      <pubDate>Fri, 27 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-error-handling-retries/</guid>
      <description>OpenAI error handling: transient vs permanent, backoff, fallbacks, keep service up.</description>
    </item>
    <item>
      <title>LLM Cost Control and Token Budgets</title>
      <link>https://muhammadamal.my.id/blog/llm-cost-control-token-budgets/</link>
      <pubDate>Tue, 24 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/llm-cost-control-token-budgets/</guid>
      <description>LLM cost control: budgets, compression, caching, model selection, alerts on runaway.</description>
    </item>
    <item>
      <title>Streaming Responses from LLM APIs</title>
      <link>https://muhammadamal.my.id/blog/streaming-llm-responses-sse/</link>
      <pubDate>Fri, 20 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/streaming-llm-responses-sse/</guid>
      <description>Stream OpenAI responses via SSE. UX matters, Python + Node patterns, proxy + CDN gotchas.</description>
    </item>
    <item>
      <title>Few-Shot Prompting and In-Context Learning</title>
      <link>https://muhammadamal.my.id/blog/few-shot-prompting-in-context/</link>
      <pubDate>Tue, 17 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/few-shot-prompting-in-context/</guid>
      <description>Few-shot prompting: 2-3 examples beat long instructions. Cost trade-offs, example selection, where it wins.</description>
    </item>
    <item>
      <title>Prompt Engineering Basics for Engineers</title>
      <link>https://muhammadamal.my.id/blog/prompt-engineering-basics-engineers/</link>
      <pubDate>Fri, 13 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/prompt-engineering-basics-engineers/</guid>
      <description>Prompt engineering for engineers: structure, role priming, examples, schema. From 50% to 90% accuracy.</description>
    </item>
    <item>
      <title>Calling OpenAI from Node.js</title>
      <link>https://muhammadamal.my.id/blog/openai-nodejs-integration-2023/</link>
      <pubDate>Tue, 10 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/openai-nodejs-integration-2023/</guid>
      <description>Node + OpenAI in 2023: SDK, prompt templates, Zod validation, p-retry, production patterns.</description>
    </item>
    <item>
      <title>Calling OpenAI from Python, Patterns and Pitfalls</title>
      <link>https://muhammadamal.my.id/blog/openai-python-patterns-pitfalls/</link>
      <pubDate>Fri, 06 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/openai-python-patterns-pitfalls/</guid>
      <description>Python + OpenAI in 2023: SDK, prompt templates, JSON parsing, retries, async, production pitfalls.</description>
    </item>
    <item>
      <title>Why Every Backend Needs an LLM Integration in 2023</title>
      <link>https://muhammadamal.my.id/blog/why-llm-integration-backend-2023/</link>
      <pubDate>Tue, 03 Jan 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/why-llm-integration-backend-2023/</guid>
      <description>Why backend engineers should integrate LLMs in 2023. Real cases beyond chatbots, OpenAI today, realistic path.</description>
    </item>
  </channel>
</rss>
