<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>RAG on Hi, I&#39;m Muhammad Amal</title>
    <link>https://muhammadamal.my.id/tags/rag/</link>
    <description>Recent content in RAG on Hi, I&#39;m Muhammad Amal</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Mon, 03 Nov 2025 09:00:00 +0700</lastBuildDate>
    <atom:link href="https://muhammadamal.my.id/tags/rag/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>RAG Systems for Technical Support Teams in 2025</title>
      <link>https://muhammadamal.my.id/blog/rag-systems-for-technical-support-teams-in-2025/</link>
      <pubDate>Mon, 03 Nov 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-systems-for-technical-support-teams-in-2025/</guid>
      <description>A field-tested walkthrough of building retrieval-augmented generation for L1 through L3 support, with runnable Python, pgvector and Qdrant pipelines, and the failure modes nobody talks about.</description>
    </item>
    <item>
      <title>Securing RAG Systems Against Data Exfiltration in 2025</title>
      <link>https://muhammadamal.my.id/blog/securing-rag-against-data-exfiltration-2025/</link>
      <pubDate>Wed, 10 Sep 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/securing-rag-against-data-exfiltration-2025/</guid>
      <description>Practical controls that stop the most common RAG exfiltration vectors without breaking retrieval quality.</description>
    </item>
    <item>
      <title>Local RAG with SLMs, Private Knowledge Without the Cloud</title>
      <link>https://muhammadamal.my.id/blog/local-rag-with-slms-private-knowledge-without-cloud/</link>
      <pubDate>Mon, 27 Jan 2025 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/local-rag-with-slms-private-knowledge-without-cloud/</guid>
      <description>End-to-end local RAG, no cloud. Embeddings, vectors, retrieval, and grounded generation on a single box.</description>
    </item>
    <item>
      <title>Securing RAG Systems Against Data Exfiltration</title>
      <link>https://muhammadamal.my.id/blog/securing-rag-systems-against-data-exfiltration/</link>
      <pubDate>Wed, 23 Oct 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/securing-rag-systems-against-data-exfiltration/</guid>
      <description>How to design RAG systems so that prompt injection and over-eager retrieval don&#39;t become an exfiltration channel.</description>
    </item>
    <item>
      <title>Evaluating RAG, Beyond Vibes-Based Testing</title>
      <link>https://muhammadamal.my.id/blog/rag-evaluation-ragas-trulens-deepeval/</link>
      <pubDate>Mon, 26 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-evaluation-ragas-trulens-deepeval/</guid>
      <description>Ragas, TruLens, DeepEval — measuring RAG quality. Faithfulness, context precision, answer relevance. CI integration without LLM-as-judge bills.</description>
    </item>
    <item>
      <title>Re-ranking and Reciprocal Rank Fusion in RAG Pipelines</title>
      <link>https://muhammadamal.my.id/blog/rag-reranking-rrf-cohere-bge/</link>
      <pubDate>Wed, 21 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-reranking-rrf-cohere-bge/</guid>
      <description>Cross-encoder rerankers turn top-50 retrieval into clean top-5. Cohere Rerank vs BGE-reranker, latency budgets, where it slots in your RAG pipeline.</description>
    </item>
    <item>
      <title>Securing RAG, Per-User Document Access Without Re-indexing</title>
      <link>https://muhammadamal.my.id/blog/rag-security-access-control-multi-tenant/</link>
      <pubDate>Mon, 19 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-security-access-control-multi-tenant/</guid>
      <description>Multi-tenant RAG without leaks. Metadata filtering at retrieval, ACL design, audit trails, and prompt-side defenses for what filters miss.</description>
    </item>
    <item>
      <title>Hybrid Search, BM25 Plus Vectors for Better RAG Recall</title>
      <link>https://muhammadamal.my.id/blog/hybrid-search-bm25-vectors-rag/</link>
      <pubDate>Wed, 14 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/hybrid-search-bm25-vectors-rag/</guid>
      <description>Pure vector search misses exact-match queries. Hybrid BM25 + dense + RRF closes the gap. Real code, real numbers, real trade-offs.</description>
    </item>
    <item>
      <title>Chunking Strategies for RAG That Survive Real Documents</title>
      <link>https://muhammadamal.my.id/blog/rag-chunking-strategies-real-documents/</link>
      <pubDate>Mon, 12 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-chunking-strategies-real-documents/</guid>
      <description>Chunking is where RAG quality is won or lost. Semantic, hierarchical, sentence-window strategies and concrete code for documents that break defaults.</description>
    </item>
    <item>
      <title>Embedding Models in 2024, OpenAI vs Cohere vs Open Source</title>
      <link>https://muhammadamal.my.id/blog/embedding-models-2024-openai-cohere-open-source/</link>
      <pubDate>Wed, 07 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/embedding-models-2024-openai-cohere-open-source/</guid>
      <description>text-embedding-3, Cohere v3, bge-m3: which embedding model to pick in 2024. Dimension trade-offs, multilingual support, cost. Honest comparison.</description>
    </item>
    <item>
      <title>Choosing a Vector Database, Pinecone vs Qdrant vs pgvector</title>
      <link>https://muhammadamal.my.id/blog/vector-database-pinecone-qdrant-pgvector/</link>
      <pubDate>Mon, 05 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/vector-database-pinecone-qdrant-pgvector/</guid>
      <description>Pinecone serverless, Qdrant v1.7, pgvector 0.5 — how to pick. Cost, hybrid search, filtering, ops. Honest trade-offs, no marketing.</description>
    </item>
    <item>
      <title>Why Naive RAG Fails in Production, A 2024 Reality Check</title>
      <link>https://muhammadamal.my.id/blog/naive-rag-failures-production-2024/</link>
      <pubDate>Fri, 02 Feb 2024 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/naive-rag-failures-production-2024/</guid>
      <description>Naive RAG breaks in prod. Recall gaps, chunk boundaries, stale data. What the 2024 RAG stack changed and where the demo-to-prod gap still hides.</description>
    </item>
    <item>
      <title>Putting a RAG Evaluation Pipeline in CI, The Setup I Actually Use</title>
      <link>https://muhammadamal.my.id/blog/rag-evaluation-pipeline-ci/</link>
      <pubDate>Mon, 20 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/rag-evaluation-pipeline-ci/</guid>
      <description>A practical RAG eval setup wired into CI — retrieval and generation metrics, golden questions, and catching silent regressions.</description>
    </item>
    <item>
      <title>Securing an Internal LLM Chatbot, Threats, Boundaries, and What I Got Wrong</title>
      <link>https://muhammadamal.my.id/blog/securing-internal-llm-chatbot-data/</link>
      <pubDate>Tue, 14 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/securing-internal-llm-chatbot-data/</guid>
      <description>A practical guide to securing internal LLM chatbots — prompt injection, leakage, access control, and the gaps people miss.</description>
    </item>
    <item>
      <title>Shipping an Internal RAG Chatbot with LlamaIndex 0.8, What Actually Matters</title>
      <link>https://muhammadamal.my.id/blog/internal-rag-chatbot-llamaindex/</link>
      <pubDate>Thu, 02 Nov 2023 09:00:00 +0700</pubDate>
      <guid>https://muhammadamal.my.id/blog/internal-rag-chatbot-llamaindex/</guid>
      <description>Lessons from building a production internal RAG chatbot with LlamaIndex 0.8 — retrieval design, chunking, and the plumbing that makes it work.</description>
    </item>
  </channel>
</rss>
