LLM | Hi, I'm Muhammad Amal

November 20, 2023 · 7 min read

Putting a RAG Evaluation Pipeline in CI, The Setup I Actually Use

November 16, 2023 · 7 min read

Hybrid Retrieval with pgvector and BM25, A Practical Walkthrough

November 14, 2023 · 7 min read

Securing an Internal LLM Chatbot, Threats, Boundaries, and What I Got Wrong

November 10, 2023 · 6 min read

The OpenAI Assistants API in Production, A Cautious Take

November 8, 2023 · 6 min read

Migrating to GPT-4 Turbo, What 128K Context Actually Changes

November 2, 2023 · 7 min read

Shipping an Internal RAG Chatbot with LlamaIndex 0.8, What Actually Matters

April 27, 2023 · 8 min read

LangChain 0.0.13x, The Framework, the Hype, and the Real Engineering Tradeoffs

January 27, 2023 · 4 min read

Error Handling and Retries for LLM APIs

January 24, 2023 · 4 min read

LLM Cost Control and Token Budgets

January 20, 2023 · 4 min read

Streaming Responses from LLM APIs

January 17, 2023 · 4 min read

Few-Shot Prompting and In-Context Learning

January 13, 2023 · 4 min read

Prompt Engineering Basics for Engineers