LLM
November 20, 2023 · 7 min read
Putting a RAG Evaluation Pipeline in CI, The Setup I Actually Use
November 16, 2023 · 7 min read
Hybrid Retrieval with pgvector and BM25, A Practical Walkthrough
November 14, 2023 · 7 min read
Securing an Internal LLM Chatbot, Threats, Boundaries, and What I Got Wrong
November 10, 2023 · 6 min read
The OpenAI Assistants API in Production, A Cautious Take
November 8, 2023 · 6 min read
Migrating to GPT-4 Turbo, What 128K Context Actually Changes
November 2, 2023 · 7 min read
Shipping an Internal RAG Chatbot with LlamaIndex 0.8, What Actually Matters
April 27, 2023 · 8 min read
LangChain 0.0.13x, The Framework, the Hype, and the Real Engineering Tradeoffs
January 27, 2023 · 4 min read
Error Handling and Retries for LLM APIs
January 24, 2023 · 4 min read
LLM Cost Control and Token Budgets
January 20, 2023 · 4 min read
Streaming Responses from LLM APIs
January 17, 2023 · 4 min read
Few-Shot Prompting and In-Context Learning
January 13, 2023 · 4 min read