AI
November 20, 2023 · 7 min read
Putting a RAG Evaluation Pipeline in CI, The Setup I Actually Use
November 16, 2023 · 7 min read
Hybrid Retrieval with pgvector and BM25, A Practical Walkthrough
November 14, 2023 · 7 min read
Securing an Internal LLM Chatbot, Threats, Boundaries, and What I Got Wrong
November 10, 2023 · 6 min read
The OpenAI Assistants API in Production, A Cautious Take
November 8, 2023 · 6 min read
Migrating to GPT-4 Turbo, What 128K Context Actually Changes
November 2, 2023 · 7 min read
Shipping an Internal RAG Chatbot with LlamaIndex 0.8, What Actually Matters
April 27, 2023 · 8 min read
LangChain 0.0.13x, The Framework, the Hype, and the Real Engineering Tradeoffs
April 24, 2023 · 7 min read
Chroma 0.3, The Local-First Vector Database for Notebook-Scale Prototyping
April 20, 2023 · 7 min read
Weaviate 1.18 and Hybrid Search, When Keyword and Vector Search Are Both Right
April 17, 2023 · 7 min read
Milvus 2.2 in Production, Self-Hosting the Heavyweight Open-Source Vector Database
April 13, 2023 · 7 min read
Building Semantic Search From Scratch, A Production Walkthrough
April 10, 2023 · 7 min read