Embedding Models & Vector Search Hub

Deep dives into text embeddings, vector databases, sparse retrieval, and semantic search. A curated collection of articles on building production-ready search systems.

Embedding Models, Vector Databases & Semantic Search

This is a curated collection of my writing on text embeddings, sparse retrieval, vector databases, and semantic search architecture.

If you’re building search systems or RAG pipelines, start here.

I’ve spent the last couple of years deep in the embedding world — training tiny Arabic models, implementing BM25 from scratch, and building hybrid search that actually works for Arabic text. This series covers both theory and code, from sparse embeddings to dense retrieval and everything in between.

HyperRun — A self-hosted AI docs widget with semantic code search
HyperRun + FastHTML — Streaming chat architecture with HTMX and SSE
Late Interaction — ColBERT and multi-vector retrieval explained
BM25 from Scratch — Implementing classical sparse retrieval
BM25 + Qdrant for Arabic — Practical sparse+dense hybrid search
BM25 Benchmarks — Full comparison across datasets
Vector Database Book — Notes on building vector search systems

Each article builds on the last — start with BM25 if you’re new, then move to late interaction and HyperRun once you’ve got the foundations down.

← Back to all blog posts

Vector Databases - O’reilly By Nitin Borwankar

blogging

til

blog/review/book

Review of a new book about Vector database from O’reilly

HyperRun + ColGrep: A Self-Hosted Alternative to RunLLM

blogging

til

blog/build/project

HyperRun is an open-source, self-hosted alternative to RunLLM. Add an…

HyperRun Deep Dive: Streaming Chat with FastHTML, HTMX, and SSE

blogging

til

blog/build/project

How I built a real-time AI chat widget with zero React, zero WebSockets, and…

BM25 Part 3: Benchmarking Python BM25 Libraries

blogging

embedding

qdrant

Benchmarking BM25 implementations: bm25-sparse, rankbm25, numpy vectorization…

BM25 Explained Part 2 | Qdrant hybrid search for real estate

blogging

embedding

qdrant

sparse

Step-by-step guide to building a bilingual hybrid search system using Qdrant…

BM25 Search Algorithm: Python Implementation from Scratch

blogging

embedding

qdrant

Learn how BM25 improves search relevance over TF-IDF. This guide covers the…

Late Interaction & ColPali: Efficient Semantic Search

blogging

embedding

minishlab

model2vec

arabic

Explore Late Interaction models like ColBERT and ColPali for efficient…