Architecture

In this section of the blog, I explain the core components of vector databases and how they work. I also cover Piramid's architecture: its components, how they interact with each other, the design decisions and tradeoffs behind them, and possible future directions.

What's covered

Databases→

What a database is, how vector databases differ from relational, document, and graph stores, and why similarity search requires a fundamentally different model.

Embeddings→

Where vectors come from, how neural encoders work, what it means for two vectors to be geometrically close, and how embedding quality affects retrieval.

Indexing→

The three index types (Flat, IVF, HNSW), the tradeoffs between exact recall and query latency, and how auto-selection works at different collection sizes.

Query→

How a search request moves through the engine: ANN traversal, metadata filtering, overfetch, and the recall/latency tradeoff in practice.

Storage→

How Piramid keeps data alive across restarts: mmap, the write-ahead log, checkpoints, compaction, and what the durability guarantees actually mean.
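
To give a feel for what "geometrically close" means before diving into the individual posts, here is a minimal sketch of exact (flat) search using cosine similarity. It is not Piramid's code: the function names, the toy vectors, and the choice of cosine similarity as the metric are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as unrelated to everything.
        return 0.0
    return dot / (norm_a * norm_b)


def flat_search(query, vectors, k):
    """Exact search: score every stored vector against the query, keep the top k."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [(key, score) for score, key in scored[:k]]


# Hypothetical 3-dimensional "embeddings"; real ones have hundreds of dimensions.
toy_vectors = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.3],
}

results = flat_search([1.0, 0.0, 0.0], toy_vectors, k=2)
for key, score in results:
    print(f"{key}: {score:.3f}")
```

A flat scan like this compares the query against every stored vector, so its cost grows linearly with the collection. The approximate index types named above (IVF, HNSW) avoid scanning everything, trading some recall for lower query latency.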
