Free data asset · updated for 2026

The Vector Database Index

A neutral index of the vector databases teams actually shortlist for RAG and semantic search in 2026 — grouped by what kind of store they are, how they're hosted, and what each is genuinely best at. A vector database stores embeddings and does similarity (nearest-neighbour) search; it's the memory layer underneath most AI agents. We describe each option's type, hosting, and license rather than quoting prices or version-specific benchmarks, because those move fast and the database is rarely the dominant cost. Pair this with the agent-frameworks index — the framework orchestrates, the vector DB remembers.

Last reviewed June 13, 2026 · 12 databases · neutral & vendor-independent

The matrix

Database	Type	Hosting	License	Best for
Pinecone	Dedicated vector DB	Managed	Proprietary	Zero-ops, fully-managed production vector search
Weaviate	Dedicated vector DB	Both	Open-source	Open-source vector search with built-in hybrid search and modules
Qdrant	Dedicated vector DB	Both	Open-source	High-performance, Rust-based search with heavy metadata filtering
Chroma	Embedded	Both	Open-source	Lightweight, developer-friendly prototyping and small RAG apps
Milvus	Dedicated vector DB	Both (Zilliz managed)	Open-source	Billion-scale vector search
pgvector	Database extension	Both (any Postgres)	Open-source	Adding vectors to an existing Postgres stack without a new database
Redis	Document/cache DB + vectors	Both	Source-available	Low-latency vector search alongside caching/session data
Elasticsearch / OpenSearch	Search engine + vectors	Both	Open-source (OpenSearch) / mixed	Combining full-text and vector (hybrid) search at scale
MongoDB Atlas Vector Search	Document/cache DB + vectors	Managed	Proprietary (Atlas)	Vectors stored next to your operational documents
Azure AI Search	Search engine + vectors	Managed	Proprietary	Vector and hybrid search inside the Azure ecosystem
Vertex AI Vector Search	Dedicated vector DB	Managed	Proprietary	Google Cloud-scale ANN search (formerly Matching Engine)
LanceDB	Embedded	Both	Open-source	Embedded, multimodal, on-disk vector storage

Store type, hosting, and license — not version-specific benchmarks, which go stale fast and depend on your data and query shape. Benchmark the shortlist on your own workload before committing.

How to read the types

Dedicated vector DB: Purpose-built for storing embeddings and running approximate-nearest-neighbour (ANN) search — the most features for pure vector workloads.
Database extension: Adds vector columns and indexes to a general database you already run (e.g. Postgres) — fewer moving parts.
Search engine + vectors: A full-text search engine that also does vector and hybrid (keyword + semantic) search at scale.
Embedded: Runs in-process / on-disk inside your application — no separate server to operate.
Document/cache DB + vectors: An operational database (document or in-memory) that adds vector search next to your live data.

Which one should you pick?

You want zero-ops, fully managed vector searchPinecone
You already run Postgres and want one databasepgvector
You want open-source with hybrid searchWeaviate or Qdrant
You're prototyping a RAG app fastChroma or LanceDB
You need billion-scale ANNMilvus
You want vectors next to operational dataMongoDB Atlas or Redis
You need full-text + vector at scaleElasticsearch / OpenSearch
You're committed to Azure or Google CloudAzure AI Search / Vertex AI Vector Search

FAQs

What is a vector database and do I actually need one?

A vector database stores embeddings — numerical representations of text, images, or other data — and finds the most similar ones via approximate-nearest-neighbour (ANN) search. It's what powers retrieval-augmented generation (RAG) and semantic search. You don't always need a dedicated one: for many apps, the pgvector extension on a Postgres you already run, or your existing search engine, is enough. Reach for a dedicated vector DB when you need large scale, advanced filtering, or specialized index types.

Which vector database is the best?

There's no single winner — it depends on scale and stack. For zero-ops managed search, Pinecone; for open-source with hybrid search, Weaviate or Qdrant; if you're already on Postgres, pgvector; for fast prototyping, Chroma or LanceDB; for billion-scale, Milvus; for vectors next to operational data, MongoDB Atlas or Redis. Cloud-committed teams often default to Azure AI Search or Vertex AI Vector Search.

Do I need a dedicated vector database, or can I just use Postgres?

For a large share of real applications, pgvector on Postgres is enough and keeps your stack to one database — fewer moving parts, one backup story, familiar tooling. Graduate to a dedicated vector database when you hit very large vector counts, need sophisticated metadata filtering, want specialized ANN index types, or need to decouple vector workloads from your transactional database.

How do vector databases relate to AI agents and frameworks?

They're complementary layers of the same stack. An agent that uses RAG retrieves relevant context from a vector database before answering; the agent framework (LangChain, LlamaIndex, and others) orchestrates the reasoning and tool calls, while the vector database stores and retrieves the embeddings. See the AI Agent Frameworks Index for the orchestration layer that sits on top of these stores.

Are vector databases free, and how current is this index?

Many are open-source: free to self-host, with managed tiers that bill on usage. The database is rarely the dominant cost — embedding generation and model API calls usually are. The ecosystem evolves quickly, so we describe each option's type, hosting, and license rather than version-specific benchmarks; a last-reviewed date is shown at the top, and you should verify each project's current docs before committing.

The Vector Database Index

Last reviewed June 13, 2026 · 12 databases · neutral & vendor-independent

The matrix

Database	Type	Hosting	License	Best for
Pinecone	Dedicated vector DB	Managed	Proprietary	Zero-ops, fully-managed production vector search
Weaviate	Dedicated vector DB	Both	Open-source	Open-source vector search with built-in hybrid search and modules
Qdrant	Dedicated vector DB	Both	Open-source	High-performance, Rust-based search with heavy metadata filtering
Chroma	Embedded	Both	Open-source	Lightweight, developer-friendly prototyping and small RAG apps
Milvus	Dedicated vector DB	Both (Zilliz managed)	Open-source	Billion-scale vector search
pgvector	Database extension	Both (any Postgres)	Open-source	Adding vectors to an existing Postgres stack without a new database
Redis	Document/cache DB + vectors	Both	Source-available	Low-latency vector search alongside caching/session data
Elasticsearch / OpenSearch	Search engine + vectors	Both	Open-source (OpenSearch) / mixed	Combining full-text and vector (hybrid) search at scale
MongoDB Atlas Vector Search	Document/cache DB + vectors	Managed	Proprietary (Atlas)	Vectors stored next to your operational documents
Azure AI Search	Search engine + vectors	Managed	Proprietary	Vector and hybrid search inside the Azure ecosystem
Vertex AI Vector Search	Dedicated vector DB	Managed	Proprietary	Google Cloud-scale ANN search (formerly Matching Engine)
LanceDB	Embedded	Both	Open-source	Embedded, multimodal, on-disk vector storage

Store type, hosting, and license — not version-specific benchmarks, which go stale fast and depend on your data and query shape. Benchmark the shortlist on your own workload before committing.

How to read the types

Dedicated vector DB

Purpose-built for storing embeddings and running approximate-nearest-neighbour (ANN) search — the most features for pure vector workloads.

Database extension

Adds vector columns and indexes to a general database you already run (e.g. Postgres) — fewer moving parts.

Search engine + vectors

A full-text search engine that also does vector and hybrid (keyword + semantic) search at scale.

Embedded

Runs in-process / on-disk inside your application — no separate server to operate.

Document/cache DB + vectors

An operational database (document or in-memory) that adds vector search next to your live data.

Which one should you pick?

You want zero-ops, fully managed vector searchPinecone

You already run Postgres and want one databasepgvector

You want open-source with hybrid searchWeaviate or Qdrant

You're prototyping a RAG app fastChroma or LanceDB

You need billion-scale ANNMilvus

You want vectors next to operational dataMongoDB Atlas or Redis

You need full-text + vector at scaleElasticsearch / OpenSearch

You're committed to Azure or Google CloudAzure AI Search / Vertex AI Vector Search

FAQs

What is a vector database and do I actually need one?

Which vector database is the best?

Do I need a dedicated vector database, or can I just use Postgres?

How do vector databases relate to AI agents and frameworks?

Are vector databases free, and how current is this index?

The matrix

How to read the types

Which one should you pick?

FAQs

Field notes from the automation frontier

The matrix

How to read the types

Which one should you pick?

FAQs

Field notes from the automation frontier