228+ Distributed Databases Ranked & Compared

Compare distributed databases ranked by GitHub stars, fault tolerance, and global scalability.

Last updated: May 29, 2026
206 databases
1Elasticsearch
Elasticsearch
76.7k+152 30d

Distributed search and analytics engine built on Apache Lucene for full-text search, observability, and security

Search·2010·Elastic-2.0·Java
2etcd
etcd
51.7k+77 30d

Distributed reliable key-value store for the most critical data of a distributed system

Key-Value·2013·Apache-2.0·Go
3ClickHouse
ClickHouse
47.7k+551 30d

Blazing-fast open-source column-oriented database for real-time analytics and OLAP

Analytics·2016·Apache-2.0·C++
4Milvus
Milvus
44.5k+469 30d

High-performance cloud-native vector database built for scalable similarity search and AI applications

Vector·2019·Apache-2.0·Go, C++
5Apache Spark SQL
Apache Spark SQL
43.4k+149 30d

Distributed SQL query engine within Apache Spark for structured data processing at scale

Analytics·2014·Apache-2.0·Scala, Java, Python, R
6TiDB
TiDB
40.1k+73 30d

MySQL-compatible distributed SQL database for hybrid transactional and analytical workloads

Relational·2015·Apache-2.0·Go
7SurrealDB
SurrealDB
32.3k+291 30d

Multi-model database combining documents, graphs, vectors, and time-series with built-in API layer and real-time capabilities

Multi-Model·2022·BUSL-1.1·Rust
8CockroachDB
CockroachDB
32.2k+63 30d

Distributed SQL database built for cloud-native global applications with serializable isolation

Relational·2015·BUSL-1.1·Go, C++
9Qdrant
Qdrant
31.6k+804 30d

High-performance open-source vector database for next-generation AI applications

Vector·2021·Apache-2.0·Rust
10InfluxDB
InfluxDB
31.5k+41 30d

Scalable time-series database built in Rust for metrics, events, and real-time analytics

Time-Series·2013·Apache-2.0·Rust
11MongoDB
MongoDB
28.3k+62 30d

The most popular document database for modern applications

Document·2009·SSPL·C++, JavaScript, Python
12RethinkDB
RethinkDB
27.0k+4 30d

Open-source document database designed for real-time push updates to applications

Document·2012·Apache-2.0·C++
13Apache Flink
Apache Flink
26.0k+67 30d

Stateful stream processing framework for real-time and batch data at any scale

Streaming·2011·Apache-2.0·Java, Scala
14Valkey
Valkey
25.9k+319 30d

Open-source high-performance key-value database forked from Redis, backed by the Linux Foundation

Key-Value·2024·BSD-3-Clause·C
15TDengine
TDengine
24.9k+37 30d

High-performance open-source time-series database designed for Industrial IoT and real-time analytics

Time-Series·2019·AGPL-3.0·C
16Neon
Neon
22.1k+420 30d

Serverless PostgreSQL with separated storage and compute, branching, and scale-to-zero

Relational·2022·Apache-2.0·Rust, C
17Dgraph
Dgraph
21.7k+2 30d

Distributed graph database with native GraphQL support built for horizontal scale

Graph·2017·Apache-2.0·Go
18Vitess
Vitess
21.0k+37 30d

Cloud-native database clustering system for horizontal scaling of MySQL through transparent sharding

Relational·2012·Apache-2.0·Go
19Apache ShardingSphere

Distributed SQL middleware providing sharding, encryption, and read-write splitting across any database

Relational·2016·Apache-2.0·Java
20rqlite
rqlite
17.5k+77 30d

Lightweight, fault-tolerant, distributed relational database built on SQLite and Raft consensus

Relational·2014·MIT·Go
21VictoriaMetrics
VictoriaMetrics
17.1k+182 30d

Fast, cost-effective time-series database and monitoring solution compatible with Prometheus

Time-Series·2018·Apache-2.0·Go
22Presto
Presto
16.7k+3 30d

Distributed SQL query engine for running interactive analytic queries against data sources of all sizes

Analytics·2012·Apache-2.0·Java, C++
23TiKV
TiKV
16.7k+41 30d

Distributed transactional key-value database providing ACID guarantees at scale

Key-Value·2016·Apache-2.0·Rust
24Neo4j
Neo4j
16.6k+197 30d

Native graph database with Cypher query language for connected data at scale

Graph·2007·GPL-3.0·Java, Scala
25FoundationDB
FoundationDB
16.4k+66 30d

Distributed, transactional key-value store with multi-model layers and strict serializability

Key-Value·2013·Apache-2.0·C++, Flow
26Weaviate
Weaviate
16.2k+160 30d

AI-native vector database with hybrid search and built-in model integration

Vector·2019·BSD-3-Clause·Go
27ScyllaDB
ScyllaDB
15.6k+53 30d

High-performance NoSQL wide-column database compatible with Apache Cassandra and Amazon DynamoDB

Wide-Column·2015·ScyllaDB Source Available License·C++
28Apache Doris
Apache Doris
15.4k+136 30d

High-performance real-time analytical database for sub-second queries on large-scale data

Analytics·2017·Apache-2.0·Java, C++
29Memcached
Memcached
14.2k+17 30d

High-performance distributed memory caching system for speeding up dynamic web applications

Key-Value·2003·BSD-3-Clause·C
30ArangoDB
ArangoDB
14.2k+20 30d

Multi-model database unifying document, graph, and key-value in a single engine with AQL

Multi-Model·2012·BUSL-1.1·C++, JavaScript
31Thanos
Thanos
14.1k+44 30d

Highly available Prometheus setup with unlimited long-term storage on object storage

Time-Series·2017·Apache-2.0·Go
32Apache Druid
Apache Druid
14.0k+28 30d

High-performance real-time analytics database for sub-second OLAP queries at scale

Analytics·2012·Apache-2.0·Java
33OpenSearch
OpenSearch
13.0k+178 30d

Community-driven open-source search and analytics engine forked from Elasticsearch

Search·2021·Apache-2.0·Java
34Trino
Trino
12.9k+103 30d

Fast distributed SQL query engine for big data analytics across heterogeneous data sources

Analytics·2019·Apache-2.0·Java
35Citus
Citus
12.5k+63 30d

Distributed PostgreSQL as an extension for multi-tenant SaaS and real-time analytics at scale

Relational·2016·AGPL-3.0·C
36KeyDB
KeyDB
12.5k+14 30d

Multithreaded Redis fork with higher throughput and active replication

Key-Value·2019·BSD-3-Clause·C++, C
37NebulaGraph
NebulaGraph
12.2k+40 30d

Distributed graph database built for billion-scale graphs with millisecond latency

Graph·2019·Apache-2.0·C++
38Mnesia
Mnesia
12.1k+23 30d

Distributed real-time database management system built into Erlang/OTP for telecom-grade fault tolerance

Key-Value·1999·Apache-2.0·Erlang
39Manticore Search
Manticore Search
11.8k+45 30d

Fast open-source search database with SQL and JSON interfaces

Search·2017·GPL-3.0·C++
40Convex
Convex
11.7k+309 30d

Reactive backend database with real-time sync, TypeScript-native queries, and built-in serverless functions

Document·2022·BUSL-1.1·Rust, TypeScript
41StarRocks
StarRocks
11.7k+101 30d

High-performance MPP analytics engine for real-time and batch data warehousing

Analytics·2021·Apache-2.0·Java, C++
42Quickwit
Quickwit
11.2k+132 30d

Cloud-native search engine for observability, built on object storage with sub-second latency

Search·2021·Apache-2.0·Rust
43YugabyteDB
YugabyteDB
10.3k+66 30d

PostgreSQL-compatible distributed SQL database with high resilience and geo-distribution

Relational·2017·Apache-2.0·C++, Java
44OceanBase
OceanBase
10.1k+51 30d

Distributed relational database for high-performance transactional, analytical, and AI workloads at scale

Relational·2010·Mulan PubL v2·C++
45Cassandra
Cassandra
9.7k+27 30d

Distributed wide-column database designed for high availability and linear scalability across data centers

Wide-Column·2008·Apache-2.0·Java
46Databend
Databend
9.3k+30 30d

Cloud-native data warehouse built in Rust for analytics, search, and AI on object storage

Analytics·2021·Apache-2.0·Rust
47Deep Lake
Deep Lake
9.1k+37 30d

GPU-native vector and multimodal data lake for AI agents with deep learning integrations

Vector·2020·Apache-2.0·Python, C++
48RisingWave
RisingWave
9.0k+94 30d

Postgres-compatible streaming database for real-time event processing and analytics

Streaming·2022·Apache-2.0·Rust
49Vespa
Vespa
6.9k+30 30d

Open-source big data serving engine combining search, recommendation, and real-time AI at scale

Search·2017·Apache-2.0·Java, C++
50Apache CouchDB
Apache CouchDB
6.9k+21 30d

Seamless multi-master sync with an intuitive HTTP/JSON API

Document·2005·Apache-2.0·Erlang, JavaScript, C
51Hazelcast
Hazelcast
6.6k−4 30d

Unified real-time data platform combining in-memory data grid with stream processing

Key-Value·2008·Apache-2.0·Java
52Apache IoTDB
Apache IoTDB
6.3k+13 30d

High-performance time-series database for IoT data with lightweight architecture and high compression

Time-Series·2019·Apache-2.0·Java
53GreptimeDB
GreptimeDB
6.3k+92 30d

Open-source unified observability database for metrics, logs, and traces built in Rust

Time-Series·2022·Apache-2.0·Rust
54Apache Pinot
Apache Pinot
6.1k+17 30d

Real-time distributed OLAP datastore for ultra low-latency analytics at high throughput

Analytics·2015·Apache-2.0·Java
55Apache Hive
Apache Hive
6.0k−4 30d

Data warehouse software for reading, writing, and managing large datasets in distributed storage using SQL

Analytics·2010·Apache-2.0·Java
56AliSQL
AliSQL
5.8k+6 30d

Alibaba's battle-tested MySQL branch with built-in DuckDB analytics and vector search

Relational·2016·GPL-2.0·C++, C
57Cortex
Cortex
5.8k+12 30d

Horizontally scalable, multi-tenant long-term storage for Prometheus metrics

Time-Series·2016·Apache-2.0·Go
58KurrentDB
KurrentDB
5.8k+20 30d

Event-native database for event sourcing and event-driven architectures with built-in streaming, formerly EventStoreDB

Streaming·2012·Kurrent License·C#
59JanusGraph
JanusGraph
5.8k+21 30d

Scalable open-source distributed graph database optimized for storing and querying billions of vertices and edges

Graph·2017·Apache-2.0·Java
60Apache HBase
Apache HBase
5.5k−9 30d

Distributed wide-column store for random real-time read/write access to big data

Wide-Column·2008·Apache-2.0·Java
61Apache Ignite
Apache Ignite
5.1k+3 30d

Distributed in-memory database with ACID transactions, SQL, and compute capabilities

Multi-Model·2015·Apache-2.0·Java, C++, C#
62OpenTSDB
OpenTSDB
5.1k+4 30d

Distributed, scalable time-series database built on top of HBase for monitoring at massive scale

Time-Series·2010·LGPL-2.1·Java
63Marqo
Marqo
5.0k0 30d

AI-native tensor search engine with built-in embedding generation for multimodal vector search

Vector·2022·Apache-2.0·Python
64OrientDB
OrientDB
5.0k+9 30d

Multi-model database combining graph, document, key-value, and object models with SQL support and ACID transactions

Multi-Model·2010·Apache-2.0·Java
65M3DB
M3DB
4.9k−6 30d

Distributed time-series database built by Uber for large-scale metrics with Prometheus and Graphite compatibility

Time-Series·2018·Apache-2.0·Go
66YDB
YDB
4.7k+8 30d

Open-source distributed SQL database combining high availability, scalability, strong consistency, and ACID transactions

Relational·2022·Apache-2.0·C++
67CrateDB
CrateDB
4.4k+3 30d

Distributed SQL database for real-time analytics on massive datasets with PostgreSQL compatibility

Multi-Model·2014·Apache-2.0·Java
68TypeDB
TypeDB
4.3k+33 30d

Polymorphic database with a conceptual data model, strong type system, and symbolic reasoning engine

Graph·2016·MPL-2.0·Rust
69dqlite
dqlite
4.3k+19 30d

Lightweight distributed SQLite with Raft consensus for fault-tolerant edge and IoT deployments

Relational·2017·LGPL-3.0·C
70Kvrocks
Kvrocks
4.3k+23 30d

Distributed Redis-compatible key-value NoSQL database built on RocksDB for cost-effective persistent storage

Key-Value·2019·Apache-2.0·C++
71RavenDB
RavenDB
3.9k+7 30d

ACID document database with integrated full-text search, time series, and distributed counters

Document·2010·AGPL-3.0 / Commercial·C#
72Apache Kylin
Apache Kylin
3.8k0 30d

Distributed OLAP engine with sub-second query performance via pre-calculated cubes on Hadoop

Analytics·2015·Apache-2.0·Java
73Netflix Atlas
Netflix Atlas
3.6k0 30d

In-memory dimensional time-series database built for operational metrics at Netflix scale

Time-Series·2014·Apache-2.0·Scala, Java
74Olric
Olric
3.5k+16 30d

Distributed, in-memory key/value store and cache with Redis-compatible protocol support

Key-Value·2018·Apache-2.0·Go
75Roshi
Roshi
3.2k+2 30d

Large-scale CRDT set implementation for timestamped events backed by Redis

Time-Series·2014·BSD-2-Clause·Go
76PolarDB for PostgreSQL

Alibaba's cloud-native PostgreSQL with shared-storage architecture and elastic scaling

Relational·2021·Apache-2.0·C
77Apache HugeGraph
Apache HugeGraph
3.1k+26 30d

High-performance graph database supporting hundreds of billions of vertices and edges

Graph·2017·Apache-2.0·Java
78LinDB
LinDB
3.1k+2 30d

Scalable, high-performance distributed time-series database with multi-IDC replication

Time-Series·2019·Apache-2.0·Go
79Apache HoraeDB
Apache HoraeDB
2.8k−3 30d

High-performance distributed cloud-native time-series database for analytics and time-series workloads

Time-Series·2022·Apache-2.0·Rust
80GridDB
GridDB
2.5k−3 30d

IoT-optimized time-series database with hybrid in-memory and disk storage from Toshiba

Time-Series·2013·AGPL-3.0·C++, Java
81Apache Geode
Apache Geode
2.4k+7 30d

In-memory data grid providing real-time, consistent access to data-intensive applications at massive scale

Key-Value·2015·Apache-2.0·Java
82Apache Sedona
Apache Sedona
2.3k+10 30d

Cluster computing framework for large-scale geospatial data processing on Spark, Flink, and Snowflake

Analytics·2017·Apache-2.0·Scala, Java, Python, Rust
83Graph Engine
Graph Engine
2.3k0 30d

Distributed in-memory graph processing engine with strongly-typed key-value store, formerly Trinity

Graph·2015·MIT·C#, C++
84YTsaurus
YTsaurus
2.2k+33 30d

Exabyte-scale distributed storage and processing platform for big data from Yandex

Multi-Model·2023·Apache-2.0·C++
85Apache Drill
Apache Drill
2.0k+1 30d

Schema-free SQL query engine for Hadoop, NoSQL, and cloud storage with dynamic schema discovery

Analytics·2015·Apache-2.0·Java
86VictoriaLogs
VictoriaLogs
1.9k+96 30d

Fast and easy-to-use open-source log management database by VictoriaMetrics

Search·2023·Apache-2.0·Go
87ActorDB
ActorDB
1.9k−2 30d

Distributed SQL database using the actor model with Raft consensus on SQLite

Relational·2014·MPL-2.0·Erlang, C
88MatrixOne
MatrixOne
1.8k+8 30d

Cloud-native HTAP database with MySQL compatibility, Git-style data versioning, and AI-native capabilities

Relational·2021·Apache-2.0·Go
89KairosDB
KairosDB
1.8k+2 30d

Fast distributed scalable time-series database built on top of Apache Cassandra

Time-Series·2013·Apache-2.0·Java
90CnosDB
CnosDB
1.8k+4 30d

Cloud-native open-source distributed time-series database with high performance and compression

Time-Series·2022·AGPL-3.0·Rust
91Elassandra
Elassandra
1.7k−1 30d

Apache Cassandra distribution with tightly integrated Elasticsearch for combined NoSQL storage and search

Wide-Column·2015·Apache-2.0·Java
92Vald
Vald
1.7k+2 30d

Highly scalable distributed vector search engine built on Cloud-Native architecture with NGT

Vector·2019·Apache-2.0·Go
93OpenMLDB
OpenMLDB
1.7k+1 30d

Open-source machine learning database providing consistent feature engineering for training and inference

Time-Series·2021·Apache-2.0·C++, Java, Python
94PolarDB-X
PolarDB-X
1.7k+3 30d

Cloud-native distributed SQL database for high concurrency and massive storage with MySQL compatibility

Relational·2020·Apache-2.0·Java
95Apache Solr
Apache Solr
1.6k+8 30d

Blazing-fast, open-source multi-modal search platform built on Apache Lucene

Search·2004·Apache-2.0·Java
96CovenantSQL
CovenantSQL
1.5k0 30d

Decentralized SQL database built on blockchain with immutable query history and GDPR compliance

Relational·2018·Apache-2.0·Go
97Comdb2
Comdb2
1.5k+8 30d

Bloomberg's clustered RDBMS built on optimistic concurrency with high availability SQL

Relational·2004·Apache-2.0·C
98GeoMesa
GeoMesa
1.5k+2 30d

Distributed spatio-temporal indexing on top of Accumulo, HBase, Cassandra, and Kafka

Multi-Model·2014·Apache-2.0·Scala, Java
99Aerospike
Aerospike
1.3k+6 30d

Flash-optimized distributed NoSQL database for real-time applications at massive scale

Key-Value·2012·AGPL-3.0·C
100Infinispan
Infinispan
1.3k+7 30d

Open-source distributed in-memory data grid with multi-protocol access and cross-site replication

Key-Value·2009·Apache-2.0·Java
101Apache Impala
Apache Impala
1.3k+6 30d

Native analytic SQL engine for Apache Hadoop and open data formats with low-latency queries

Analytics·2012·Apache-2.0·C++, Java
102Apache Cloudberry

Advanced open-source MPP analytics database forked from Greenplum with a modern PostgreSQL kernel

Analytics·2023·Apache-2.0·C, C++
103openGemini
openGemini
1.2k+6 30d

Cloud-native distributed time-series database by Huawei for IoT and observability at massive scale

Time-Series·2022·Apache-2.0·Go
104Apache Accumulo
Apache Accumulo
1.1k+1 30d

Sorted, distributed key-value store built on Hadoop with cell-level security

Wide-Column·2011·Apache-2.0·Java
105Apache Phoenix
Apache Phoenix
1.1k+2 30d

Massively parallel SQL engine on top of Apache HBase for low-latency OLTP queries

Relational·2014·Apache-2.0·Java
106MyScale
MyScale
1.0k0 30d

SQL vector database built on ClickHouse for high-performance AI applications with filtered search

Vector·2023·Apache-2.0·C++
107ArcadeDB
ArcadeDB
901+71 30d

Multi-model database supporting graphs, documents, key-value, vectors, time-series, and search in one engine

Multi-Model·2021·Apache-2.0·Java
108openGauss
openGauss
777+3 30d

Huawei's open-source PostgreSQL-derived enterprise RDBMS optimized for ARM and high concurrency

Relational·2020·Mulan PSL v2·C, C++
109NCache
NCache
660+1 30d

Open-source distributed in-memory cache for .NET and Java with pub/sub messaging

Key-Value·2005·Apache-2.0·C#, .NET
110SiriDB
SiriDB
512 0 30d

Highly scalable and super fast open-source time-series database with dynamic grouping

Time-Series·2017·MIT·C
111Oracle Coherence

In-memory data grid with fault-tolerant caching, transactions, and event processing for enterprise Java applications

Key-Value·2001·UPL-1.0·Java
112Warp 10
Warp 10
415+1 30d

Advanced open-source time-series platform with native geo-temporal support and WarpScript analytics

Time-Series·2015·Apache-2.0·Java
113Fluree
Fluree
385+10 30d

Immutable, ledger-backed semantic graph database with native RDF and JSON-LD support

Graph·2016·EPL-2.0·Clojure
114OpenTenBase
OpenTenBase
385+133 30d

Enterprise-level distributed HTAP database based on PostgreSQL for hybrid transactional and analytical workloads

Relational·2019·BSD-3-Clause·C
115Gnocchi
Gnocchi
321 0 30d

Scalable time-series database with pre-computed aggregations for cloud metrics and resource indexing

Time-Series·2017·Apache-2.0·Python
116Percona Server for MongoDB

Enhanced open-source MongoDB drop-in replacement with enterprise-grade security and backup features

Document·2016·SSPL·C++, JavaScript
117Riak KV
Riak KV
44+2 30d

Distributed NoSQL key-value database with masterless architecture for high availability and fault tolerance

Key-Value·2009·Apache-2.0·Erlang
1181010data

Cloud-based columnar analytics platform for massive-scale data discovery and ad hoc analysis

Analytics·2000·proprietary·K
119Actian NoSQL Database

Object-oriented database for complex data models with native language integration

Document·1988·proprietary·C, C++, Java
120Algolia

AI-powered search and discovery API delivering sub-millisecond results with typo tolerance and real-time indexing

Search·2012·proprietary·C++
121Alibaba Cloud AnalyticDB for MySQL

Cloud-native real-time data warehouse with MySQL compatibility for petabyte-scale analytics

Analytics·2017·proprietary·C++
122Alibaba Cloud AnalyticDB for PostgreSQL

MPP cloud data warehouse with PostgreSQL compatibility and vector search capabilities

Analytics·2016·proprietary·C, C++
123Alibaba Cloud Log Service

Cloud-native observability platform for PB-scale log collection, analysis, and visualization

Analytics·2016·proprietary
124Alibaba Cloud MaxCompute

Fully managed petabyte-scale data warehouse with serverless SQL, MapReduce, and graph computation

Analytics·2010·proprietary
125Alibaba Cloud PolarDB

Cloud-native relational database with MySQL, PostgreSQL, and Oracle compatibility and HTAP capabilities

Relational·2018·proprietary·C, C++
126Alibaba Cloud Table Store

Serverless NoSQL wide-column and time-series storage with auto-scaling to 10 PB

Wide-Column·2016·proprietary
127AllegroGraph

Neuro-symbolic AI platform combining RDF knowledge graphs, vector store, and SPARQL in a transactional graph database

Graph·2004·Commercial (free edition available)·Common Lisp, C
128Amazon Aurora

MySQL and PostgreSQL-compatible relational database with up to 5x throughput and 99.999% availability

Relational·2014·proprietary
129Amazon CloudSearch

Managed search service with auto-scaling, faceted search, and support for 34 languages

Search·2012·proprietary
130Amazon DocumentDB

Fully managed MongoDB-compatible document database with fast performance and up to 10 global regions

Document·2019·proprietary
131Amazon DynamoDB

Serverless, fully managed NoSQL key-value and document database with single-digit millisecond performance at any scale

Key-Value·2012·proprietary
132Amazon Keyspaces

Serverless, fully managed Apache Cassandra-compatible database service on AWS

Wide-Column·2020·proprietary
133Amazon Neptune

Fully managed graph database service supporting Gremlin, openCypher, and SPARQL

Graph·2018·proprietary
134Amazon Redshift

Petabyte-scale cloud data warehouse with columnar storage and massively parallel processing

Analytics·2013·proprietary
135Amazon Timestream

Serverless time-series database for IoT and operational applications with built-in analytics

Time-Series·2020·proprietary
136AnzoGraph DB

Massively parallel graph OLAP database for W3C standards-based analytics at scale

Graph·2018·proprietary·C++
137Axibase Time Series Database

Special-purpose time-series database for IT infrastructure, industrial equipment, and financial market data

Time-Series·2004·proprietary·Java
138Azure Cosmos DB

Globally distributed, multi-model database service with turnkey multi-region replication and single-digit millisecond latency

Multi-Model·2017·proprietary·C++, C#
139Cloudflare Workers KV

Global edge key-value store with low-latency reads across 330+ locations

Key-Value·2018·proprietary
140CloudKit

Apple's cloud database service for seamless data sync across all Apple platforms

Document·2014·proprietary
141Couchbase

Multi-model NoSQL database for enterprise applications with SQL++ support

Multi-Model·2011·BSL 1.1 / Apache-2.0 (Community)·C++, Go, Erlang, C
142Coveo

AI-powered enterprise search and relevance platform with machine learning recommendations

Search·2005·proprietary
143Databricks

Unified data lakehouse platform combining the best of data warehouses and data lakes with Delta Lake

Analytics·2013·proprietary
144DataStax Enterprise

Enterprise-grade distributed database built on Apache Cassandra with integrated analytics, search, and graph

Wide-Column·2010·Commercial·Java
145DolphinDB

High-performance time-series database with built-in analytics for finance and IoT

Time-Series·2018·Proprietary·C++
146Exasol

High-performance in-memory MPP analytics database delivering up to 1000x faster analytical queries

Analytics·2000·Commercial·C++
147Firebolt

Sub-second analytics cloud data warehouse built for high-concurrency, data-intensive applications

Analytics·2021·proprietary·C++
148Firestore

Serverless, fully managed NoSQL document database with real-time sync and offline support for mobile and web apps

Document·2017·proprietary
149Galaxybase

High-performance native distributed graph database for HTAP workloads at trillion-edge scale

Graph·2018·proprietary·Java
150GBase

Chinese enterprise database platform with leading analytical and transactional database products

Analytics·2004·proprietary·C, C++
151GemStone/S

Smalltalk-based object database for scalable, transactional multi-tier business applications

Multi-Model·1986·Proprietary·Smalltalk, C
152GigaSpaces XAP

In-memory computing platform for real-time analytics and extreme transaction processing

Key-Value·2000·Proprietary·Java
153Google BigQuery

Serverless, highly scalable multi-cloud data warehouse with built-in ML and real-time analytics

Analytics·2010·proprietary
154Google Cloud Bigtable

Fully managed, low-latency wide-column NoSQL database for massive analytical and operational workloads

Wide-Column·2015·Proprietary·C++
155Google Cloud Datastore

Highly scalable NoSQL document database with automatic sharding and ACID transactions

Document·2013·Proprietary
156Google Cloud Spanner

Globally distributed, strongly consistent relational database with unlimited scale and 99.999% availability

Relational·2017·Proprietary·C++
157GraphBase

Enterprise graph database for knowledge graphs and Graph RAG with distributed cloud-native architecture

Graph·proprietary
158Greenplum

Massively parallel processing analytics database built on PostgreSQL for large-scale data warehousing

Analytics·2005·proprietary·C, C++, Python
159GridGain

In-memory computing platform built on Apache Ignite for real-time transactions and analytics

Multi-Model·2007·proprietary·Java, C++, C#
160HPE Ezmeral Data Fabric

Converged data platform with integrated NoSQL database, file system, and event streams for hybrid cloud

Multi-Model·2009·proprietary·C++, Java
161IBM Cloudant

Fully managed CouchDB-compatible JSON document database with global distribution and serverless scaling

Document·2010·proprietary·Erlang, JavaScript, C
162IBM Db2

Enterprise-grade relational database with AI-powered optimization and hybrid cloud deployment

Relational·1983·proprietary·C, C++, Assembly
163InfiniteGraph

Distributed graph database for large-scale relationship analytics and deep link analysis

Graph·2010·Commercial·Java, C++
164Kinetica

GPU-accelerated real-time analytics database for spatial, temporal, graph, and AI workloads at scale

Analytics·2016·proprietary·C++
165Kyligence Enterprise

AI-augmented OLAP analytics platform delivering sub-second queries on petabyte-scale data, built on Apache Kylin

Analytics·2016·proprietary·Java
166LeanXcale

Ultra-scalable distributed SQL database with full ACID compliance and NoSQL-speed ingestion

Relational·2015·proprietary·Java
167MarkLogic

Enterprise multi-model database combining documents, graph, and search with government-grade security

Multi-Model·2001·proprietary·C++
168Microsoft Azure AI Search

Enterprise cloud search service with vector search, semantic ranking, and AI-powered agentic retrieval

Search·2014·proprietary
169Microsoft Azure Data Explorer

Fast and scalable data analytics service for real-time analysis of streaming and time-series data using Kusto Query Language

Analytics·2019·proprietary
170Microsoft Azure SQL Database

Fully managed cloud relational database built on the Microsoft SQL Server engine with intelligent performance

Relational·2010·proprietary
171Microsoft Azure Synapse Analytics

Integrated analytics platform combining data warehousing, big data, and data integration with serverless and provisioned options

Analytics·2019·proprietary
172Microsoft Azure Table Storage

Schemaless NoSQL key-value store for massive volumes of semi-structured data with strong consistency

Key-Value·2009·proprietary
173Microsoft Fabric

Unified analytics platform converging data warehousing, engineering, science, and real-time intelligence on a single SaaS foundation

Analytics·2023·proprietary
174Netezza

Purpose-built analytics appliance for high-performance data warehousing and advanced analytics

Analytics·2002·Proprietary·C, C++
175NonStop SQL

Fault-tolerant relational database for mission-critical OLTP on HPE NonStop systems

Relational·1986·proprietary·C, C++
176NuoDB

Cloud-native distributed SQL database with ACID compliance and elastic scale-out

Relational·2010·proprietary·C++
177Ontotext GraphDB

Enterprise RDF triplestore with real-time semantic inferencing at billion-statement scale

Graph·2000·proprietary·Java
178Oracle

Enterprise-grade multi-model database with AI-native capabilities

Relational·1979·Oracle Commercial License·C, C++
179Oracle NoSQL Database

Distributed NoSQL database providing key-value, table, and document data models with ACID transactions

Key-Value·2011·Apache-2.0 (Community Edition)·Java
180OushuDB

Cloud-native MPP data warehouse built on Apache HAWQ for petabyte-scale interactive analytics

Analytics·2016·proprietary·C, C++
181PieCloudDB

Cloud-native virtual data warehouse with elastic massive parallel processing (eMPP) architecture

Analytics·2022·proprietary·C++
182Pinecone

Fully managed vector database built for high-performance AI applications at scale

Vector·2021·proprietary
183PlanetScale

Serverless MySQL-compatible cloud database platform powered by Vitess with branching and deploy requests

Relational·2021·proprietary·Go
184QuasarDB

High-performance distributed column-oriented time-series database with native transactional support

Time-Series·2009·proprietary·C++
185Rizhiyi

Enterprise log analytics platform with proprietary Beaver search engine for PB-level log management

Search·2014·proprietary·C++
186SAP HANA

In-memory relational database for real-time analytics and transactional processing in enterprise environments

Relational·2010·proprietary·C++
187ScaleOut StateServer

In-memory data grid with distributed caching, parallel query, and high availability for .NET and Java

Key-Value·2005·Commercial·C++, C#
188SciDB

Array database for multidimensional data management and complex analytics in scientific computing

Analytics·2010·AGPL-3.0·C++
189SingleStore

Distributed SQL database for data-intensive applications combining transactions, analytics, and AI workloads

Relational·2013·SingleStore Commercial License·C++
190Snowflake

Cloud-native data warehouse with automatic scaling, separation of storage and compute, and near-zero administration

Analytics·2014·proprietary
191SpaceTime

Spatiotemporal relational database optimized for analytical workloads on moving objects with JIT compilation

Analytics·2020·proprietary·C++
192Splunk

Platform for searching, monitoring, and analyzing machine-generated data via a web-style interface

Search·2003·proprietary·C++, Python
193Stardog

Enterprise knowledge graph platform built on RDF standards for data unification and AI-powered insights

Graph·2012·proprietary·Java
194TDSQL for MySQL

Tencent's distributed MySQL-compatible database with strong consistency and horizontal scaling

Relational·2012·proprietary·C, C++
195Teradata

Enterprise-grade parallel data warehouse for large-scale analytics and business intelligence

Analytics·1984·proprietary·C, C++
196TigerGraph

High-performance native graph analytics platform for AI, fraud detection, and real-time insights on connected data

Graph·2017·proprietary·C++
197Transwarp ArgoDB

Distributed analytical database replacing Hadoop+MPP with unified SQL analytics and real-time data processing

Analytics·proprietary
198Transwarp Hippo

Enterprise cloud-native distributed vector database with GPU acceleration and multi-model support

Vector·proprietary
199Transwarp KunDB

Financial-grade distributed relational database with strong consistency and Oracle/MySQL compatibility

Relational·proprietary
200Transwarp StellarDB

Enterprise distributed graph database with native graph storage and deep link analysis at PB scale

Graph·proprietary
201turbopuffer

Serverless vector and full-text search database built on object storage for low-cost high-scale workloads

Vector·2024·Commercial·Rust
202Ultipa

Ultra-high-performance 4th-generation graph database with deep traversal and GQL compliance

Graph·2019·proprietary·C++
203Vertica

High-performance columnar analytics database for petabyte-scale real-time analytics and machine learning

Analytics·2005·Proprietary·C++
204VMware Tanzu GemFire

Enterprise-grade distributed in-memory data grid for sub-millisecond, low-latency applications

Key-Value·2002·proprietary·Java
205XtremeData

Cloud-scale parallel SQL analytics engine with vectorized execution for complex analytical workloads

Analytics·2005·proprietary·C++
206Yellowbrick

Modern enterprise cloud data warehouse built on Kubernetes for extreme speed and concurrency

Analytics·2014·proprietary·C++

What is a Distributed Database?

A distributed database spreads data across multiple machines (nodes) that work together as a single logical system. Data is partitioned (sharded) and replicated across nodes to provide horizontal scalability, fault tolerance, and geographic distribution. When one node fails, replicas on other nodes continue serving requests. Distributed databases address the fundamental limitation of single-server databases: they scale beyond what one machine can handle. They span multiple data models: distributed SQL (CockroachDB, TiDB, YugabyteDB), distributed wide-column stores (Cassandra, ScyllaDB), distributed key-value stores (etcd, FoundationDB), and distributed document stores (MongoDB with sharding).
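As a rough illustration of how partitioning and replication fit together, the sketch below hashes a key to a shard and then places that shard on several nodes. The function names, node names, and the modulo placement scheme are assumptions made for this example; they are not taken from any database listed here.

```python
import hashlib


def shard_for(key: str, num_shards: int) -> int:
    """Map a key to a shard using a stable hash (Python's built-in hash() is randomized per process)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def replicas_for(shard: int, nodes: list[str], replication_factor: int) -> list[str]:
    """Place a shard on `replication_factor` consecutive nodes, wrapping around the node list."""
    if replication_factor > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    return [nodes[(shard + i) % len(nodes)] for i in range(replication_factor)]


# Hypothetical four-node cluster with eight shards and three copies of each shard.
nodes = ["node-a", "node-b", "node-c", "node-d"]
shard = shard_for("user:42", num_shards=8)
owners = replicas_for(shard, nodes, replication_factor=3)
print(f"key 'user:42' -> shard {shard} stored on {owners}")
```

With three copies of every shard, losing any single node still leaves two replicas able to serve the key. Simple modulo placement reshuffles most keys when the shard count changes, which is why production systems tend to use consistent hashing or range splitting instead.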

When to Use a Distributed Database

Use a distributed database when a single server can't meet your requirements for storage capacity, write throughput, read scalability, or availability. Typical scenarios include applications serving users across multiple regions with low-latency requirements, workloads needing 99.999% uptime, datasets too large for a single machine, and write-heavy systems that exceed single-node throughput. Distributed SQL databases (CockroachDB, TiDB) offer familiar SQL with automatic sharding. Distributed NoSQL databases such as Cassandra scale write throughput by adding nodes. Prefer a single-node database when your data fits on one machine: it is simpler to operate, debug, and reason about.
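To make an uptime target such as 99.999% concrete, it helps to convert it into a downtime budget. The helper below is a small sketch of that arithmetic; the function name and the 365-day year are assumptions for illustration.

```python
def allowed_downtime_minutes(availability_percent: float, days: float = 365.0) -> float:
    """Return the downtime budget, in minutes, for an availability target over `days` days."""
    total_minutes = days * 24 * 60
    return total_minutes * (1 - availability_percent / 100)


for target in (99.9, 99.99, 99.999):
    print(f"{target}% uptime allows {allowed_downtime_minutes(target):.2f} minutes of downtime per year")
```

At 99.999% the budget is a little over five minutes per year, which is hard to meet if recovering from a failure involves a manual failover or a restart of a single server.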

Frequently Asked Questions

What is the difference between a distributed database and a regular database?
A regular (centralized) database runs on a single server — all data lives on one machine. A distributed database runs across multiple servers, with data partitioned and replicated across them. Distributed databases offer horizontal scalability (add more nodes to handle more load), fault tolerance (survive node failures), and geographic distribution (serve users from nearby nodes). The tradeoff is operational complexity — distributed systems are harder to deploy, monitor, debug, and reason about than single-node databases.
What is the CAP theorem?
The CAP theorem states that a distributed system can provide at most two of three guarantees simultaneously: Consistency (every read returns the latest write), Availability (every request gets a response), and Partition tolerance (the system keeps working when the network drops or delays messages between nodes). Since network partitions cannot be ruled out, the real choice is between consistency and availability while a partition lasts. CockroachDB and YugabyteDB choose consistency (CP). Cassandra and DynamoDB favor availability (AP) by default. In practice, several databases, including Cassandra and DynamoDB, offer tunable consistency, so you can adjust the tradeoff per request.
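Tunable consistency is often explained with the quorum rule: with N replicas, a write acknowledged by W of them and a read that contacts R of them are guaranteed to share at least one replica when R + W > N. The sketch below checks that condition; the function name is hypothetical and the model ignores details such as read repair and clock skew.

```python
def quorums_overlap(n: int, w: int, r: int) -> bool:
    """Return True when every read set of size r intersects every write set of size w among n replicas."""
    if not (1 <= w <= n and 1 <= r <= n):
        raise ValueError("w and r must be between 1 and n")
    return r + w > n


# Hypothetical settings for a three-replica cluster.
print(quorums_overlap(n=3, w=2, r=2))  # quorum writes and quorum reads
print(quorums_overlap(n=3, w=1, r=1))  # fastest setting, reads may miss the latest write
print(quorums_overlap(n=3, w=3, r=1))  # write to all replicas, read from any one
```

Lower values of W and R reduce latency and keep the system answering when replicas are unreachable, at the cost of possibly stale reads. That is the consistency versus availability tradeoff expressed as a per-request setting.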
What is the difference between CockroachDB and TiDB?
Both are distributed SQL databases but with different compatibility targets. CockroachDB is PostgreSQL wire-compatible: existing Postgres drivers and many Postgres queries work unchanged. TiDB is MySQL wire-compatible: existing MySQL applications can migrate with minimal changes. Both replicate data using the Raft consensus protocol. CockroachDB is written in Go and runs as a single binary on every node. TiDB separates compute (the TiDB SQL layer, written in Go) from storage (TiKV, written in Rust and built on RocksDB). Both support horizontal scaling and ACID transactions across distributed nodes.
Is MongoDB a distributed database?
MongoDB supports distribution through built-in sharding: you can partition data across multiple servers based on a shard key. However, MongoDB is not sharded by default. A standalone MongoDB deployment is a standard centralized database, and a replica set adds redundancy but keeps the full dataset on every data-bearing member. You enable partitioning by configuring a sharded cluster with config servers, shards, and mongos routers. Purpose-built distributed databases like CockroachDB, TiDB, and Cassandra are distributed from the ground up and partition data automatically.
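The idea of routing by shard key can be sketched with a sorted table of chunk boundaries, where each chunk covers a contiguous range of key values and belongs to one shard. This is a simplified model of range-based routing and not MongoDB's actual router code; the chunk boundaries, shard names, and `route` function are hypothetical.

```python
from bisect import bisect_right

# Hypothetical chunk table: chunk i covers keys from chunk_lower_bounds[i]
# (inclusive) up to chunk_lower_bounds[i + 1] (exclusive).
chunk_lower_bounds = ["", "g", "n", "t"]  # must stay sorted; "" is the minimum key
chunk_owners = ["shard0", "shard1", "shard2", "shard0"]


def route(shard_key: str) -> str:
    """Return the shard that owns the chunk containing `shard_key`."""
    index = bisect_right(chunk_lower_bounds, shard_key) - 1
    return chunk_owners[index]


for key in ("alice", "mike", "nina", "zoe"):
    print(f"{key} -> {route(key)}")
```

A query that includes the shard key can be sent to a single shard this way, while a query without it has to be broadcast to every shard, which is why the choice of shard key matters so much in a sharded cluster.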
Do I need a distributed database?
Most applications don't. A well-tuned single-node PostgreSQL or MySQL instance can handle hundreds of millions of rows and, with connection pooling, thousands of concurrent clients. You need a distributed database when your dataset exceeds single-server storage, your write throughput exceeds single-server capacity, you need multi-region low-latency access, or you require 99.999% or higher uptime. If your data fits on one machine and you're not serving users globally, a single-node database with read replicas is simpler and often faster than a distributed system.
