Deployment Patterns
This guide covers different Dgraph deployment patterns, from simple development setups to production-grade highly available clusters.
Pattern Selection Guide
| Pattern | Use Case | Nodes | HA | Sharding |
|---|---|---|---|---|
| Basic | Dev/Test environments, non-critical production | 1 Zero, 1 Alpha | ❌ | ❌ |
| HA | Production, <1TB | 3 Zeros, 3 Alphas | ✅ | ❌ |
| Distributed | Dev/Test environments for large datasets | 1 Zero, 2+ Alphas | ❌ | ✅ |
| Distributed HA | Large production, >10TB | 3 Zeros, 6+ Alphas | ✅ | ✅ |
Getting Started? For first-time users and local development, see the Learning Environment guide, which covers Docker standalone and Docker Compose setups with Ratel UI.
Basic cluster
Best for: Development teams, staging environments, CI/CD
Architecture
┌──────────────┐
│ Dgraph Zero │ :5080, :6080
└──────┬───────┘
│
┌──────▼───────┐
│ Dgraph Alpha │ :7080, :8080, :9080
└──────────────┘
Refer to the Basic Cluster instructions for full setup steps.
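The whole pattern is just two processes. A minimal single-host sketch, using the same flags documented in the reference table below:
# Start Zero, the cluster coordinator
dgraph zero --my=localhost:5080
# Start Alpha, the data node, pointing it at Zero
dgraph alpha --my=localhost:7080 --zero=localhost:5080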
HA cluster
Best for: Production workloads up to ~1TB
Architecture
Zero Cluster (3 nodes) - Raft Group 0
├─ Zero 1 :5080 (Leader)
├─ Zero 2 :5080 (Follower)
└─ Zero 3 :5080 (Follower)
│
Alpha Group 1 (3 replicas) - Raft Group 1
├─ Alpha 1 :7080, :8080, :9080 (Leader)
├─ Alpha 2 :7080, :8080, :9080 (Follower)
└─ Alpha 3 :7080, :8080, :9080 (Follower)
Setup Steps
1. Start Zero Cluster:
# Zero 1 (on host1) - First Zero initializes the cluster
dgraph zero --my=host1:5080 --raft "idx=1" --replicas=3
# Zero 2 (on host2) - Uses --peer to join existing cluster
dgraph zero --my=host2:5080 --raft "idx=2" --peer=host1:5080
# Zero 3 (on host3) - Uses --peer to join existing cluster
dgraph zero --my=host3:5080 --raft "idx=3" --peer=host1:5080
Important Notes:
- Raft IDs: Each Zero node must have a unique Raft ID set via --raft "idx=N". Dgraph does not auto-assign Raft IDs to Zero nodes.
- Cluster Initialization: The first Zero node starts the cluster. All subsequent Zero nodes must use --peer=<first-zero-address> to join the existing cluster. If --peer is omitted, a new independent cluster is created.
- Replication: The --replicas=3 flag on Zero controls how many Alpha replicas each Alpha group will have.
2. Start Alpha Cluster:
# Alpha 1 (on host1)
dgraph alpha --my=host1:7080 --zero=host1:5080,host2:5080,host3:5080
# Alpha 2 (on host2)
dgraph alpha --my=host2:7080 --zero=host1:5080,host2:5080,host3:5080
# Alpha 3 (on host3)
dgraph alpha --my=host3:7080 --zero=host1:5080,host2:5080,host3:5080
Important Notes:
- Zero Connection: Alphas can connect to any Zero in the cluster; list all Zeros for redundancy.
- Group Assignment: Zero automatically assigns Alphas to groups based on the --replicas setting. With --replicas=3, the first 3 Alphas join Group 1.
- Alpha Raft IDs: Unlike Zero nodes, Alpha nodes receive their Raft IDs automatically from Zero.
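Once both tiers are up, membership can be verified through any Zero's HTTP port (6080); the exact JSON layout varies by version:
# Dump cluster state: Zero members, Alpha groups, and the predicates (tablets) each group serves
curl -s host1:6080/state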
Kubernetes (Helm)
helm repo add dgraph https://charts.dgraph.io
helm install my-dgraph dgraph/dgraph \
--set zero.replicaCount=3 \
--set alpha.replicaCount=3
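After the install, a quick sanity check is to confirm the pods and probe Alpha's health endpoint. The service name below is an assumption based on the my-dgraph release name; verify with kubectl get svc:
# All 3 Zero and 3 Alpha pods should reach Running state
kubectl get pods
# Forward Alpha's HTTP port locally and run a health check
kubectl port-forward svc/my-dgraph-dgraph-alpha 8080:8080 &
curl -s localhost:8080/health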
Characteristics:
- Tolerates 1 node failure (in each group)
- All data replicated 3x
- No sharding (all predicates on all Alphas)
- Suitable for datasets up to ~1TB
Pros: High availability, automatic failover
Cons: Storage scales vertically only
Distributed (Multi-Group) - Basic
Sharding, No HA
Best for: Development with large datasets (>10TB)
Architecture
┌──────────────┐
│ Dgraph Zero │ :5080
└──────┬───────┘
│
├─ Group 1: Alpha 1 :7080
├─ Group 2: Alpha 2 :7081 (port offset)
└─ Group 3: Alpha 3 :7082 (port offset)
Setup (Single Host with Port Offsets)
# Start Zero with replicas=1 (no replication)
dgraph zero --my=localhost:5080 --replicas=1
# Start Alpha nodes with port offsets
dgraph alpha --my=localhost:7080 --zero=localhost:5080 -p data/p1 -w data/w1
dgraph alpha --my=localhost:7081 --zero=localhost:5080 -p data/p2 -w data/w2 -o 1
dgraph alpha --my=localhost:7082 --zero=localhost:5080 -p data/p3 -w data/w3 -o 2
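To verify the sharding, query Zero's /state endpoint; each Alpha should appear as the sole member of its own group (the JSON shape may differ slightly between versions):
# Expect three groups with one member each; the tablets map shows which predicates each group serves
curl -s localhost:6080/state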
Characteristics:
- 3 Alpha groups (no replication within groups)
- Data sharded by predicate across groups
- Horizontal storage scaling
- No fault tolerance
Pros: Horizontal scalability, handles large datasets
Cons: No HA; any Alpha failure makes its group's shard unavailable and risks data loss
Distributed - HA (Production Large-Scale)
Best for: Production workloads >10TB, high traffic, mission-critical
Architecture
Zero Cluster (3 nodes)
└─ Replicates cluster metadata
│
├─ Group 1: Alpha 1,2,3 (3 replicas)
│ └─ Predicates: name, age, email
│
├─ Group 2: Alpha 4,5,6 (3 replicas)
│ └─ Predicates: friend, follows
│
└─ Group 3: Alpha 7,8,9 (3 replicas)
└─ Predicates: location, company
Setup (9 Alpha Nodes across 3 Hosts)
Zeros (3 nodes):
# Host 1: Zero 1
dgraph zero --my=host1:5080 --raft "idx=1" --replicas=3
# Host 2: Zero 2
dgraph zero --my=host2:5080 --raft "idx=2" --peer=host1:5080
# Host 3: Zero 3
dgraph zero --my=host3:5080 --raft "idx=3" --peer=host1:5080
Alphas (3 groups × 3 replicas = 9 nodes):
# Host 1: Alphas 1, 4, 7
dgraph alpha --my=host1:7080 --zero=host1:5080,host2:5080,host3:5080 -p p1 -w w1
dgraph alpha --my=host1:7081 --zero=host1:5080,host2:5080,host3:5080 -p p4 -w w4 -o 1
dgraph alpha --my=host1:7082 --zero=host1:5080,host2:5080,host3:5080 -p p7 -w w7 -o 2
# Host 2: Alphas 2, 5, 8
dgraph alpha --my=host2:7080 --zero=host1:5080,host2:5080,host3:5080 -p p2 -w w2
dgraph alpha --my=host2:7081 --zero=host1:5080,host2:5080,host3:5080 -p p5 -w w5 -o 1
dgraph alpha --my=host2:7082 --zero=host1:5080,host2:5080,host3:5080 -p p8 -w w8 -o 2
# Host 3: Alphas 3, 6, 9
dgraph alpha --my=host3:7080 --zero=host1:5080,host2:5080,host3:5080 -p p3 -w w3
dgraph alpha --my=host3:7081 --zero=host1:5080,host2:5080,host3:5080 -p p6 -w w6 -o 1
dgraph alpha --my=host3:7082 --zero=host1:5080,host2:5080,host3:5080 -p p9 -w w9 -o 2
Group Assignment:
- Zero automatically assigns Alphas 1,2,3 → Group 1
- Zero assigns Alphas 4,5,6 → Group 2
- Zero assigns Alphas 7,8,9 → Group 3
Characteristics:
- 3 groups with 3x replication each
- Tolerates 1 node failure per group
- Data sharded across groups
- All predicates replicated 3x within their group
Pros: Maximum scalability and availability
Cons: Higher operational complexity, more resources
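Zero balances tablets (predicates) across groups automatically, but a predicate can also be moved by hand via Zero's HTTP endpoint, for example to isolate a hot predicate. A sketch assuming default ports; endpoint details may vary by version:
# Ask Zero to move the friend predicate to Group 2; the move runs asynchronously
curl -s "host1:6080/moveTablet?tablet=friend&group=2"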
Configuration Flags Reference
Common Flags
| Flag | Component | Description | Default |
|---|---|---|---|
| --my | Zero/Alpha | Address:port that other nodes connect to | localhost:5080 (Zero), localhost:7080 (Alpha) |
| --zero | Alpha | Address(es) of Zero node(s) to connect to | Required |
| --peer | Zero | Address of existing Zero to join its cluster | None (creates new cluster if omitted) |
| --raft "idx=N" | Zero | Unique Raft ID for Zero node (required for HA) | 1 |
| --replicas | Zero | Number of Alpha replicas per group | 1 |
| -w / --wal | Zero/Alpha | Directory for write-ahead log entries | zw (Zero), w (Alpha) |
| -p / --postings | Alpha | Directory for data storage | p |
| --bindall | Zero/Alpha | Bind to 0.0.0.0 for network access | true |
| --v | Zero/Alpha | Log verbosity level (recommended: 2) | 0 |
Configuration Methods: Flags can be set via command-line arguments, environment variables, or configuration files. See Config for details.
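As an illustration of the config-file method, the HA Alpha invocation above could be captured in a YAML file whose keys mirror the flag names (a sketch; confirm the exact key spellings for your Dgraph version):
# alpha.yml: keys mirror the CLI flag names
my: host1:7080
zero: host1:5080,host2:5080,host3:5080
postings: p
wal: w
v: 2
Then start the node with dgraph alpha --config alpha.yml.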
Best Practices
Node Placement
- Different Physical Hosts: Run each replica on a separate machine
- Availability Zones: Distribute across 3 AZs when possible
- Network Latency: Keep inter-node latency <5ms for best performance
Resource Planning
| Deployment | CPUs/Node | RAM/Node | Disk/Node |
|---|---|---|---|
| Development | 2 cores | 4GB | 50GB |
| Small Production | 8 cores | 16GB | 250GB SSD |
| Large Production | 16 cores | 32GB | 1TB NVMe |
Scaling Strategy
Vertical First:
- Start with HA single-group (3 Alphas)
- Increase CPU/RAM per node as load grows
Horizontal When:
- Dataset >1TB
- Query latency increases despite vertical scaling
- Need to isolate hot predicates
Add 3 Alphas at a time to maintain the replication factor, as shown in the sketch below.
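For example, with --replicas=3 already set on the Zeros, starting three more Alphas brings a second group online; Zero assigns them to Group 2 automatically (hosts host4, host5, and host6 are placeholders):
# Alphas 4-6 form Group 2 once Group 1 already has 3 members
dgraph alpha --my=host4:7080 --zero=host1:5080,host2:5080,host3:5080 -p p -w w
dgraph alpha --my=host5:7080 --zero=host1:5080,host2:5080,host3:5080 -p p -w w
dgraph alpha --my=host6:7080 --zero=host1:5080,host2:5080,host3:5080 -p p -w w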
Deployment Checklist
Before production deployment:
- Set --replicas=3 on Zero nodes
- Configure persistent storage volumes
- Enable TLS for client connections
- Set up IP whitelisting for admin endpoints
- Configure monitoring (Prometheus/Grafana)
- Set up binary backups (Enterprise)
- Test failover scenarios
- Document cluster topology
- Plan capacity for 2x growth
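As a starting point for the TLS item, Dgraph bundles a certificate helper; a minimal sketch (flag spellings can differ between versions, so check dgraph cert --help):
# Generate a CA plus node certificates for the three hosts (written to the tls directory by default)
dgraph cert -n host1,host2,host3
# Generate a certificate for a named client
dgraph cert -c admin
# List the generated certificates
dgraph cert ls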