This tutorial shows you how to develop a native cloud Cassandra deployment on Kubernetes. In this instance, a custom Cassandra SeedProvider
enables Cassandra to discover new Cassandra nodes as they join the cluster.
Deploying stateful distributed applications, like Cassandra, within a clustered environment can be challenging. StatefulSets greatly simplify this process. Please read about StatefulSets for more information about the features used in this tutorial.
Cassandra Docker
The pods use the gcr.io/google-samples/cassandra:v12
image from Google’s container registry.
The docker is based on debian:jessie
and includes OpenJDK 8. This image includes a standard Cassandra installation from the Apache Debian repo. By using environment variables you can change values that are inserted into cassandra.yaml
.
ENV VAR | DEFAULT VALUE |
---|---|
CASSANDRA_CLUSTER_NAME | ‘Test Cluster’ |
CASSANDRA_NUM_TOKENS | 32 |
CASSANDRA_RPC_ADDRESS | 0.0.0.0 |
Service
.StatefulSet
to create a Cassandra ring.StatefulSet
.StatefulSet
.StatefulSet
and its Pods
.To complete this tutorial, you should already have a basic familiarity with Pods, Services, and StatefulSets. In addition, you should:
Install and Configure the kubectl
command line
Download cassandra-service.yaml and cassandra-statefulset.yaml
Have a supported Kubernetes Cluster running
Note: Please read the getting started guides if you do not already have a cluster.
Warning: Minikube defaults to 1024MB of memory and 1 CPU which results in an insufficient resource errors.
To avoid these errors, run minikube with:
minikube start --memory 5120 --cpus=4
A Kubernetes Service describes a set of Pods that perform the same task.
The following Service
is used for DNS lookups between Cassandra pods and clients within the Kubernetes Cluster.
cd
to the folder you saved the .yaml files.Service
to track all Cassandra StatefulSet Nodes from the following .yaml
file:kubectl create -f cassandra-service.yaml
cassandra-service.yaml
|
---|
|
Get the Cassandra Service
.
kubectl get svc cassandra
The response should be
NAME CLUSTER-IP EXTERNAL-IP PORT(S) AGE
cassandra None <none> 9042/TCP 45s
If anything else returns, the service was not successfully created. Read Debug Services for common issues.
The StatefulSet manifest, included below, creates a Cassandra ring that consists of three pods.
Note: This example uses the default provisioner for Minikube. Please update the following StatefulSet for the cloud you are working with.
.yaml
file:kubectl create -f cassandra-statefulset.yaml
cassandra-statefulset.yaml
|
---|
|
kubectl get statefulset cassandra
The response should be
NAME DESIRED CURRENT AGE
cassandra 3 0 13s
The StatefulSet resource deploys pods sequentially.
kubectl get pods -l="app=cassandra"
NAME READY STATUS RESTARTS AGE
cassandra-0 1/1 Running 0 1m
cassandra-1 0/1 ContainerCreating 0 8s
Note: It can take up to ten minutes for all three pods to deploy.
Once all pods are deployed, the same command returns:
kubectl get pods -l="app=cassandra"
NAME READY STATUS RESTARTS AGE
cassandra-0 1/1 Running 0 10m
cassandra-1 1/1 Running 0 9m
cassandra-2 1/1 Running 0 8m
kubectl exec cassandra-0 -- nodetool status
Datacenter: DC1-K8Demo
======================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
-- Address Load Tokens Owns (effective) Host ID Rack
UN 10.4.2.4 65.26 KiB 32 63.7% a9d27f81-6783-461d-8583-87de2589133e Rack1-K8Demo
UN 10.4.0.4 102.04 KiB 32 66.7% 5559a58c-8b03-47ad-bc32-c621708dc2e4 Rack1-K8Demo
UN 10.4.1.4 83.06 KiB 32 69.6% 9dce943c-581d-4c0e-9543-f519969cc805 Rack1-K8Demo
Use kubectl edit
to modify the size of of a Cassandra StatefulSet.
kubectl edit statefulset cassandra
This command opens an editor in your terminal. The line you need to change is Replicas
.
Note: The following sample is an excerpt of the StatefulSet file.
# Please edit the object below. Lines beginning with a '#' will be ignored,
# and an empty file will abort the edit. If an error occurs while saving this file will be
# reopened with the relevant failures.
#
apiVersion: apps/v1beta1
kind: StatefulSet
metadata:
creationTimestamp: 2016-08-13T18:40:58Z
generation: 1
labels:
app: cassandra
name: cassandra
namespace: default
resourceVersion: "323"
selfLink: /apis/apps/v1beta1/namespaces/default/statefulsets/cassandra
uid: 7a219483-6185-11e6-a910-42010a8a0fc0
spec:
replicas: 3
Increase the number of replicas to 4, and then save the manifest.
The StatefulSet now contains 4 pods.
Get the Cassandra StatefulSet to verify:
kubectl get statefulset cassandra
The response should be
NAME DESIRED CURRENT AGE
cassandra 4 4 36m
Deleting or scaling a StatefulSet down does not delete the volumes associated with the StatefulSet. This ensures safety first: your data is more valuable than an auto purge of all related StatefulSet resources.
Warning: Depending on the storage class and reclaim policy, deleting the Persistent Volume Claims may cause the associated volumes to also be deleted. Never assume you’ll be able to access data if its volume claims are deleted.
StatefulSet
:grace=$(kubectl get po cassandra-0 -o=jsonpath='{.spec.terminationGracePeriodSeconds}') \
&& kubectl delete statefulset -l app=cassandra \
&& echo "Sleeping $grace" \
&& sleep $grace \
&& kubectl delete pvc -l app=cassandra
Service
.kubectl delete service -l app=cassandra