GitHub - agilestacks/korral: Enumerate Kubernetes cluster to determine cloud costs

Kubernetes cluster cost metrics

Korral collects Kubernetes cluster cost metrics and provides them to Prometheus. Currently, on AWS, GCP, and Azure.

The exported metrics structure adheres to Prometheus best practices where metrics are exposed on fine granularity level, then aggregation is performed in Prometheus. There are two facets though to help users write simpler queries:

Cluster level where cost is provided on cluster object level: node, node volumes, load-balancers, etc.;
Pod level where cost is per pod and includes pod volumes. Costs that cannot be reliably attributed to a specific pod is amortized across all pods (in a namespace, on a node). For example, ingress controller load-balancer and it's egress traffic cost or node boot volume cost.

In both cases, the metrics are measured in USD per hour. The Sum of all metrics in a facet should add up to the total cluster cost, modulo rounding errors. Note that orphan volumes costs are not included in Pod level facet.

Cluster level

korral_cluster_node_cost_per_hour_dollars - cluster node cost without cost of attached volumes, split by node tag
korral_cluster_node_volumes_cost_per_hour_dollars - cluster node attached volumes cost, spilt by node tag; this includes Kubernetes volumes attached to the node and node boot volume
korral_cluster_loadbalancer_cost_per_hour_dollars - cluster loadbalancer cost, split by hostname
korral_cluster_loadbalancer_traffic_cost_per_hour_dollars - cluster loadbalancer ingress/egress traffic and LCUs cost, split by hostname
korral_cluster_orphaned_volumes_cost_per_hour_dollars - cluster volumes that exist but not used if any, split by claim_namespace, claim tags if corresponding PVC exists
korral_cluster_k8s_cost_per_hour_dollars cluster cloud provider cost if any, ie. $0.10 per hour for EKS cluster; 0 is reported if there is no additional cost.

Pod level

korral_cluster_pod_cost_per_hour_dollars - pod cost without cost of attached volumes, split by name, pod_namespace, node tags
korral_cluster_pod_volumes_cost_per_hour_dollars - pod volumes cost if any, split by name, pod_namespace, node tags.

Additionally, if --labels= flag is specified (the default is pod_owner,release,app.kubernetes.io/name), then each pod metric get the requested labels copied from the pod. pod_owner is a special case - the collector will traverse Kubernetes resource hierarchy to determine top-most controller resource name to assign to the label (name of deployment, statefulset, etc.). If no label set on the pod/deployment then '(none)' will be set as label value to simplify Prometheus queries.

The cost model makes a few arbitrary assumptions:

A sum of pod containers resources.requests is used to determine pod share of node total cost. If no requests are available, then limits are used, else { cpu: '100m', memory: '32Mi' }. RAM cost is 23% of instance cost; this is more or less true for AWS General Purpose instance types. Thus the cost will change as pods are rescheduled.
Cost of node volumes that are not Kubernetes volumes is amortized across pods on that particular node.
Load-balancer cost is spread across namespace pods evenly. There should be at least one pod.
Cluster cloud provider cost (if any, EKS $0.10) is spread across all pods.
Only Running pods are counted.
Orphan volumes costs are not attributed to any pod.

Test drive

In case you have Node.js installed, do npm install and then run korral:

./korral print

Cluster:
    Total:   0.19298 USD per hour
    Nodes:   0.0459
    Volumes: 0.02208
             0.00125 Kubernetes
             0.02082 boot
    ELBs:    0.025
    K8s:     0.1

Set KUBECONFIG and/or supply --context= to change cluster. Call ./korral help for details.

If you have Docker instead, you may want to try something along these lines:

docker run --rm \
    -v ${KUBECONFIG:-$HOME/.kube/config}:/kubeconfig -e KUBECONFIG=/kubeconfig \
    agilestacks/korral print

You must map your cloud credentials into the container, ie. AWS_*, GOOGLE_APPLICATION_CREDENTIALS, or AZURE_* vars. No aws-iam-authenticator nor AWS CLI is present in the image so for EKS it's easier to start on Node.js path.

Installation

install/kubernetes.yaml configures service account with restricted privileges and installs the deployment. install/prometheus-servicemonitor.yaml installs Prometheus Operator ServiceMonitor custom resource.

kubect create namespace monitoring
kubectl apply -f install/kubernetes.yaml
kubectl apply -f install/prometheus-servicemonitor.yaml

Installed on the cloud-native Kubernetes (EKS, GKE, AKS) it will automatically determine cloud API to use. If you have your own Kubernetes flavor, please add --cloud=aws|gcp|azure to deployment args.

The default scrape timeout for Prometheus is 10 seconds. If your exporter can be expected to exceed this, you should explicitly call this out in your user documentation.

Installed Prometheus ServiceMonitor custom resource configures the timeout to 20sec. You may want to change that.

Manual Prometheus configuration

In case Prometheus is somewhere else, then use following config:

- job_name: korral/test.dev.superhub.io
  scrape_interval: 5m
  scrape_timeout: 20s
  metrics_path: /api/v1/namespaces/monitoring/services/korral:9897/proxy/metrics
  scheme: https
  tls_config:
    insecure_skip_verify: true
  bearer_token: <korral service account token>
  static_configs:
  - targets:
    - 62c3ef7c6ad39c6c6f8ea17d4557f3f7.gr7.us-east-2.eks.amazonaws.com:443
    labels:
      domain: test.dev.superhub.io

Grafana

grafana/dashboard.json is Grafana dashboard model. In Grafana, create new dashboard, visit Dashboard settings, then JSON Model; paste the JSON and Save Changes. Or press on + in sidebar and choose Import.

Fiber is Korral Operator

For multi-cluster deployment with centralized Prometheus (operated by Prometheus Operator) you may want to use Fiber.

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
grafana		grafana
install		install
src		src
templates		templates
.dockerignore		.dockerignore
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
hub-component.yaml		hub-component.yaml
korral		korral
package-lock.json		package-lock.json
package.json		package.json
prometheus.png		prometheus.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kubernetes cluster cost metrics

Cluster level

Pod level

Test drive

Installation

Manual Prometheus configuration

Grafana

Fiber is Korral Operator

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

agilestacks/korral

Folders and files

Latest commit

History

Repository files navigation

Kubernetes cluster cost metrics

Cluster level

Pod level

Test drive

Installation

Manual Prometheus configuration

Grafana

Fiber is Korral Operator

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages