Dashboard Stories

November 16, 2023

Prometheus dashboards for monitoring AWS EKS clusters

This EKS dashboard displays key metric data from an AWS EKS (Elastic Kubernetes Service) cluster using the Prometheus service so you can visualize resource utilization and health status.

Antrone Simmons

Customer Solutions Engineer, SquaredUp

Dashboard Preview

Create your free dashboard

Challenge

Engineers need to understand the status of microservices run on EKS, like health status of clusters and nodes, to avoid issues impacting business critical microservices. Plus, you need to be able to keep an eye on EKS resources, including whether the Kubernetes cluster has auto-scaled (where enabled).

Usually, to view these metrics, it requires looking at each EKS cluster and node group individually in the AWS Console, or via another complex third-party dashboarding tool. The data is siloed and difficult to consolidate.

Solution

With an EKS dashboard built in SquaredUp you can see the status of all EKS clusters and node groups at once due to the wide scope. Additionally, it allows for teams that might not have AWS Console access to view these metrics in an easily viewable format.

It’s also possible to place important EKS metrics on the same dashboard with microservice level metrics - surfaced with something like Prometheus - which allows engineers to view all the important metrics about a microservice without hopping between dashboards.

Create a SquaredUp account – free forever

EKS dashboard walk-through

This screenshot shows the top of the EKS dashboard created in SquaredUp. In the top right tile, we have the overall health status of any EKS clusters or node pools in the demo environment. The top two tiles would flag if the resources became unhealthy or unavailable. If any more nodes were created they would automatically be added to all metrics in this dashboard.

The next two tiles below this are ‘CPU Utilization per EKS Node’ and ‘Free Memory per EKS node’. Both metrics are fetched using standard PromQL queries, example:

sum by (pod) (rate(container_cpu_usage_seconds_total[5m]))&nbsp;

(node_memory_MemFree_bytes / node_memory_MemTotal_bytes) * 100

Scrolling down the dashboard, we then have some further metrics regarding storage and network.

Ephemeral EKS storage is important to understand as it can scale automatically if enabled. Therefore, I have surfaced the currently used storage, and total allocatable. These two can then be compared to understand how much available storage is remaining.

Additionally, total network utilization is useful to understand. If the hosted microservices are publicly available, this metric could be much higher at peak time and could explain performance impact.

Create your free dashboard

This Prometheus Elastic Kubernetes Service (EKS) dashboard is not available out of the box, but you can easily build something similar yourself using the Prometheus plugin.

Create your dashboard – free forever

Simply create a free account to get started, or check out this video to see how easy it is to use our Dashboard Designer:

To see what other dashboards you can create, including a Google Kubernetes Engine dashboard, check out our Dashboard Gallery.

Create a free SquaredUp account

Visualize over 60 data sources, including:

View all 80+ plugins

Continue learning

Guides

Getting started with AWS CloudWatch dashboards

See how to build an AWS CloudWatch dashboard for multiple accounts, regions, and tools. Try our flexible access control and notification features!

Dashboard Story

Google Kubernetes Engine (GKE) dashboard: Simple utilization and health summary

Visualize all your key metrics from any GKE clusters and node groups, and more with this Google Kubernetes Engine dashboard.

Blog

Instrumenting Node.js code with Prometheus custom metrics

How to instrument your Node.js code with custom Prometheus metrics using the prom-client package. Get a full walk through here

Blog

Three Ways to Run Prometheus

In this article I will show you how to get Prometheus up and running as a binary, a container running in Docker, and inside Kubernetes.

Dashboard Story

Monitoring critical AWS resources – API Gateway, ELB, Lambda, Route 53, S3, and EC2

Get an instant overview of your AWS environment with these 6 dashboards in SquaredUp. Add your own metrics to the customizable dashboards too

Dashboard Story

Is your disaster recovery plan ready for action? Monitoring AWS Backup status

This dashboard shows an overview of AWS Backup Plans. Select a specific plan from a dropdown and view the status of recent runs plus individual job details.

Dashboard Stories

November 16, 2023

Prometheus dashboards for monitoring AWS EKS clusters

This EKS dashboard displays key metric data from an AWS EKS (Elastic Kubernetes Service) cluster using the Prometheus service so you can visualize resource utilization and health status.

Antrone Simmons

Customer Solutions Engineer, SquaredUp

Dashboard Preview

Create your free dashboard

Challenge

Solution

Create a SquaredUp account – free forever

EKS dashboard walk-through

The next two tiles below this are ‘CPU Utilization per EKS Node’ and ‘Free Memory per EKS node’. Both metrics are fetched using standard PromQL queries, example:

sum by (pod) (rate(container_cpu_usage_seconds_total[5m]))&nbsp;

(node_memory_MemFree_bytes / node_memory_MemTotal_bytes) * 100

Scrolling down the dashboard, we then have some further metrics regarding storage and network.

Create your free dashboard

This Prometheus Elastic Kubernetes Service (EKS) dashboard is not available out of the box, but you can easily build something similar yourself using the Prometheus plugin.

Create your dashboard – free forever

Simply create a free account to get started, or check out this video to see how easy it is to use our Dashboard Designer:

To see what other dashboards you can create, including a Google Kubernetes Engine dashboard, check out our Dashboard Gallery.

Create a free SquaredUp account

Visualize over 60 data sources, including:

View all 80+ plugins