Troubleshoot Elastic Node Pools

If an Elastic Node Pool's nodes are not becoming ready or are otherwise stuck, consider the following steps.

Operations

The Operations tab shows recent or running operations accompanied by detailed logs. Check them first to see if something is going on or went wrong.

Inspect Nodes via kubectl

You can inspect the virtual machine instances backing your nodes using kubectl against the Fleet Cluster:

bash

# List all VMIs in the namespace
kubectl -n <namespace> get vmi

# Describe a specific node's VMI
kubectl -n <namespace> describe vmi node-<xyz>

WARNING

The Fleet Cluster is managed by the Elastic Fleet provider and may not be accessible to you. Contact your provider if you don't have access.

Node Console

If you need to debug a node directly, you can access its console using virtctl against the Fleet Cluster:

bash

virtctl -n <namespace> console node-<xyz>

To log in, use the Recovery User and Recovery Password shown in the Elastic Node Pool's Info tab under Linux. The recovery credentials are shared across all nodes in the pool.

Once logged in, useful commands to investigate:

bash

# Kubernetes service logs (Elastic Nodes do not run a machine agent)
journalctl -u kubelet
journalctl -u containerd

# Kernel messages (hardware, driver, boot issues)
dmesg | tail -100

# Network status (Flatcar uses systemd-networkd)
ip addr
ip route
networkctl status
journalctl -u systemd-networkd

Troubleshoot Elastic Node Pools ​

Operations ​

Inspect Nodes via kubectl ​

Node Console ​

Troubleshoot Elastic Node Pools

Operations

Inspect Nodes via kubectl

Node Console