Prometheus Adapter for HPA on EKS

First published May 18, 2026 by Atif Alam

Use prometheus-adapter when Horizontal Pod Autoscaler (HPA) should scale on application metrics (request rate, queue lag, business KPIs) defined in PromQL, not only CPU and memory.

Overview: Autoscaling on EKS. k3s lab: Prometheus Adapter HPA on k3s. For CloudWatch-native metrics, see Container Insights for HPA on EKS.

When to use

Use Prometheus Adapter	Consider KEDA instead
RPS, latency, in-cluster PromQL	SQS / Kafka / cron triggers
You already run Prometheus or AMP remote write	Minimal metrics plumbing for queues
Same metric feeds dashboards and HPA	Event source is outside Prometheus

Choose your Prometheus backend

                    ┌─────────────────────────────┐
                    │     prometheus-adapter      │
                    │   (custom.metrics.k8s.io)   │
                    └──────────────┬──────────────┘
                                   │ queries
           ┌───────────────────────┼───────────────────────┐
           ▼                       ▼                       ▼
   In-cluster Prometheus    Amazon Managed Prometheus   Remote Prometheus URL
   (Helm / operator)        (workspace + remote write)  (VPC endpoint / VPN)

Backend	This guide
In-cluster Prometheus (e.g. `kube-prometheus-stack`)	Primary walkthrough below
Amazon Managed Prometheus (AMP)	Alternative setup — AMP

You still need metrics-server if the same HPA mixes Resource metrics (CPU) with custom metrics.

Prerequisites

EKS cluster 1.23+ with kubectl configured.
Helm 3 installed.
Prometheus reachable from the adapter pod (in-cluster Service DNS or AMP query endpoint).
Cluster-admin permissions to install the adapter's APIService and ClusterRole.

Verify APIs:

kubectl get apiservice v1beta1.metrics.k8s.io
# After adapter install:
kubectl get apiservice v1beta1.custom.metrics.k8s.io

Install prometheus-adapter (in-cluster Prometheus)

Assume Prometheus from kube-prometheus-stack in namespace monitoring, Service monitoring-kube-prometheus-prometheus on port 9090. Adjust names to your release.

1. Rules ConfigMap

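Create adapter-rules.yaml with the rules below, then load it into a ConfigMap named adapter-rules in the monitoring namespace (the name the Helm values in step 2 reference), for example with kubectl create configmap adapter-rules -n monitoring --from-file=config.yaml=adapter-rules.yaml. The chart is expected to read the rules from a config.yaml key; check this against your chart version.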

rules:
  - seriesQuery: 'http_requests_total{namespace!="",pod!=""}'
    resources:
      overrides:
        namespace: { resource: "namespace" }
        pod: { resource: "pod" }
    name:
      matches: "^(.*)_total$"
      as: "${1}_per_second"
    metricsQuery: 'sum(rate(<<.Series>>{<<.LabelMatchers>>}[2m])) by (<<.GroupBy>>)'
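To make the rule easier to reason about, here is a minimal Python sketch of what it does to a series name and to the query template. It is not the adapter's code: the adapter uses Go regular expressions and Go templates, and it may generate different label matchers (for example a regex covering several pods). The pod name `sample-app-0` and both function names are hypothetical.

```python
import re

# Values copied from the rule above.
NAME_MATCHES = r"^(.*)_total$"
NAME_AS = r"\g<1>_per_second"  # Python spelling of the rule's "${1}_per_second"
METRICS_QUERY = "sum(rate(<<.Series>>{<<.LabelMatchers>>}[2m])) by (<<.GroupBy>>)"


def exposed_name(series: str) -> str:
    """Apply the rule's name.matches / name.as rewrite to a series name."""
    return re.sub(NAME_MATCHES, NAME_AS, series)


def expand_query(series: str, matchers: dict, group_by: str) -> str:
    """Fill the template placeholders by plain string substitution (illustrative only)."""
    label_matchers = ",".join(f'{key}="{value}"' for key, value in matchers.items())
    return (
        METRICS_QUERY.replace("<<.Series>>", series)
        .replace("<<.LabelMatchers>>", label_matchers)
        .replace("<<.GroupBy>>", group_by)
    )


if __name__ == "__main__":
    # Name the HPA must reference.
    print(exposed_name("http_requests_total"))
    # Approximate PromQL sent to Prometheus for one pod (hypothetical pod name).
    print(expand_query(
        "http_requests_total",
        {"namespace": "default", "pod": "sample-app-0"},
        "pod",
    ))
```

Pasting the expanded query into the Prometheus UI is a quick way to confirm the rule returns data before involving the HPA.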

2. Helm values

Create prometheus-adapter-values.yaml:

prometheus:
  url: http://monitoring-kube-prometheus-prometheus.monitoring.svc
  port: 9090

rules:
  existing: adapter-rules

replicas: 2

resources:
  requests:
    cpu: 100m
    memory: 128Mi

3. Install chart

helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm repo update

helm upgrade --install prometheus-adapter prometheus-community/prometheus-adapter \
  --namespace monitoring \
  --create-namespace \
  -f prometheus-adapter-values.yaml \
  --wait

4. Verify adapter and API

kubectl get pods -n monitoring -l app.kubernetes.io/name=prometheus-adapter
kubectl get apiservice v1beta1.custom.metrics.k8s.io

# List custom metrics (names vary with your rules)
kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1" | head -c 2000

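Expected: adapter pods Running; APIService Available; the raw API returns a resource list with entries such as pods/http_requests_per_second.

If any of these checks fail, inspect the adapter logs:

kubectl logs -n monitoring -l app.kubernetes.io/name=prometheus-adapter --tail=100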
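The raw discovery response is easier to scan once it is grouped by resource. The sketch below assumes the response is a standard `APIResourceList` whose entries are named `<resource>/<metric>`; the `SAMPLE` payload is a hand-written illustration, not captured output, and `metric_names` is a hypothetical helper.

```python
import json

# Hand-written example of the discovery payload shape (not real cluster output).
SAMPLE = """
{
  "kind": "APIResourceList",
  "apiVersion": "v1",
  "groupVersion": "custom.metrics.k8s.io/v1beta1",
  "resources": [
    {"name": "pods/http_requests_per_second", "namespaced": true,
     "kind": "MetricValueList", "verbs": ["get"]},
    {"name": "namespaces/http_requests_per_second", "namespaced": false,
     "kind": "MetricValueList", "verbs": ["get"]}
  ]
}
"""


def metric_names(discovery_json: str) -> dict:
    """Group metric names by the resource they attach to (pods, namespaces, ...)."""
    doc = json.loads(discovery_json)
    grouped = {}
    for resource_entry in doc.get("resources", []):
        resource, _, metric = resource_entry["name"].partition("/")
        grouped.setdefault(resource, []).append(metric)
    return grouped


if __name__ == "__main__":
    for resource, metrics in metric_names(SAMPLE).items():
        print(f"{resource}: {', '.join(sorted(metrics))}")
```

To use it against a cluster, replace `SAMPLE` with the full output of the `kubectl get --raw` command above. If the metric the HPA references is missing from the `pods` group, the rule's `seriesQuery` or `name` rewrite is the first thing to check.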

Sample app and HPA (Pods metric)

Deploy a workload that exports http_requests_total and is scraped by Prometheus. With kube-prometheus-stack, that usually means a ServiceMonitor labeled release: monitoring; see Observability setup.

Example HPA, saved as sample-app-hpa.yaml (the metric name must match the adapter rule output, e.g. http_requests_per_second):

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: sample-app-hpa
  namespace: default
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: sample-app
  minReplicas: 2
  maxReplicas: 10
  metrics:
    - type: Pods
      pods:
        metric:
          name: http_requests_per_second
        target:
          type: AverageValue
          averageValue: "100"

kubectl apply -f sample-app-hpa.yaml
kubectl describe hpa sample-app-hpa -n default
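To predict how this HPA will react, it helps to work through the scaling arithmetic by hand. The sketch below is a simplified model of the documented HPA formula, desired = ceil(currentReplicas × currentValue / targetValue), with the default 10% tolerance and the largest proposal winning when several metrics are configured. It ignores pod readiness, missing metrics, and stabilization behavior, and the function names are hypothetical.

```python
import math


def desired_for_metric(current_replicas: int, current_avg: float,
                       target_avg: float, tolerance: float = 0.1) -> int:
    """Replica count one metric asks for: ceil(replicas * current / target)."""
    ratio = current_avg / target_avg
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas  # close enough to the target: no change
    return math.ceil(current_replicas * ratio)


def desired_replicas(current_replicas: int, metrics: list,
                     min_replicas: int, max_replicas: int) -> int:
    """Take the largest per-metric proposal, then clamp to the HPA bounds.

    metrics is a list of (current_average, target_average) pairs.
    """
    proposals = [
        desired_for_metric(current_replicas, current, target)
        for current, target in metrics
    ]
    return max(min_replicas, min(max_replicas, max(proposals)))


if __name__ == "__main__":
    # 2 pods averaging 250 req/s each against averageValue "100": scale out.
    print(desired_replicas(2, [(250.0, 100.0)], min_replicas=2, max_replicas=10))
    # 8 pods averaging 50 req/s each: scale in.
    print(desired_replicas(8, [(50.0, 100.0)], min_replicas=2, max_replicas=10))
    # 4 pods averaging 700 req/s each: the proposal is capped by maxReplicas.
    print(desired_replicas(4, [(700.0, 100.0)], min_replicas=2, max_replicas=10))
```

The same max-of-proposals step applies when an HPA combines a CPU Resource metric with the custom metric, which is why one noisy metric can hold the replica count up.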

EKS specifics

Networking: If Prometheus runs in another VPC or you query AMP through a private endpoint, security groups and DNS must let adapter pods reach the query port (9090 for Prometheus, 443 for AMP).
IRSA: With in-cluster Prometheus, adapter pods usually need no AWS API access. Workloads that scrape AWS metrics may need it; keep that IAM role on the app, or use the Container Insights guide instead.
AMP: Set prometheus.url and prometheus.port to the workspace query endpoint, and make sure the adapter can both reach it (VPC endpoints) and authenticate. AMP expects SigV4-signed requests, so a SigV4 proxy with an IAM role that permits querying is typically needed.
High availability: Run 2+ adapter replicas; the APIService points at the adapter Service, so any healthy replica can serve a request.

Alternative setup: Amazon Managed Prometheus

Create an AMP workspace and configure remote write from in-cluster Prometheus or ADOT collectors.
Point prometheus-adapter prometheus.url at the AMP query endpoint for your workspace (see current AMP documentation).
Reuse the same rules ConfigMap; validate metric names exist in AMP with the AMP query editor.
Install adapter via Helm as above with updated prometheus-adapter-values.yaml.

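AMP networking and authentication differ when you use private VPC endpoints. Before wiring the HPA, confirm that a debug pod in the adapter's namespace can reach the query endpoint and that kubectl get --raw against the custom metrics API returns values.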
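One detail worth checking when pointing the chart at AMP is how the query endpoint maps onto Helm values, because the workspace URL carries a path. The sketch below assumes the chart joins `prometheus.url`, `prometheus.port`, and a `prometheus.path` value in that order; verify this against the values.yaml of your chart version. The region and the workspace ID `ws-EXAMPLE` are placeholders, the helper name is hypothetical, and SigV4 signing is not handled here.

```python
from urllib.parse import urlsplit


def split_for_chart(query_base: str) -> dict:
    """Split an endpoint base URL into url / port / path values.

    Assumption: the adapter chart exposes prometheus.url, prometheus.port,
    and prometheus.path and concatenates them in that order.
    """
    parts = urlsplit(query_base)
    default_port = 443 if parts.scheme == "https" else 80
    return {
        "url": f"{parts.scheme}://{parts.hostname}",
        "port": parts.port or default_port,
        "path": parts.path.rstrip("/"),
    }


if __name__ == "__main__":
    # Placeholder region and workspace ID; substitute your own.
    base = "https://aps-workspaces.us-east-1.amazonaws.com/workspaces/ws-EXAMPLE"
    for key, value in split_for_chart(base).items():
        print(f"prometheus.{key}: {value}")
```

Keeping the path out of `prometheus.url` avoids a port being appended after the workspace path when the chart builds the final URL.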

Troubleshooting

Symptom	Likely cause	Action
APIService not Available	Adapter crash, TLS, wrong Prometheus URL	Check adapter logs and `prometheus.url`
HPA `unable to get metrics`	Metric name mismatch	Compare `describe hpa` with `kubectl get --raw .../namespaces/NS/...`
Metric never appears	No series in Prometheus	Targets UP? PromQL returns data?
Forbidden on metrics API	RBAC	Adapter ClusterRole / APIService aggregation
Works in Prometheus, not HPA	Rule `seriesQuery` / `metricsQuery` wrong	Test PromQL; fix adapter rules

kubectl describe hpa <name> -n <namespace>
kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1/namespaces/<namespace>/pods/*/http_requests_per_second"
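The per-pod query above returns values as Kubernetes quantities (for example `1500m` for 1.5), which makes them easy to misread next to the HPA target. A minimal sketch for decoding them follows, assuming the response is a `MetricValueList` with `describedObject.name` and `value` fields. The payload and pod names are hand-written examples, only a few decimal suffixes are handled, and the helper names are hypothetical.

```python
import json
from decimal import Decimal

# Decimal exponents for the quantity suffixes this sketch understands.
_SUFFIX_EXPONENT = {"n": -9, "u": -6, "m": -3, "k": 3, "M": 6, "G": 9}

# Hand-written example payload (hypothetical pod names, not real output).
SAMPLE = """
{
  "kind": "MetricValueList",
  "apiVersion": "custom.metrics.k8s.io/v1beta1",
  "items": [
    {"describedObject": {"kind": "Pod", "namespace": "default", "name": "sample-app-aaa"},
     "metricName": "http_requests_per_second", "value": "150"},
    {"describedObject": {"kind": "Pod", "namespace": "default", "name": "sample-app-bbb"},
     "metricName": "http_requests_per_second", "value": "250500m"}
  ]
}
"""


def parse_quantity(quantity: str) -> float:
    """Convert a quantity such as '250500m' or '2k' to a float."""
    if quantity and quantity[-1] in _SUFFIX_EXPONENT:
        number = Decimal(quantity[:-1]).scaleb(_SUFFIX_EXPONENT[quantity[-1]])
        return float(number)
    return float(Decimal(quantity))


def per_pod_values(metric_list_json: str) -> dict:
    """Map pod name to the decoded metric value."""
    doc = json.loads(metric_list_json)
    return {
        item["describedObject"]["name"]: parse_quantity(item["value"])
        for item in doc.get("items", [])
    }


if __name__ == "__main__":
    values = per_pod_values(SAMPLE)
    for pod, value in values.items():
        print(f"{pod}: {value}")
    # The HPA's AverageValue target is compared against this average.
    print("average:", sum(values.values()) / len(values))
```

If the list comes back empty while the PromQL query returns data, the label-to-resource mapping in the rule's `resources.overrides` is a likely culprit.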

For on-call commands, see EKS troubleshooting cheat sheet — HPA.