
Loki

Published by Atif Alam

Grafana Loki is a log aggregation system designed to be cost-effective and easy to operate. It’s like Prometheus, but for logs — it indexes labels (metadata) instead of the full log content, making it much cheaper to run than Elasticsearch.

┌──────────┐   push logs   ┌──────────┐    query    ┌──────────┐
│ Promtail │──────────────►│   Loki   │◄────────────│ Grafana  │
│ (agent)  │               │ (store)  │             │   (UI)   │
└──────────┘               └──────────┘             └──────────┘
  1. Promtail (or another agent) reads log files, attaches labels, and pushes them to Loki.
  2. Loki stores the logs, indexing only the labels (not the content). Log content is stored compressed in chunks.
  3. Grafana queries Loki using LogQL and displays logs in the Explore panel or Log panels.
Compared with Elasticsearch:

|              | Loki                                                   | Elasticsearch                               |
|--------------|--------------------------------------------------------|---------------------------------------------|
| Indexing     | Labels only                                            | Full-text index                             |
| Storage cost | Low (compressed chunks)                                | High (full index)                           |
| Query speed  | Fast for label-filtered queries; grep-like for content | Fast for any full-text search               |
| Complexity   | Simple (few components)                                | Complex (cluster, shards, mappings)         |
| Best for     | Kubernetes/cloud-native logs with known label structure | Full-text search across diverse log formats |

Loki trades full-text search speed for much lower resource usage.

Labels are key-value pairs attached to log streams. They’re how you filter and query logs:

{app="my-app", namespace="production", pod="my-app-7b8f9c-x2k4d"}

Labels should have low cardinality (few distinct values):

{app="my-app"} # good — few apps
{namespace="production"} # good — few namespaces
{env="staging", region="us-east"} # good — few environments and regions

High-cardinality labels explode the index:

{user_id="abc123"} # bad — millions of users
{request_id="req-456"} # bad — unique per request
{ip="192.168.1.50"} # bad — thousands of IPs

Put high-cardinality data in the log line, not in labels. Query it with LogQL filters.
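The data stays queryable from the line itself. As a sketch, assuming the app emits JSON logs with a user_id field, you extract and filter it at query time instead of indexing it:

```logql
{app="my-app"} | json | user_id="abc123"
```

This scans only the streams selected by the low-cardinality label, so the index stays small while the field remains searchable.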

Promtail is the default agent that ships logs to Loki:

promtail-config.yml
server:
  http_listen_port: 9080

positions:
  filename: /tmp/positions.yaml  # tracks read position in log files

clients:
  - url: http://loki:3100/loki/api/v1/push

scrape_configs:
  - job_name: system
    static_configs:
      - targets: [localhost]
        labels:
          job: syslog
          host: web1
          __path__: /var/log/syslog
  - job_name: app
    static_configs:
      - targets: [localhost]
        labels:
          job: myapp
          __path__: /var/log/myapp/*.log

Promtail automatically discovers pods and attaches Kubernetes labels:

scrape_configs:
  - job_name: kubernetes-pods
    kubernetes_sd_configs:
      - role: pod
    pipeline_stages:
      - docker: {}
    relabel_configs:
      - source_labels: [__meta_kubernetes_pod_label_app]
        target_label: app
      - source_labels: [__meta_kubernetes_namespace]
        target_label: namespace
Other agents that can ship logs to Loki:

  • Grafana Alloy (successor to Promtail) — All-in-one collector for metrics, logs, traces.
  • Fluentd / Fluent Bit — With the Loki output plugin.
  • Vector — With the Loki sink.

LogQL is Loki’s query language, inspired by PromQL.

Filter by labels:

{app="my-app"}
{app="my-app", namespace="production"}
{app=~"my-app|api-gateway"} # regex match
{namespace!="development"} # not equal

Filter log content (like grep):

{app="my-app"} |= "error" # contains "error"
{app="my-app"} != "healthcheck" # does not contain
{app="my-app"} |~ "timeout|connection refused" # regex match
{app="my-app"} !~ "DEBUG|TRACE" # regex exclude

Chain filters:

{app="my-app"} |= "error" != "healthcheck" |~ "500|503"

Extract structured fields from log lines:

# JSON logs
{app="my-app"} | json | status_code >= 500
# Logfmt logs (key=value)
{app="my-app"} | logfmt | level="error"
# Regex extraction
{app="my-app"} | regexp `(?P<method>\w+) (?P<path>/\S+) (?P<status>\d+)`
| status >= 500
# Pattern (simpler than regex)
{app="my-app"} | pattern `<method> <path> <status>`
| status >= 500
{app="my-app"} | json | duration > 1s
{app="my-app"} | json | status_code = 500 | method = "POST"
{app="my-app"} | logfmt | level = "error" | line_format "{{.msg}}"

Turn logs into metrics:

# Count of error logs per minute
count_over_time({app="my-app"} |= "error" [1m])
# Rate of logs per second
rate({app="my-app"}[5m])
# Bytes rate
bytes_rate({app="my-app"}[5m])
# Average extracted duration (from JSON field)
avg_over_time({app="my-app"} | json | unwrap duration [5m])
# Quantile of extracted values
quantile_over_time(0.95, {app="my-app"} | json | unwrap duration [5m])
# Error count by app
sum by (app) (count_over_time({namespace="production"} |= "error" [5m]))
# Top 5 apps by log volume
topk(5, sum by (app) (bytes_rate({namespace="production"}[5m])))
To view logs in Grafana:

  1. Go to Explore (compass icon).
  2. Select the Loki data source.
  3. Write a LogQL query.
  4. View log lines with timestamps, labels, and content.
  5. Click a log line to expand and see all fields.

Add a Logs panel to any dashboard:

{app="my-app", namespace="production"} |= "error"

Combine with metric panels on the same dashboard — see error rate spikes alongside the actual error logs.

Structured logging means writing logs as key-value pairs (usually JSON) instead of free-form text. This makes logs far easier to query and parse in Loki.

Unstructured:

2026-02-16 10:23:45 ERROR Failed to process order 12345 for user abc - timeout after 5s

To extract the order ID, user, or duration, you need regex.
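As a sketch of that pain, pulling those fields out of the unstructured line takes a hand-written pattern (the regex below is illustrative; any wording change in the message breaks it):

```python
import re

line = "2026-02-16 10:23:45 ERROR Failed to process order 12345 for user abc - timeout after 5s"

# Fragile: tightly coupled to the exact message wording
pattern = r"order (?P<order_id>\d+) for user (?P<user>\w+) - \w+ after (?P<duration_s>\d+)s"
m = re.search(pattern, line)
fields = m.groupdict()
print(fields)  # {'order_id': '12345', 'user': 'abc', 'duration_s': '5'}
```

Note that everything comes back as strings, so numeric comparisons need a further conversion step.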

Structured (JSON):

{"ts":"2026-02-16T10:23:45Z","level":"error","msg":"Failed to process order","order_id":12345,"user":"abc","duration_s":5,"error":"timeout"}

LogQL can parse this instantly:

{app="orders"} | json | level="error" | duration_s > 3
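The same check in plain Python shows why: nothing beyond JSON parsing is involved (a minimal sketch):

```python
import json

line = '{"ts":"2026-02-16T10:23:45Z","level":"error","msg":"Failed to process order","order_id":12345,"user":"abc","duration_s":5,"error":"timeout"}'
fields = json.loads(line)

# Every field is directly addressable, with types preserved — no regex required
assert fields["level"] == "error" and fields["duration_s"] > 3
```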
Structured logging best practices:

| Practice                    | Why                                                                          |
|-----------------------------|------------------------------------------------------------------------------|
| Use JSON or logfmt format   | Parseable by LogQL without regex                                             |
| Include a level field       | Filter by severity (error, warn, info, debug)                                |
| Include a msg field         | Human-readable description of what happened                                  |
| Add request/trace IDs       | Correlate with distributed traces                                            |
| Add domain fields           | order_id, user_id, endpoint — searchable context                             |
| Don't log sensitive data    | No passwords, tokens, PII in plain text                                      |
| Use consistent field names  | duration_s everywhere, not dur / elapsed / time_ms in different services     |

Python (structlog):

import structlog

logger = structlog.get_logger()
logger.info(
    "order_processed",
    order_id=12345,
    user="abc",
    duration_s=0.45,
    status="success",
)
# With structlog's JSONRenderer configured, this emits:
# {"event":"order_processed","order_id":12345,"user":"abc","duration_s":0.45,"status":"success","timestamp":"2026-02-16T10:23:45Z"}

Go (zerolog):

log.Info().
    Int("order_id", 12345).
    Str("user", "abc").
    Float64("duration_s", 0.45).
    Msg("order processed")

Node.js (pino):

const logger = require('pino')();
logger.info({ order_id: 12345, user: 'abc', duration_s: 0.45 }, 'order processed');

A log pipeline processes logs between the source (your app) and the destination (Loki). Pipelines parse, transform, filter, and enrich logs before storage.

  • Parse unstructured logs into structured fields.
  • Enrich logs with metadata (Kubernetes labels, host info).
  • Filter noisy logs (healthchecks, debug lines) to reduce storage costs.
  • Redact sensitive data (emails, tokens) before storage.
  • Route different logs to different destinations or tenants.
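The redaction step, for instance, is at its core just a regex substitution applied to each line before it is shipped. A minimal Python sketch (the function name is illustrative, not a Loki or Promtail API):

```python
import re

# Matches common email address shapes
EMAIL_RE = re.compile(r"[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}")

def redact(line: str) -> str:
    """Mask email addresses before the line reaches storage."""
    return EMAIL_RE.sub("***REDACTED***", line)

print(redact("login ok for alice@example.com"))  # login ok for ***REDACTED***
```

Real pipeline engines run a chain of such transforms (parse, relabel, filter, redact) per line.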

Promtail has a built-in pipeline engine. Each scrape job can define pipeline_stages:

scrape_configs:
  - job_name: app
    static_configs:
      - targets: [localhost]
        labels:
          job: myapp
          __path__: /var/log/myapp/*.log
    pipeline_stages:
      # 1. Parse JSON logs
      - json:
          expressions:
            level: level
            msg: msg
            duration: duration_s
      # 2. Set 'level' as a label (low cardinality — only error/warn/info/debug)
      - labels:
          level:
      # 3. Filter out debug logs in production
      - match:
          selector: '{level="debug"}'
          action: drop
      # 4. Redact email addresses
      - replace:
          expression: '([a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,})'
          replace: '***REDACTED***'
      # 5. Add a timestamp from the log line
      - timestamp:
          source: ts
          format: RFC3339
Common pipeline stages:

| Stage     | Purpose                                | Example                                    |
|-----------|----------------------------------------|--------------------------------------------|
| json      | Parse JSON log lines                   | Extract level, msg, duration               |
| logfmt    | Parse logfmt (key=value) lines         | Extract level, caller                      |
| regex     | Parse with regex named groups          | Extract fields from unstructured logs      |
| labels    | Promote extracted fields to Loki labels | Set level as a label for filtering        |
| timestamp | Use a field as the log timestamp       | Align Loki timestamp with app timestamp    |
| output    | Rewrite the log line                   | Change what gets stored                    |
| match     | Conditionally apply stages             | Drop debug logs, route by content          |
| replace   | Regex find-and-replace                 | Redact sensitive data                      |
| drop      | Drop log lines entirely                | Remove healthcheck noise                   |
| tenant    | Set Loki tenant ID per line            | Multi-tenant log routing                   |
| multiline | Merge multi-line logs (stack traces)   | Combine Java exception lines into one entry |

Stack traces and multi-line exceptions need special handling — otherwise each line becomes a separate log entry:

pipeline_stages:
  - multiline:
      firstline: '^\d{4}-\d{2}-\d{2}'  # new log starts with a date
      max_wait_time: 3s
      max_lines: 128

This merges lines until the next log entry starts, keeping stack traces together.
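The merging logic itself is simple: any line that does not match the firstline pattern is appended to the previous entry. A minimal Python sketch of the idea (not the actual Promtail implementation, which also enforces max_wait_time and max_lines):

```python
import re

FIRSTLINE = re.compile(r"^\d{4}-\d{2}-\d{2}")  # a new entry starts with a date

def merge_multiline(lines):
    """Append continuation lines (e.g. stack trace frames) to the entry that started them."""
    entries = []
    for line in lines:
        if FIRSTLINE.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += "\n" + line
    return entries

logs = [
    "2026-02-16 10:23:45 ERROR unhandled exception",
    "Traceback (most recent call last):",
    '  File "app.py", line 10, in handler',
    "2026-02-16 10:23:46 INFO request ok",
]
print(len(merge_multiline(logs)))  # 2
```

The four input lines collapse into two log entries, with the traceback attached to the error line that produced it.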

Grafana Alloy (the successor to Promtail) uses a different config format but the same pipeline concepts:

loki.process "app_pipeline" {
  stage.json {
    expressions = { level = "level", msg = "msg" }
  }

  stage.labels {
    values = { level = "" }
  }

  stage.drop {
    expression = ".*healthcheck.*"
  }

  forward_to = [loki.write.default.receiver]
}

For complex pipelines or when you need to route logs to multiple destinations beyond Loki:

| Tool           | Strengths                                                                                    |
|----------------|----------------------------------------------------------------------------------------------|
| Vector         | High-performance Rust-based pipeline; transform, filter, route to Loki, Elasticsearch, S3, etc. |
| Fluent Bit     | Lightweight C-based; popular in Kubernetes; many output plugins                              |
| Fluentd        | Mature Ruby-based; huge plugin ecosystem; heavier than Fluent Bit                            |
| OTel Collector | Unified pipeline for metrics, logs, and traces via OpenTelemetry                             |

Example: Use Fluent Bit to collect, Vector to transform and route, Loki to store, Grafana to visualize.

Key takeaways:

  • Loki indexes labels only, not log content — cheap to run, but requires good label design.
  • Keep labels low cardinality (app, namespace, environment). Put user IDs, request IDs, etc. in the log line.
  • LogQL = stream selectors ({app="x"}) + line filters (|= "error") + parsers (| json) + metrics (count_over_time).
  • Use metric queries to turn logs into charts (error counts, log volume, extracted latencies).
  • Promtail (or Grafana Alloy) auto-discovers Kubernetes pods and attaches labels automatically.
  • Structured logging (JSON/logfmt) makes logs instantly parseable — no regex needed.
  • Log pipelines (Promtail stages, Alloy, Vector) parse, filter, redact, and route logs before they reach Loki.