# Distributed Tracing
Distributed tracing follows a single request as it travels through multiple services, showing where time is spent and which service is the bottleneck. It completes the “three pillars” of observability alongside metrics and logs.
## Why Tracing?

Metrics tell you something is slow. Logs tell you what happened in one service. Traces tell you the full story of a request across every service it touched.
```
User ──► API Gateway ──► Auth Service ──► User DB
              └──► Product Service ──► Cache
                          └──► Product DB
```

Without tracing, debugging “why is this API call slow?” means correlating timestamps across logs from 5+ services. With tracing, you get one view that shows the exact latency of every hop.
## Core Concepts

A span represents a single unit of work — an HTTP handler, a database query, a gRPC call. Each span records:
| Field | What It Stores |
|---|---|
| Trace ID | Unique ID shared by all spans in the same request |
| Span ID | Unique ID for this specific span |
| Parent Span ID | The span that triggered this one (builds the tree) |
| Operation name | What the span represents (e.g. GET /api/users) |
| Start / end time | When the operation started and finished |
| Attributes | Key-value pairs (http.status_code=200, db.system=postgresql) |
| Status | OK, Error, or Unset |
| Events | Timestamped annotations (e.g. “cache miss at 14ms”) |
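The fields in the table map naturally onto a plain record type. A minimal illustrative model in Python — not a real tracing SDK, just the fields above as a dataclass:

```python
# Illustrative span record matching the fields above — not a real tracing SDK.
from dataclasses import dataclass, field
from typing import Optional
import secrets
import time

@dataclass
class Span:
    trace_id: str                    # shared by all spans in the same request
    span_id: str                     # unique to this span
    parent_span_id: Optional[str]    # None for the root span
    name: str                        # operation name, e.g. "GET /api/users"
    start: float
    end: Optional[float] = None
    attributes: dict = field(default_factory=dict)
    status: str = "Unset"            # OK, Error, or Unset
    events: list = field(default_factory=list)

root = Span(
    trace_id=secrets.token_hex(16),  # 128-bit trace ID, hex-encoded
    span_id=secrets.token_hex(8),    # 64-bit span ID
    parent_span_id=None,             # no parent → this is the root span
    name="GET /api/users",
    start=time.monotonic(),
)
root.attributes["http.status_code"] = 200
root.events.append((time.monotonic(), "cache miss"))
root.status = "OK"
root.end = time.monotonic()
```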
### Traces

A trace is a tree of spans — the root span is the entry point (e.g. the API gateway), and child spans are downstream calls:
```
Trace ID: abc123

[API Gateway]──────────────────────────────── 250ms
 ├─[Auth Service]──────── 40ms
 │   └─[User DB query]── 15ms
 └─[Product Service]──────────────── 180ms
     ├─[Cache lookup]── 2ms (cache miss)
     └─[Product DB query]────── 160ms  ← bottleneck
```

This immediately shows the Product DB query is the bottleneck.
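Because every span records its parent's ID, a flat list of span records is enough to rebuild this tree — and finding the bottleneck is mechanical: pick the slowest leaf span. A small sketch with made-up span IDs and the durations from the example trace above:

```python
# Flat span records: (span_id, parent_span_id, operation, duration_ms).
# The IDs are made up; durations come from the example trace above.
spans = [
    ("s1", None, "API Gateway", 250),
    ("s2", "s1", "Auth Service", 40),
    ("s3", "s2", "User DB query", 15),
    ("s4", "s1", "Product Service", 180),
    ("s5", "s4", "Cache lookup", 2),
    ("s6", "s4", "Product DB query", 160),
]

# Leaf spans are those that never appear as another span's parent.
parent_ids = {parent for _, parent, _, _ in spans if parent is not None}
leaves = [s for s in spans if s[0] not in parent_ids]

# The bottleneck is the slowest leaf.
bottleneck = max(leaves, key=lambda s: s[3])
print(bottleneck[2])  # → Product DB query
```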
## Context Propagation

For tracing to work across services, the trace ID must be passed between them. This is called context propagation.
How it works:
- Service A creates a span and generates a trace ID.
- Service A adds the trace ID to the outgoing HTTP headers.
- Service B reads the trace ID from the incoming headers.
- Service B creates a child span using the same trace ID.
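The four steps above can be sketched with plain dicts standing in for HTTP headers. Real services would use an OpenTelemetry propagator, but the mechanics are just string formatting — a stdlib-only sketch using the W3C `traceparent` layout:

```python
# W3C trace context propagation sketch — stdlib only; dicts stand in for headers.
import secrets

def inject(headers, trace_id, span_id, sampled=True):
    """Service A: write the trace context into outgoing HTTP headers."""
    flags = "01" if sampled else "00"
    headers["traceparent"] = f"00-{trace_id}-{span_id}-{flags}"

def extract(headers):
    """Service B: read the trace context from incoming headers."""
    _version, trace_id, parent_span_id, flags = headers["traceparent"].split("-")
    return trace_id, parent_span_id, flags == "01"

# Steps 1–2: Service A creates a span and injects its context.
trace_id = secrets.token_hex(16)   # 128-bit trace ID
span_a = secrets.token_hex(8)      # Service A's span ID
outgoing = {}
inject(outgoing, trace_id, span_a)

# Steps 3–4: Service B extracts the context and starts a child span
# that shares the trace ID and records span_a as its parent.
got_trace_id, parent_id, sampled = extract(outgoing)
span_b = secrets.token_hex(8)      # new span ID, same trace
assert got_trace_id == trace_id and parent_id == span_a and sampled
```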
Common propagation formats:
| Format | Header | Used By |
|---|---|---|
| W3C Trace Context (standard) | traceparent, tracestate | OpenTelemetry, most modern tools |
| B3 | X-B3-TraceId, X-B3-SpanId | Zipkin, older Jaeger |
| Jaeger | uber-trace-id | Jaeger native |
W3C Trace Context is the recommended standard. OpenTelemetry uses it by default.
Example `traceparent` header:

```
traceparent: 00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01
             │  │                                │                │
             │  │                                │                └─ sampled (01=yes)
             │  │                                └─ parent span ID
             │  └─ trace ID
             └─ version
```

## Sampling
In production, tracing every request generates enormous volumes. Sampling reduces this while keeping useful data.
### Sampling Strategies

| Strategy | How It Works | When to Use |
|---|---|---|
| Head-based | Decide at the start of the request whether to trace | Simple; low overhead |
| Tail-based | Collect all spans, then decide after the request completes | Keep errors and slow requests; discard healthy ones |
| Rate limiting | Trace N requests per second | Predictable volume |
| Probabilistic | Trace X% of requests | Simple; statistically representative |
Head-based is simpler but might miss interesting requests. Tail-based captures all errors and slow requests but requires a collector to buffer spans before deciding.
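Head-based sampling only works across services if every service makes the same decision for a given request; making the decision a pure function of the trace ID achieves that. A sketch in the spirit of OpenTelemetry's `TraceIdRatioBased` sampler (simplified, not its exact algorithm):

```python
# Deterministic head-based sampler: the decision depends only on the trace ID,
# so every service in the call chain samples (or drops) the same traces.
def should_sample(trace_id_hex: str, ratio: float) -> bool:
    # Treat the low 64 bits of the trace ID as a uniform random value
    # and sample when it falls below ratio * 2^64.
    low64 = int(trace_id_hex, 16) & ((1 << 64) - 1)
    return low64 < int(ratio * (1 << 64))

print(should_sample("0" * 32, 0.10))   # → True  (low IDs fall under the bound)
print(should_sample("f" * 32, 0.10))   # → False (high IDs are dropped)
```

Because trace IDs are generated uniformly at random, roughly `ratio` of all traces pass the check.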
```yaml
# OTel Collector tail-based sampling
processors:
  tail_sampling:
    policies:
      - name: errors
        type: status_code
        status_code: {status_codes: [ERROR]}
      - name: slow-requests
        type: latency
        latency: {threshold_ms: 1000}
      - name: sample-rest
        type: probabilistic
        probabilistic: {sampling_percentage: 10}
```

## Jaeger
Jaeger is an open-source, end-to-end distributed tracing system, originally built by Uber.
### Architecture

```
┌─────────┐     ┌──────────────┐     ┌──────────────┐     ┌────────────┐
│  Apps   │────►│    Jaeger    │────►│    Jaeger    │────►│  Storage   │
│ (OTel)  │     │  Collector   │     │   Ingester   │     │ (ES/       │
└─────────┘     └──────────────┘     │  (optional)  │     │  Cassandra)│
                                     └──────────────┘     └────────────┘
                ┌──────────────┐                                │
                │  Jaeger UI   │◄───────────────────────────────┘
                │   (query)    │
                └──────────────┘
```

### Quick Start (All-in-One)
```sh
# all-in-one image includes collector, query, and UI with in-memory storage
docker run -d --name jaeger \
  -p 16686:16686 \
  -p 4317:4317 \
  -p 4318:4318 \
  jaegertracing/all-in-one:latest
# ports: 16686 = Jaeger UI, 4317 = OTLP gRPC, 4318 = OTLP HTTP
```

Open http://localhost:16686 to access the Jaeger UI.
### Searching Traces

The Jaeger UI lets you:
- Search by service — Select a service and time range to see recent traces.
- Search by trace ID — Paste a trace ID directly (useful when you find it in a log line).
- Filter by tags — Find traces where `http.status_code=500` or `error=true`.
- Compare traces — Side-by-side comparison of two traces to spot differences.
## Grafana Tempo

Grafana Tempo is a high-scale, cost-effective trace backend that integrates natively with Grafana. Unlike Jaeger, Tempo stores traces in object storage (S3, GCS, Azure Blob) — no Elasticsearch or Cassandra needed.
### Why Tempo?

| Feature | Jaeger | Tempo |
|---|---|---|
| Storage | Elasticsearch, Cassandra | Object storage (S3, GCS) |
| Cost at scale | Higher (indexed storage) | Lower (no indexing, cheap storage) |
| Search | Full search by tags | Trace ID lookup + TraceQL |
| Grafana integration | Plugin | Native |
| Index | Full index of tags | Minimal index (by trace ID) |
Tempo trades full tag-based search for much lower storage costs. It compensates with:
- Trace ID lookup — Find a trace if you have its ID (from logs or metrics).
- TraceQL — A query language for searching traces by structure and attributes.
### TraceQL

TraceQL lets you search traces by span attributes, duration, and structure:
```
# Find traces where an HTTP span returned 500 and took > 1s
{ span.http.status_code = 500 && duration > 1s }

# Find traces that touched the "payments" service
{ resource.service.name = "payments" }

# Find traces where a database query was slow
{ span.db.system = "postgresql" && duration > 500ms }
```

### Tempo Architecture
```
┌──────────┐     ┌──────────────┐     ┌──────────────┐
│   Apps   │────►│    Tempo     │────►│    Object    │
│  (OTel)  │     │ Distributor  │     │   Storage    │
└──────────┘     └──────────────┘     │   (S3/GCS)   │
                                      └──────┬───────┘
                 ┌──────────────┐            │
                 │    Tempo     │◄───────────┘
                 │   Querier    │
                 └──────────────┘
                        ▲
                        │
                 ┌──────┴───────┐
                 │   Grafana    │
                 └──────────────┘
```

### Deploying Tempo
```yaml
# docker-compose snippet
tempo:
  image: grafana/tempo:latest
  command: ["-config.file=/etc/tempo.yaml"]
  volumes:
    - ./tempo.yaml:/etc/tempo.yaml
  ports:
    - "4317:4317"   # OTLP gRPC
    - "3200:3200"   # Tempo query API
```
```yaml
# tempo.yaml
server:
  http_listen_port: 3200

distributor:
  receivers:
    otlp:
      protocols:
        grpc:
          endpoint: 0.0.0.0:4317

storage:
  trace:
    backend: local   # use s3/gcs in production
    local:
      path: /tmp/tempo/blocks
    wal:
      path: /tmp/tempo/wal

metrics_generator:
  processor:
    service_graphs:
      enabled: true
    span_metrics:
      enabled: true
  storage:
    path: /tmp/tempo/generator/wal
    remote_write:
      - url: http://prometheus:9090/api/v1/write
```

The metrics_generator in Tempo can automatically create RED metrics (Rate, Errors, Duration) from traces and push them to Prometheus — bridging traces and metrics.
## Connecting Traces to Logs and Metrics

The real power of distributed tracing comes from correlating all three signals.
### Trace → Logs

Embed the trace ID in your log lines:
```python
import logging
from opentelemetry import trace

logger = logging.getLogger(__name__)

def handle_request():
    span = trace.get_current_span()
    trace_id = format(span.get_span_context().trace_id, '032x')
    logger.info("Processing request", extra={"trace_id": trace_id})
```

In Grafana, configure a derived field in Loki to make trace IDs clickable — clicking a trace ID in a log line jumps directly to the trace in Tempo.
```yaml
# Grafana Loki data source config — derived fields
derivedFields:
  - name: TraceID
    matcherRegex: "trace_id=(\\w+)"
    url: "${__data.fields.traceID}"
    datasourceUid: tempo
    urlDisplayLabel: "View Trace"
```

### Trace → Metrics (Exemplars)
Exemplars attach a trace ID to a specific metric data point. When you see a spike in latency on a Grafana dashboard, click the exemplar dot to jump to the exact trace that caused it.

```
                ● ← exemplar (trace_id=abc123)
   ┌────────────┤ p99
   │            │
   │   ────────┤ p50
   │            │
   └────────────┴──────────── time
```

In Prometheus, exemplars are stored alongside histogram buckets. Grafana displays them as dots on graphs.
### The Full Loop

```
Dashboard spike → click exemplar → open trace → see slow span →
click trace_id in logs → see error message → fix bug
```

This is the core value of distributed tracing in an observable system.
## Key Takeaways

- Distributed tracing follows requests across services — showing latency, dependencies, and bottlenecks as a span tree.
- Context propagation (W3C Trace Context headers) carries trace IDs between services.
- Sampling (head-based or tail-based) controls trace volume in production.
- Jaeger is a mature, full-featured tracing backend with tag search and a rich UI.
- Grafana Tempo uses cheap object storage and TraceQL for cost-effective tracing at scale.
- Correlating traces with logs and metrics (via trace IDs and exemplars) is what makes distributed tracing truly powerful.