OpenTelemetry

First PublishedFeb 16, 2026ByAtif Alam

OpenTelemetry (OTel) is an open-source, vendor-neutral framework for generating, collecting, and exporting telemetry data — metrics, logs, and traces — from your applications. It’s becoming the industry standard for instrumentation, backed by the CNCF.

Why OpenTelemetry?

Without OTel, you instrument your app separately for each backend:

Prometheus client library for metrics
A logging library for logs
A Jaeger/Zipkin SDK for traces

With OTel, you instrument once and export to any backend:

1
┌─────────────┐     OTel SDK      ┌──────────────┐     export     ┌──────────────┐
2
│  Your App   │──────────────────►│  OTel        │───────────────►│  Prometheus  │
3
│             │  metrics, logs,   │  Collector   │               │  Loki        │
4
│             │  traces           │              │               │  Jaeger/Tempo│
5
└─────────────┘                   └──────────────┘               └──────────────┘

Benefits

Vendor-neutral — Switch backends (Prometheus → Datadog, Jaeger → Tempo) without changing application code.
One SDK, three signals — Metrics, logs, and traces from a single library.
Auto-instrumentation — Get traces and metrics from HTTP frameworks, databases, and gRPC with zero code changes.
Correlation — Trace IDs link metrics, logs, and traces together so you can jump between signals.
CNCF standard — Wide industry adoption; all major vendors support it.

Core Concepts

Concept	What It Is
Signal	A type of telemetry: metrics, logs, or traces
Span	A single operation within a trace (e.g. an HTTP request, a database query)
Trace	A tree of spans representing an end-to-end request across services
Exporter	Sends telemetry to a backend (Prometheus, OTLP, Jaeger, etc.)
Collector	A standalone service that receives, processes, and exports telemetry
Context propagation	Passing trace IDs between services (via HTTP headers, gRPC metadata)
Resource	Metadata about where telemetry comes from (service name, version, host)

Instrumentation

Manual Instrumentation (Python Example)

1
from opentelemetry import trace, metrics
2
from opentelemetry.sdk.trace import TracerProvider
3
from opentelemetry.sdk.metrics import MeterProvider
4
from opentelemetry.sdk.trace.export import BatchSpanProcessor
5
from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter
6
from opentelemetry.exporter.otlp.proto.grpc.metric_exporter import OTLPMetricExporter
7
from opentelemetry.sdk.metrics.export import PeriodicExportingMetricReader
8

9
# Set up tracing
10
trace.set_tracer_provider(TracerProvider())
11
trace.get_tracer_provider().add_span_processor(
12
    BatchSpanProcessor(OTLPSpanExporter(endpoint="http://otel-collector:4317"))
13
)
14

15
# Set up metrics
16
reader = PeriodicExportingMetricReader(
17
    OTLPMetricExporter(endpoint="http://otel-collector:4317")
18
)
19
metrics.set_meter_provider(MeterProvider(metric_readers=[reader]))
20

21
# Use in your code
22
tracer = trace.get_tracer("my-app")
23
meter = metrics.get_meter("my-app")
24

25
request_counter = meter.create_counter(
26
    "http_requests_total",
27
    description="Total HTTP requests",
28
)
29

30
@app.route("/api/data")
31
def get_data():
32
    with tracer.start_as_current_span("get_data") as span:
33
        span.set_attribute("http.method", "GET")
34
        span.set_attribute("http.route", "/api/data")
35
        request_counter.add(1, {"method": "GET", "status": "200"})
36
        return fetch_data()

Manual Instrumentation (Go Example)

1
import (
2
    "go.opentelemetry.io/otel"
3
    "go.opentelemetry.io/otel/trace"
4
)
5

6
tracer := otel.Tracer("my-app")
7

8
func handleRequest(w http.ResponseWriter, r *http.Request) {
9
    ctx, span := tracer.Start(r.Context(), "handleRequest")
10
    defer span.End()
11

12
    span.SetAttributes(
13
        attribute.String("http.method", r.Method),
14
        attribute.String("http.url", r.URL.Path),
15
    )
16

17
    result := fetchData(ctx)  // pass context to propagate trace
18
    json.NewEncoder(w).Encode(result)
19
}

Auto-Instrumentation (Zero Code Changes)

OTel provides auto-instrumentation that patches common libraries automatically:

Python:

1
pip install opentelemetry-distro opentelemetry-exporter-otlp
2
opentelemetry-bootstrap -a install    # auto-install instrumentation packages
3

4
# Run your app with auto-instrumentation
5
opentelemetry-instrument \
6
  --service_name my-app \
7
  --exporter_otlp_endpoint http://otel-collector:4317 \
8
  python app.py

This automatically instruments Flask/Django/FastAPI, requests, SQLAlchemy, psycopg2, redis, and more — no code changes.

Java (agent):

1
java -javaagent:opentelemetry-javaagent.jar \
2
  -Dotel.service.name=my-app \
3
  -Dotel.exporter.otlp.endpoint=http://otel-collector:4317 \
4
  -jar my-app.jar

Node.js:

1
npm install @opentelemetry/auto-instrumentations-node
2

3
node --require @opentelemetry/auto-instrumentations-node/register app.js

What Auto-Instrumentation Captures

Library Type	Examples	Signals
HTTP frameworks	Flask, Express, Spring	Traces + metrics
HTTP clients	requests, axios, HttpClient	Traces
Databases	SQLAlchemy, pg, mysql2	Traces
Message queues	Kafka, RabbitMQ, SQS	Traces
gRPC	grpc-python, grpc-java	Traces + metrics
Redis	redis-py, ioredis	Traces

The OTel Collector

The Collector is a proxy that sits between your apps and your backends. It receives telemetry, processes it, and exports it to one or more destinations.

1
App 1 ──► ┌──────────────────────────────────────┐ ──► Prometheus
2
App 2 ──► │  OTel Collector                       │ ──► Loki
3
App 3 ──► │  receivers → processors → exporters   │ ──► Tempo/Jaeger
4
          └──────────────────────────────────────┘

Why Use a Collector?

Decouple apps from backends — Apps send to the Collector; the Collector routes to backends. Change backends without touching apps.
Processing — Filter, sample, batch, add attributes, transform data before exporting.
Multiple backends — Send traces to Tempo and Jaeger, metrics to Prometheus and Datadog, from one pipeline.
Reduce app overhead — Offload batching and retry logic to the Collector.

Collector Configuration

1
receivers:
2
  otlp:
3
    protocols:
4
      grpc:
5
        endpoint: 0.0.0.0:4317
6
      http:
7
        endpoint: 0.0.0.0:4318
8

9
processors:
10
  batch:
11
    timeout: 5s
12
    send_batch_size: 1000
13
  memory_limiter:
14
    check_interval: 1s
15
    limit_mib: 512
16
  resource:
17
    attributes:
18
      - key: environment
19
        value: production
20
        action: upsert
21

22
exporters:
23
  prometheus:
24
    endpoint: 0.0.0.0:8889     # Prometheus scrapes this
25
  otlp/tempo:
26
    endpoint: tempo:4317         # send traces to Tempo
27
    tls:
28
      insecure: true
29
  loki:
30
    endpoint: http://loki:3100/loki/api/v1/push
31

32
service:
33
  pipelines:
34
    metrics:
35
      receivers: [otlp]
36
      processors: [memory_limiter, batch]
37
      exporters: [prometheus]
38
    traces:
39
      receivers: [otlp]
40
      processors: [memory_limiter, batch, resource]
41
      exporters: [otlp/tempo]
42
    logs:
43
      receivers: [otlp]
44
      processors: [memory_limiter, batch]
45
      exporters: [loki]

Pipeline Structure

Each pipeline has three stages:

Stage	Purpose	Examples
Receivers	Accept telemetry data	`otlp`, `prometheus`, `jaeger`, `filelog`
Processors	Transform, filter, batch	`batch`, `memory_limiter`, `filter`, `resource`, `tail_sampling`
Exporters	Send to backends	`prometheus`, `otlp`, `loki`, `jaeger`, `datadog`

Deployment Patterns

Agent (per-host sidecar):

1
┌──────────┐     ┌───────────┐
2
│  App Pod  │────►│ Collector │──► Backend
3
│           │     │ (sidecar) │
4
└──────────┘     └───────────┘

Each pod or host runs a Collector instance. Low latency, local processing.

Gateway (central):

1
App 1 ──►┐
2
App 2 ──►├──► Collector Gateway ──► Backend
3
App 3 ──►┘

One (or a few) Collector instances for the whole cluster. Simpler to manage, single point to configure exports.

Agent + Gateway (recommended for production):

1
App ──► Collector Agent (sidecar) ──► Collector Gateway ──► Backend

Agents handle local collection; the gateway handles routing, sampling, and export. Most flexible.

OTel vs Prometheus Client Libraries

	Prometheus Client	OpenTelemetry
Signals	Metrics only	Metrics + traces + logs
Pull/push	Pull (Prometheus scrapes)	Push (app → Collector)
Vendor lock	Prometheus format	Vendor-neutral (OTLP)
Auto-instrumentation	No	Yes
Trace correlation	No	Yes (trace IDs in metrics/logs)

You can use both: OTel for traces and the Prometheus client for metrics. Or go all-in on OTel — the Collector can export metrics in Prometheus format.

Key Takeaways

OpenTelemetry gives you one SDK for metrics, logs, and traces — instrument once, export to any backend.
Auto-instrumentation captures HTTP, database, and gRPC telemetry with zero code changes.
The OTel Collector decouples your apps from backends: receivers → processors → exporters.
Use the agent + gateway pattern in production for flexibility and reliability.
OTel and Prometheus complement each other — you don’t have to choose one exclusively.