
Records and iteration

First published by Atif Alam

This page builds on Data structures — especially list and dict — with patterns you use when each row is a small record (a dict) and you hold many rows in a list. Examples use a fictional service inventory so the shapes match ops-style scripts. Everything here is stdlib only.

List of dicts: create, read, update, delete

```python
services = [
    {"id": 1, "name": "auth-service", "status": "healthy", "replicas": 3},
    {"id": 2, "name": "maps-tile-server", "status": "degraded", "replicas": 5},
    {"id": 3, "name": "routing-engine", "status": "healthy", "replicas": 2},
]

# Create — append a dict
new_service = {"id": 4, "name": "geocoder", "status": "healthy", "replicas": 4}
services.append(new_service)
print("After create:", [s["name"] for s in services])

def find_by_id(data, service_id):
    return next((s for s in data if s["id"] == service_id), None)

# Read — one row or a subset
print("By id:", find_by_id(services, 2))
degraded = [s["name"] for s in services if s["status"] == "degraded"]
print("Degraded:", degraded)

# Update — mutate the dict in place (same object the list holds)
def update_status(data, service_id, new_status):
    row = find_by_id(data, service_id)
    if row:
        row["status"] = new_status
        return True
    return False

update_status(services, 2, "healthy")
print("After update:", find_by_id(services, 2))

# Delete — keep every row except the match (this rebinds services;
# assign to a new name instead if you also need the original list)
services = [s for s in services if s["id"] != 4]
print("After delete:", [s["name"] for s in services])

sorted_services = sorted(services, key=lambda s: s["replicas"], reverse=True)
print("By replicas:", [(s["name"], s["replicas"]) for s in sorted_services])
print("Total replicas:", sum(s["replicas"] for s in services))
```

find_by_id is a small, reusable pattern: scan until the first match and return that dict, or None if nothing matches. It is the same idea as “first row where …” in many APIs.
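The same scan generalizes to any condition, not just id equality. A minimal sketch (the find_first name and toy rows are illustrative, not from this page):

```python
def find_first(data, predicate):
    """Return the first dict for which predicate(row) is true, or None."""
    return next((row for row in data if predicate(row)), None)

services = [
    {"id": 1, "name": "auth-service", "status": "healthy"},
    {"id": 2, "name": "maps-tile-server", "status": "degraded"},
    {"id": 3, "name": "routing-engine", "status": "healthy"},
]

first_degraded = find_first(services, lambda s: s["status"] == "degraded")
print(first_degraded["name"])  # maps-tile-server
print(find_first(services, lambda s: s["id"] == 99))  # None
```

Passing a predicate instead of a hard-coded field keeps one helper usable for every “first row where …” question.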

filter and map return iterators. In day-to-day Python, list comprehensions and sum / max / min are often clearer; still, recognizing filter / map / reduce helps when reading older code or other languages.
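One practical consequence of the iterator behavior: a filter or map result can be consumed only once. A quick sketch with toy data:

```python
statuses = ["healthy", "degraded", "healthy"]

healthy_iter = filter(lambda s: s == "healthy", statuses)

print(list(healthy_iter))  # ['healthy', 'healthy']
print(list(healthy_iter))  # [] (the iterator is already exhausted)
```

Wrapping the result in list(...) up front, as the examples below do, sidesteps the surprise.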

```python
from functools import reduce

services = [
    {"id": 1, "name": "auth-service", "status": "healthy", "replicas": 3, "region": "us-west"},
    {"id": 2, "name": "maps-tile-server", "status": "degraded", "replicas": 5, "region": "us-east"},
    {"id": 3, "name": "routing-engine", "status": "healthy", "replicas": 2, "region": "us-west"},
    {"id": 4, "name": "geocoder", "status": "down", "replicas": 0, "region": "eu-west"},
    {"id": 5, "name": "search-index", "status": "healthy", "replicas": 4, "region": "us-east"},
]

healthy = list(filter(lambda s: s["status"] == "healthy", services))
healthy_lc = [s for s in services if s["status"] == "healthy"]
# Same members as filter() here; list comp is usually preferred for simple filters.
assert [s["id"] for s in healthy] == [s["id"] for s in healthy_lc]

us_healthy = list(
    filter(
        lambda s: s["status"] == "healthy" and s["region"].startswith("us"),
        services,
    )
)
print("US healthy:", [s["name"] for s in us_healthy])

names = list(map(lambda s: s["name"], services))
print("All names:", names)

def enrich_service(s):
    return {**s, "is_critical": s["replicas"] >= 4}

enriched = list(map(enrich_service, services))
print("Enriched sample:", enriched[1])

total_replicas = reduce(lambda acc, s: acc + s["replicas"], services, 0)
print("Total replicas (reduce):", total_replicas)

busiest = reduce(lambda a, b: a if a["replicas"] > b["replicas"] else b, services)
print("Busiest:", busiest["name"], busiest["replicas"])

id_to_name = reduce(lambda acc, s: {**acc, s["id"]: s["name"]}, services, {})
print("ID lookup:", id_to_name)

total_v2 = sum(
    s["replicas"]
    for s in services
    if s["status"] == "healthy" and "us" in s["region"]
)
print("Healthy US replicas (sum):", total_v2)
```

Pragmatic rule: prefer comprehensions and sum(...) / max(..., key=...) when they read naturally. Use reduce when you fold into a non-trivial accumulator (for example building a lookup dict with {**acc, k: v}).
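A fold into a multi-field accumulator is one case where reduce earns its keep: it computes several aggregates in a single pass. A hedged sketch with toy rows (not the inventory above):

```python
from functools import reduce

services = [
    {"name": "auth-service", "replicas": 3},
    {"name": "maps-tile-server", "replicas": 5},
    {"name": "routing-engine", "replicas": 2},
]

# Fold (count, total) through the list in one pass.
count, total = reduce(
    lambda acc, s: (acc[0] + 1, acc[1] + s["replicas"]),
    services,
    (0, 0),
)
print(f"{count} services, {total} replicas")  # 3 services, 10 replicas
```

For a single aggregate, len(services) and sum(...) stay clearer; the tuple accumulator only pays off when the aggregates really belong to one traversal.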

Grouping: itertools.groupby and a dict of lists


itertools.groupby only groups consecutive rows that share the same key. Sort by that key first, or you get repeated “groups” for the same key.
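A quick sketch of that pitfall on unsorted toy data (plain strings here, not the service inventory):

```python
from itertools import groupby

statuses = ["healthy", "degraded", "healthy"]

# Unsorted input: "healthy" appears as two separate groups.
unsorted_keys = [k for k, _ in groupby(statuses)]
print(unsorted_keys)  # ['healthy', 'degraded', 'healthy']

# Sorted first: one group per distinct key.
sorted_keys = [k for k, _ in groupby(sorted(statuses))]
print(sorted_keys)  # ['degraded', 'healthy']
```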

```python
from itertools import groupby
from operator import itemgetter

services = [
    {"id": 1, "name": "auth-service", "status": "healthy", "replicas": 3, "region": "us-west"},
    {"id": 2, "name": "maps-tile-server", "status": "degraded", "replicas": 5, "region": "us-east"},
    {"id": 3, "name": "routing-engine", "status": "healthy", "replicas": 2, "region": "us-west"},
    {"id": 4, "name": "geocoder", "status": "down", "replicas": 0, "region": "eu-west"},
    {"id": 5, "name": "search-index", "status": "healthy", "replicas": 4, "region": "us-east"},
    {"id": 6, "name": "cdn-edge", "status": "degraded", "replicas": 3, "region": "eu-west"},
]

sorted_by_status = sorted(services, key=itemgetter("status"))
for status, group in groupby(sorted_by_status, key=itemgetter("status")):
    members = list(group)
    print(status, [s["name"] for s in members])

def group_by(data, key):
    result = {}
    for item in data:
        k = item[key]
        result.setdefault(k, []).append(item)
    return result

for region, items in group_by(services, "region").items():
    total = sum(s["replicas"] for s in items)
    print(f"{region}: {len(items)} services, {total} replicas")
```

The group_by helper (dict of lists) does not require sorting and is easy to reuse. itemgetter("field") is a fast, readable key function for sorted and groupby.
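collections.defaultdict(list) is the common stdlib alternative to the setdefault call inside group_by; a minimal sketch with toy rows:

```python
from collections import defaultdict

services = [
    {"name": "auth-service", "region": "us-west"},
    {"name": "maps-tile-server", "region": "us-east"},
    {"name": "routing-engine", "region": "us-west"},
]

# Missing keys get a fresh empty list automatically.
by_region = defaultdict(list)
for s in services:
    by_region[s["region"]].append(s["name"])

print(dict(by_region))  # {'us-west': ['auth-service', 'routing-engine'], 'us-east': ['maps-tile-server']}
```

Either spelling gives a dict of lists; defaultdict just moves the "create the bucket if absent" step out of the loop body.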

The guide Incident records analyzer walks through a small list of incident dicts: filter, enrich, group, aggregate, and a short pipeline — stdlib only, with a full solution you can run.