# ElastiCache
ElastiCache is a managed in-memory data store running Redis or Memcached. It sits between your application and your database, serving frequently accessed data at sub-millisecond latency.
## Why Caching?

Without cache:

```
Client ──► App ──► Database (10–100 ms)
```

With cache:

```
Client ──► App ──► Cache (< 1 ms, hit) ──► return
             └──► cache miss ──► Database (10–100 ms) ──► store in cache ──► return
```

Caching reduces database load, lowers latency, and improves throughput. A cache hit is typically ~100x faster than a database query.
## Redis vs Memcached

| Feature | Redis | Memcached |
|---|---|---|
| Data structures | Strings, hashes, lists, sets, sorted sets, streams, bitmaps | Strings only |
| Persistence | Yes (snapshots + AOF) | No (in-memory only) |
| Replication | Yes (primary + replicas) | No |
| Cluster mode | Yes (automatic sharding) | Yes (client-side sharding) |
| Pub/Sub | Yes | No |
| Transactions | Yes (MULTI/EXEC) | No |
| Lua scripting | Yes | No |
| Use cases | Caching, sessions, leaderboards, queues, pub/sub, real-time analytics | Simple key-value caching |
Choose Redis for most use cases — it’s more versatile. Choose Memcached only for simple caching with multi-threaded performance.
## ElastiCache for Redis

### Cluster Modes

**Cluster Mode Disabled (Single Shard):**

```
         (write)         (read)
Client ──► Primary ──────► Replica 1 (AZ-a)
                   ──────► Replica 2 (AZ-b)
```

- One primary node (writes) + up to 5 replicas (reads).
- All nodes hold the full dataset.
- Good for: datasets that fit in one node (up to ~340 GB).
**Cluster Mode Enabled (Multiple Shards):**

```
Client ──► Shard 1: Primary + Replicas (keys A–M)
       ──► Shard 2: Primary + Replicas (keys N–Z)
       ──► Shard 3: Primary + Replicas (...)
```

- Data is partitioned across shards (each shard is a primary + replicas).
- Up to 500 shards.
- Good for: large datasets, high write throughput (writes scale with shards).
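With cluster mode enabled, the shard for each key is not chosen arbitrarily: per the Redis Cluster specification, a key maps to one of 16,384 hash slots via `CRC16(key) mod 16384`, and each shard owns a range of slots. The sketch below reimplements that slot function for illustration only (a real client library computes this for you), including the `{hash tag}` rule that lets related keys land on the same shard:

```python
def crc16(data: bytes) -> int:
    """CRC16-CCITT (XMODEM variant): polynomial 0x1021, initial value 0 —
    the checksum Redis Cluster uses for key slots."""
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            crc = ((crc << 1) ^ 0x1021) if crc & 0x8000 else (crc << 1)
            crc &= 0xFFFF
    return crc

def key_slot(key: str) -> int:
    """Map a key to one of 16384 hash slots. If the key contains a
    non-empty {hash tag}, only the tag is hashed, so related keys
    can be forced onto the same shard."""
    start = key.find('{')
    if start != -1:
        end = key.find('}', start + 1)
        if end > start + 1:  # non-empty tag
            key = key[start + 1:end]
    return crc16(key.encode()) % 16384

print(key_slot("foo"))  # 12182 — same value CLUSTER KEYSLOT foo returns
print(key_slot("user:{123}:profile") == key_slot("user:{123}:sessions"))  # True
```

Multi-key operations (e.g. `MGET`) only work in cluster mode when all keys hash to the same slot, which is exactly what hash tags are for.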
### Node Types

| Type | vCPUs | Memory | Use Case |
|---|---|---|---|
| cache.t3.micro | 2 | 0.5 GB | Dev/test (free tier eligible) |
| cache.t3.medium | 2 | 3.09 GB | Small production |
| cache.r6g.large | 2 | 13.07 GB | Production caching |
| cache.r6g.xlarge | 4 | 26.32 GB | Large datasets |
| cache.r6g.4xlarge | 16 | 105.81 GB | High-memory workloads |
Graviton (r6g, r7g) nodes are ~20% cheaper than Intel equivalents.
### Creating a Redis Cluster

```shell
# Cluster mode disabled (1 primary + 2 replicas)
aws elasticache create-replication-group \
  --replication-group-id my-redis \
  --replication-group-description "Production cache" \
  --engine redis \
  --engine-version 7.0 \
  --node-type cache.r6g.large \
  --num-cache-clusters 3 \
  --automatic-failover-enabled \
  --multi-az-enabled \
  --at-rest-encryption-enabled \
  --transit-encryption-enabled \
  --cache-subnet-group-name my-cache-subnets \
  --security-group-ids sg-redis
```

## Caching Patterns

### Cache-Aside (Lazy Loading)

The most common pattern — the application manages the cache:
```python
import json

import redis

r = redis.Redis(host='my-redis.abc.ng.0001.use1.cache.amazonaws.com', port=6379)

def get_user(user_id):
    # 1. Check cache
    cached = r.get(f"user:{user_id}")
    if cached:
        return json.loads(cached)  # cache hit

    # 2. Cache miss — query database
    user = db.query("SELECT * FROM users WHERE id = %s", user_id)

    # 3. Store in cache with TTL
    r.setex(f"user:{user_id}", 3600, json.dumps(user))  # 1 hour TTL
    return user
```

| Pros | Cons |
|---|---|
| Only caches data that’s actually requested | First request for each key is slow (cache miss) |
| Cache failures don’t break the app | Data can become stale (until TTL expires) |
| Simple to implement | Application must handle both cache and DB |
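One common refinement of cache-aside, not shown in the snippet above, is adding random jitter to TTLs: if many keys are cached at the same moment with the same TTL, they all expire together and the resulting misses stampede the database. A minimal sketch — the 10% jitter fraction is an arbitrary choice, not an ElastiCache setting:

```python
import random

def ttl_with_jitter(base_ttl: int, jitter: float = 0.1) -> int:
    """Return base_ttl varied by up to +/- jitter fraction, so keys
    cached together don't all expire at the same instant."""
    delta = int(base_ttl * jitter)
    return base_ttl + random.randint(-delta, delta)

# Usage in step 3 of cache-aside (r, user_id, user as in the example above):
# r.setex(f"user:{user_id}", ttl_with_jitter(3600), json.dumps(user))
print(ttl_with_jitter(3600))  # somewhere in [3240, 3960]
```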
### Write-Through

Write to the cache and the database at the same time:

```python
def update_user(user_id, data):
    # Write to DB
    db.execute("UPDATE users SET ... WHERE id = %s", user_id, data)

    # Write to cache
    r.setex(f"user:{user_id}", 3600, json.dumps(data))
```

| Pros | Cons |
|---|---|
| Cache is always up to date | Every write hits both cache and DB (slower writes) |
| No stale data | Cache fills with data that may never be read |
### Write-Behind (Write-Back)

Write to the cache first, then asynchronously write to the database:

```
App ──write──► Cache ──async──► Database
```

Very fast writes, but risk of data loss if the cache fails before the DB write. Rarely used with ElastiCache — more common with specialized caching layers.
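The write-behind flow can be sketched as a cache that records dirty keys and persists them in a background flush. This is a toy in-memory model (all class and variable names are illustrative); a production implementation would add batching, retries, and durability guarantees:

```python
from collections import OrderedDict

class WriteBehindCache:
    """Toy write-behind cache: writes land in the cache immediately;
    a background step flushes them to the database later."""

    def __init__(self, db: dict):
        self.db = db            # stands in for the real database
        self.cache = {}
        self.dirty = OrderedDict()  # written but not yet persisted, FIFO

    def put(self, key, value):
        self.cache[key] = value  # fast path: cache only
        self.dirty[key] = value  # remember for the async flush

    def get(self, key):
        return self.cache.get(key, self.db.get(key))

    def flush(self):
        """Normally run by a background worker, not the request path.
        If the cache dies before flush(), these writes are lost —
        the data-loss risk described above."""
        while self.dirty:
            key, value = self.dirty.popitem(last=False)
            self.db[key] = value  # the slow, durable write

db = {}
c = WriteBehindCache(db)
c.put("user:1", "alice")
print("user:1" in db)  # False — the DB write hasn't happened yet
c.flush()
print(db["user:1"])    # alice
```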
## Cache Invalidation

The hardest part of caching. Strategies:
| Strategy | How It Works | Trade-off |
|---|---|---|
| TTL-based | Cache expires after N seconds | Simple; data can be stale for up to TTL |
| Event-based | Delete cache when data changes | Consistent; requires invalidation logic |
| Version keys | `user:123:v5` — increment version on change | No delete needed; old versions expire via TTL |
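The version-key strategy from the table can be sketched as follows. A plain dict stands in for Redis here so the logic is visible; against ElastiCache you would use `r.get`/`r.incr`/`r.setex`, and the key names are illustrative:

```python
import json

store = {}  # stands in for Redis

def current_version(user_id) -> int:
    return store.get(f"user:{user_id}:version", 1)

def cache_key(user_id) -> str:
    # e.g. "user:123:v5" — the version is part of the key
    return f"user:{user_id}:v{current_version(user_id)}"

def cache_user(user_id, data):
    store[cache_key(user_id)] = json.dumps(data)  # setex with a TTL in real Redis

def get_cached_user(user_id):
    raw = store.get(cache_key(user_id))
    return json.loads(raw) if raw is not None else None

def invalidate_user(user_id):
    # Bump the version instead of deleting: readers miss immediately,
    # and the stale "user:123:vN" entry simply ages out via its TTL.
    store[f"user:{user_id}:version"] = current_version(user_id) + 1

cache_user(123, {"name": "alice"})
print(get_cached_user(123))  # {'name': 'alice'}
invalidate_user(123)
print(get_cached_user(123))  # None — old version is no longer referenced
```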
## Common Use Cases

### Session Storage

```python
# Store session in Redis (instead of server memory)
r.setex(f"session:{session_id}", 1800, json.dumps({
    'user_id': 'user_123',
    'role': 'admin',
    'login_time': '2026-02-16T10:00:00Z'
}))

# Any app server can read the session
session = json.loads(r.get(f"session:{session_id}"))
```

Redis-backed sessions work across multiple app servers — critical for load-balanced applications.
### Leaderboard (Sorted Sets)

```python
# Add scores
r.zadd("leaderboard", {"alice": 950, "bob": 870, "charlie": 1020})

# Top 10 players
top_10 = r.zrevrange("leaderboard", 0, 9, withscores=True)
# [('charlie', 1020), ('alice', 950), ('bob', 870)]

# Player's rank
rank = r.zrevrank("leaderboard", "alice")  # 1 (0-indexed)
```

Sorted sets maintain order automatically — O(log N) for inserts and rank lookups.
### Rate Limiting

```python
def is_rate_limited(user_id, limit=100, window=60):
    key = f"ratelimit:{user_id}"
    current = r.incr(key)
    if current == 1:
        r.expire(key, window)  # set TTL on first request
    return current > limit
```

### Pub/Sub (Real-Time Notifications)

```python
# Publisher
r.publish("notifications", json.dumps({
    'user_id': 'user_123',
    'message': 'Your order has shipped'
}))

# Subscriber
pubsub = r.pubsub()
pubsub.subscribe("notifications")
for message in pubsub.listen():
    if message['type'] == 'message':
        data = json.loads(message['data'])
        send_push_notification(data)
```

## ElastiCache Security

| Practice | How |
|---|---|
| Network isolation | Place in private subnets. Allow only app security groups. |
| Encryption at rest | Enable `at-rest-encryption-enabled` (KMS) |
| Encryption in transit | Enable `transit-encryption-enabled` (TLS) |
| Auth | Enable Redis AUTH token (password) or IAM authentication |
| No public access | ElastiCache is VPC-only — no public endpoints |
## ElastiCache vs DynamoDB DAX

| | ElastiCache (Redis) | DynamoDB DAX |
|---|---|---|
| Type | General-purpose cache | DynamoDB-specific cache |
| Manages | Your caching logic | Transparent (no code changes) |
| Data source | Any (RDS, APIs, DynamoDB, etc.) | DynamoDB only |
| Use case | Flexible caching, sessions, leaderboards | Accelerate DynamoDB reads |
If you only need to cache DynamoDB reads, DAX is simpler. For everything else, use ElastiCache.
## Key Takeaways

- ElastiCache Redis is a managed in-memory store for caching, sessions, leaderboards, rate limiting, and pub/sub.
- Cache-aside is the default pattern: check cache → miss → query DB → store in cache.
- Set TTLs on all cached data to prevent staleness. Use event-based invalidation for consistency-critical data.
- Use cluster mode enabled for large datasets and high write throughput. Use cluster mode disabled for simpler setups.
- Place ElastiCache in private subnets with encryption at rest and in transit.
- Choose Redis over Memcached in almost all cases — richer data structures, persistence, replication.