Caching
In the realm of distributed systems, scalability is often the most challenging yet critical aspect to master. One of the most powerful and widely adopted strategies for achieving scalability is caching. By strategically placing data closer to the end users, we can drastically reduce latency, alleviate database load, and handle increased traffic without a proportional increase in infrastructure costs.
Why Caching Matters
Before diving into the tools, let’s understand the fundamental role of caching in distributed systems. Caching acts as a temporary storage layer that serves requests for data that has been previously retrieved. This reduces the need for repeated computations and database queries, especially under high load. The key benefits include:
- Reduced latency: Serving data from memory (cache) is orders of magnitude faster than disk or network round trips.
- Lowered database load: By reducing the number of direct database hits, we prevent bottlenecks and improve database scalability.
- Improved resilience: Caching can act as a buffer during database outages or high traffic spikes.
However, caching introduces its own challenges: cache misses (when the data isn’t in the cache), cache consistency (ensuring the cache stays in sync with the source data), and cache evictions (how to handle when cache space is limited). We’ll address these as we explore the tools.
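Before reaching for external tools, the core mechanics (hits, misses, TTL-based eviction) can be sketched in a few lines of in-process JavaScript. This is only an illustrative sketch — `TtlCache` is a hypothetical name, not a library API — and a real distributed cache adds consistency and capacity management on top of it:

```javascript
// Minimal in-process cache with TTL-based eviction (illustrative sketch only).
class TtlCache {
  constructor() {
    this.store = new Map(); // key -> { value, expiresAt }
  }

  set(key, value, ttlMs) {
    this.store.set(key, { value, expiresAt: Date.now() + ttlMs });
  }

  get(key) {
    const entry = this.store.get(key);
    if (!entry) return undefined;       // cache miss
    if (Date.now() > entry.expiresAt) { // entry expired: evict it lazily
      this.store.delete(key);
      return undefined;
    }
    return entry.value;                 // cache hit
  }
}

const cache = new TtlCache();
cache.set('greeting', 'hello', 60_000);
cache.get('greeting'); // 'hello' while the entry is fresh
```

The same three outcomes — hit, miss, expiry — are exactly what Redis and CDNs manage for us, at scale and across machines.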
Redis: In-Memory Data Store for High Performance Caching
Redis (Remote Dictionary Server) is a popular in-memory data structure store, used as a database, cache, and message broker. It is particularly well-suited for high-performance caching due to its speed and flexibility.
Why Redis for Caching?
Redis offers several features that make it ideal for caching in distributed systems:
- In-memory storage: Data is stored in RAM, providing sub-millisecond latency for reads and writes.
- Data structures: Supports complex data structures (hashes, sets, lists, sorted sets) beyond simple key-value pairs.
- Persistence: Options for data persistence (RDB snapshots and AOF) to prevent data loss.
- Replication: Built-in primary-replica replication for high availability and read scaling.
Real-World Example: Redis as a Cache Layer
Imagine a web application that serves product listings. Without caching, every request to the product page would hit the database. With Redis, we can cache the product details for a short period.
Here’s a concrete example using a Node.js application with the redis client:
```javascript
const redis = require('redis');

const client = redis.createClient();
client.connect(); // node-redis v4+ requires an explicit connect (await it in real code)

// Cache a product by its ID
async function cacheProduct(productId) {
  const key = `product:${productId}`;
  // Check if the product is already in cache
  const cached = await client.get(key);
  if (cached) {
    return JSON.parse(cached);
  }
  // If not in cache, fetch from the database (simulated)
  const dbProduct = await fetchFromDatabase(productId);
  // Cache the result for 5 minutes (300 seconds)
  await client.setEx(key, 300, JSON.stringify(dbProduct));
  return dbProduct;
}

// Simulated database fetch (in a real app, this would be a DB query)
async function fetchFromDatabase(productId) {
  return { id: productId, name: `Product ${productId}`, price: 19.99 };
}
```
In this example:
- The `cacheProduct` function first checks the cache (Redis).
- If the product is found in the cache, it is returned immediately.
- If not, the product is fetched from the database and stored in Redis with a five-minute (300-second) expiration.
Key Redis Caching Patterns
- Time-to-Live (TTL): Set an expiration time for cache entries to prevent stale data. As shown above, `setEx` sets both the key and the TTL.
- Cache Invalidation: When a product is updated, the cache must be invalidated. This can be done by:
– Deleting the specific cache key (``client.del(`product:${productId}`)``)
– Choosing a write strategy: cache-aside (load data into the cache only on a miss, as in the example above) or write-through (update the cache whenever the database is written).
- Multi-Stage Caching: For very large datasets, use Redis as a cache layer in front of a database (e.g., PostgreSQL) or as a distributed cache for state.
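The two invalidation strategies above can be sketched side by side. To keep the example runnable without a server, plain `Map`s stand in for the Redis client and the database, and `saveToDatabase`, `updateProductCacheAside`, and `updateProductWriteThrough` are hypothetical helpers, not part of any library:

```javascript
const cache = new Map(); // stand-in for a Redis client
const db = new Map();    // stand-in for the database

function saveToDatabase(product) {
  db.set(product.id, product);
}

// Option 1 (cache-aside): delete the cache key on update;
// the next read misses and repopulates the cache from the database.
function updateProductCacheAside(product) {
  saveToDatabase(product);
  cache.delete(`product:${product.id}`);
}

// Option 2 (write-through): update the cache at the same time as the
// database, so readers never observe a stale entry.
function updateProductWriteThrough(product) {
  saveToDatabase(product);
  cache.set(`product:${product.id}`, JSON.stringify(product));
}

updateProductWriteThrough({ id: 42, name: 'Product 42', price: 19.99 });
cache.get('product:42'); // fresh JSON immediately after the write
```

Cache-aside keeps write paths simple at the cost of one extra miss per update; write-through costs a little on every write but keeps the cache warm.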
Redis in Production: A Case Study
A major e-commerce platform used Redis to cache product catalog data. Before Redis, their database was receiving 10,000 requests per second during peak traffic. After implementation:
- Database load reduced by 78%
- Average response time dropped from 420ms to 18ms
- Scalability improved by 400% with horizontal cluster deployment
CDN: Content Delivery Network for Global Static Assets
CDNs apply the caching principle to static content, serving it from edge locations close to users for low latency and high resilience.
Why CDNs for Static Assets?
CDNs are optimized for:
- Static content delivery: Images, CSS, JavaScript, and HTML files
- Global distribution: Edge servers located in multiple geographic regions
- High traffic resilience: Automatic failover and load balancing
- Bandwidth savings: Reduced origin server load through caching
Real-World Example: Video Streaming Service
Consider a video streaming service:
- Without CDN: Origin server in New York serves videos to users in Tokyo → 1200ms latency
- With CDN: Video files cached at Tokyo edge server → 12ms latency
- Result: 90% reduction in origin server load during peak traffic
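The offload arithmetic behind a figure like "90% reduction" is simple: with a 90% edge hit ratio, only the missing 10% of requests ever reach the origin. A quick sketch (the numbers are illustrative, and `originRequests` is a hypothetical helper):

```javascript
// Requests that still reach the origin, given a CDN edge cache hit ratio.
function originRequests(totalRequests, hitRatio) {
  return Math.round(totalRequests * (1 - hitRatio));
}

originRequests(100_000, 0.9); // 10,000 requests reach the origin; the edge absorbs the rest
```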
Here’s a practical implementation scenario:
```bash
# Configure a CDN cache rule for video assets (illustrative payload;
# consult your CDN provider's API reference for exact endpoints and schemas)
curl -X POST "https://api.cloudflare.com/v4/zones/{zone_id}/rules" \
  -H "Authorization: Bearer {api_token}" \
  -H "Content-Type: application/json" \
  -d '{
    "actions": [
      {
        "id": "cache",
        "type": "cache"
      }
    ],
    "rules": [
      {
        "action": "cache",
        "conditions": ["path: /videos/*"]
      }
    ]
  }'
```
Key CDN Considerations
- Cache invalidation: Use versioned asset paths (`/videos/v1.2.3/video.mp4`) to avoid stale content
- Edge caching: Configure TTLs per asset type (e.g., 1 hour for images, 24 hours for CSS)
- Security: Implement HTTPS, rate limiting, and DDoS protection at the edge
- Hybrid approach: Use Redis for dynamic session data + CDN for static assets
Redis vs CDN Comparison
| Feature | Redis | CDN |
|---|---|---|
| Data Type | Application data (dynamic) | Static content (images, CSS, JS) |
| Storage | RAM (in-memory) | Distributed disk (edge servers) |
| Latency | Sub-millisecond | Milliseconds |
| Use Case | Session data, product details | Static assets |
| Scalability | Horizontal clusters | Global network (pre-built) |
| Example | User sessions, cart data | Product images, website CSS |
When to Use Which
| Scenario | Recommendation | Why? |
|---|---|---|
| User sessions, real-time data | Redis | In-memory speed for low-latency operations |
| Product images, website assets | CDN | Global distribution with minimal origin load |
| API rate limiting | Redis | In-memory key-value storage for quick lookups |
| User-generated content | Hybrid (Redis + CDN) | Redis for metadata, CDN for final assets |
Summary
Caching is a cornerstone of scalable distributed systems. Redis provides a high-performance in-memory cache for application data, enabling rapid data access and reduced database load. CDNs extend this principle to static content, delivering it globally with low latency and high resilience. Together, they form a powerful toolkit for building systems that scale efficiently and reliably. 🚀