Posts for: #System-Design

Revision History and Snapshotting

2026-04-06sohilladhani

#distributed-systems #storage #architecture #system-design

A user hits Ctrl+Z forty times and expects to land exactly where they were yesterday. That is not just undo. That is a complete audit trail of every edit, stored efficiently, queryable at any point in time. The naive approach: store a full copy of the document after every change. Works for ten users. Collapses at ten thousand. Deltas, Not Copies Instead of storing full document state after every edit, store only what changed: the operation (insert 3 chars at position 12, delete 5 chars at position 20).

Operational Transformation

2026-04-05sohilladhani

#distributed-systems #consistency #concurrency #architecture #system-design

Two users edit the same document simultaneously. User A inserts “X” at position 5. User B deletes the character at position 3. Apply both naively and the result is corrupted. The positions shifted when B’s deletion ran first, and A’s insertion lands in the wrong place. The Position Problem Operations encode positions at generation time, not application time. When document state changes between generation and application, positions are stale. Operational Transformation (OT) transforms an incoming op relative to already-applied ops before executing it.

Lambda and Kappa Architecture

2026-04-04sohilladhani

#distributed-systems #architecture #kafka #stream-processing #system-design

Real-time results are fast and approximate. Historical results are slow and accurate. The tension between them is where Lambda and Kappa architecture come from. Lambda: Two Pipelines Lambda runs two parallel systems. The batch layer processes all historical data on a schedule (Spark on HDFS, every few hours) and produces ground truth. The speed layer processes the live stream (Kafka Streams or Flink) for low-latency results. The serving layer merges both: “latest batch result plus stream delta since the last batch.

Watermarks and Late-Arriving Data

2026-04-03sohilladhani

#distributed-systems #stream-processing #kafka #system-design

There are two clocks in any stream processing system. Event time: when the click actually happened, recorded in the payload. Processing time: when your system received it. On a healthy network they’re close. In reality they’re not. Mobile clients buffer events when offline. Retries add delay. A click at 10:00:05 might reach your processor at 10:00:47. The 10:00 window has long since closed. The Problem With Never Waiting If you never close a window, you never produce output.

Stream Processing Windows

2026-04-02sohilladhani

#distributed-systems #kafka #stream-processing #system-design

Aggregating over an infinite stream sounds easy until you realize you have no idea when it ends. You need to cut it into chunks. That’s what windows are. Three Window Types Tumbling windows are fixed, non-overlapping buckets. “Clicks per minute” is a tumbling window: minute 1, minute 2, minute 3, no overlap. Simple to implement, but events that span the boundary get split across buckets. Sliding windows overlap. “Average clicks in the last 5 minutes, recomputed every minute” means each event can appear in up to 5 windows.

Cache Write Strategies

2026-03-29sohilladhani

#caching #redis #distributed-systems #system-design #consistency

Reading from cache is easy. Writing is where it gets complicated. Three strategies, each with a different answer to the question: when does the cache get updated relative to the database? Write-through updates the cache and the database synchronously on every write. The cache is always consistent with the DB. The downside is that every write pays double the cost: serialize the object, write to cache, write to DB, all in the same request path.

Hot Key Detection and Mitigation

2026-03-28sohilladhani

#caching #redis #distributed-systems #system-design #performance

Redis is single-threaded per instance. One key receiving 50,000 reads per second will pin a single CPU core and nothing else on that shard gets processed fast. This is the hot key problem. Unlike a database where you might add replicas or indexes, a single Redis key is owned by a single shard. Traffic concentration on that key concentrates CPU on that node. Detection is straightforward: redis-cli --hotkeys scans keyspace and reports access frequency.

Cache Eviction Policies

2026-03-27sohilladhani

#caching #redis #distributed-systems #system-design

Cache fills up. Something has to go. The question is: which thing? LRU (Least Recently Used) evicts whatever was accessed longest ago. Simple, intuitive, fast to implement with a doubly-linked list and hash map. LFU (Least Frequently Used) evicts whatever was accessed least often. More accurate in theory, more expensive in practice. The LFU decay problem tripped me up: new items start with zero frequency. A fresh key that’s about to become hot looks identical to a stale key nobody cares about.

Testing Eventually Consistent Systems: When Assertions Need Patience

2026-03-26sohilladhani

#distributed-systems #system-design #testing #consistency #architecture

You write a record, immediately read it back, and assert equality. The test fails. Not because of a bug, but because the read hit a replica that hasn’t caught up yet. Your test is correct. Your assertion timing isn’t.

Contract Testing: Verifying Service Interactions Without E2E Tests

2026-03-25sohilladhani

#distributed-systems #system-design #testing #microservices #architecture

Team A changes their API response. Team B’s service breaks in production. The integration test suite passed because it was running against a mock from 3 months ago.

Chaos Engineering: Breaking Things on Purpose

2026-03-24sohilladhani

#distributed-systems #system-design #testing #resilience #architecture

Your system passed all tests. Every health check is green. You’re confident it handles failures. Then a network partition happens in production and everything falls apart. You never actually tested failure.

Consumer Group Rebalancing: The Partition Shuffle

2026-03-23sohilladhani

#distributed-systems #system-design #architecture #kafka #event-driven

You have 3 consumers reading from 6 Kafka partitions. One consumer crashes. The remaining 2 need to pick up its partitions. That handoff isn’t as smooth as you’d hope.

Log Compaction: Keeping the Latest Without Keeping Everything

2026-03-22sohilladhani

#distributed-systems #system-design #architecture #kafka #event-driven

Your event log has 100 million records. Key ‘user-42’ has been updated 500 times. You only care about the latest value. But deleting old entries would break consumers who haven’t caught up yet.

Merkle Trees: Detecting Differences Without Comparing Everything

2026-03-21sohilladhani

#distributed-systems #system-design #architecture #data-structures #replication

Two database replicas should have identical data. One has 50 million rows. Comparing row by row would take hours. Merkle trees find the differences by comparing a single hash.

Quorum Reads and Writes: Tuning Consistency with Math

2026-03-20sohilladhani

#distributed-systems #system-design #consistency #replication #architecture

Three replicas, one write. How many replicas need to acknowledge before the write is ‘done’? One? All three? The answer determines your consistency guarantees.

Push vs Pull Metrics Collection: Two Ways to Get the Numbers

2026-03-19sohilladhani

#distributed-systems #system-design #architecture #monitoring #microservices

Should your services push metrics to a collector, or should the collector pull metrics from your services? Sounds like a minor detail. It changes your entire monitoring architecture.

Downsampling: Keeping Trends, Not Every Data Point

2026-03-18sohilladhani

#distributed-systems #system-design #architecture #databases #monitoring

You’re storing metrics at 1-second granularity. After a year, that’s 31 million data points per metric. Nobody looks at second-level data from 6 months ago. But you still need the trends.

Time-Series Databases: Storage Built for Timestamps

2026-03-17sohilladhani

#distributed-systems #system-design #architecture #databases #monitoring

Your monitoring system ingests 100,000 metrics per second. Each is a timestamp, a name, and a value. A regular database buckles. Time-series databases are designed for exactly this shape of data.

Transcoding Pipelines: Processing Video at Scale

2026-03-16sohilladhani

#distributed-systems #system-design #architecture #java #distributed-processing

User uploads one video file. Your system needs to produce 240p, 480p, 720p, and 1080p versions, each with multiple audio tracks. That’s a distributed workflow problem.

Adaptive Bitrate Streaming: Adjusting Quality on the Fly

2026-03-15sohilladhani

#distributed-systems #system-design #architecture #performance #streaming

User starts watching in 1080p. They walk into an elevator. Bandwidth drops. The video freezes and buffers. Adaptive bitrate streaming would have dropped to 480p and kept playing.

CDN and Edge Caching: Serving Content from Next Door

2026-03-14sohilladhani

#distributed-systems #system-design #architecture #caching #performance

Your origin server is in us-east-1. Your user is in Mumbai. That’s 200ms of latency before a single byte transfers. CDNs put your content on a server down the street.

Proximity Search: Finding What’s Nearby at Scale

2026-03-13sohilladhani

#distributed-systems #system-design #architecture #mysql #caching

User opens the app. Show the nearest 10 coffee shops. Sounds simple until you realize ’nearest’ means computing distance against millions of locations in under 100ms.

Quadtrees: When Fixed Grids Aren’t Enough

2026-03-12sohilladhani

#distributed-systems #system-design #architecture #data-structures #indexing

Manhattan has 50,000 restaurants. Rural Wyoming has 3 per county. A fixed-size grid wastes cells on empty space and overloads dense areas. Quadtrees adapt.

Geohashing: Turning Coordinates into Searchable Strings

2026-03-11sohilladhani

#distributed-systems #system-design #architecture #mysql #indexing

Your user is at latitude 37.7749, longitude -122.4194. Your database has 10 million locations. A full table scan comparing every coordinate pair isn’t going to work.

Work Stealing: Dynamic Load Balancing Without a Coordinator

2026-03-10sohilladhani

#distributed-systems #system-design #java #performance #architecture

You split work evenly across 4 threads. Two finish in 10ms, two take 10 seconds. Half your CPU sits idle while the other half grinds. Work stealing fixes this.

Delayed Message Delivery: Execute This in 30 Minutes

2026-03-09sohilladhani

#distributed-systems #system-design #architecture #java #redis

Send a reminder in 24 hours. Retry this job in 5 minutes. Expire this hold at midnight. Delayed execution is everywhere, and Thread.sleep isn’t the answer.

Leader Election: Picking One Node to Rule

2026-03-08sohilladhani

#distributed-systems #system-design #architecture #java #redis

Three nodes, one job. Without leader election, all three run it simultaneously. With leader election, exactly one does the work while the others stand by.

MapReduce: Processing Data That Won’t Fit on One Machine

2026-03-07sohilladhani

#distributed-systems #system-design #architecture #java #performance

Your dataset is 10TB. One machine can’t hold it, let alone process it. MapReduce splits the work across hundreds of machines with a deceptively simple API.

Trie Data Structures: Prefix Search in Milliseconds

2026-03-06sohilladhani

#data-structures #system-design #java #algorithms #performance

User types three characters and expects instant suggestions. A hash map can’t do prefix lookups. A trie can, in O(k) time where k is the query length.

Inverted Indexes: How Search Actually Works

2026-03-05sohilladhani

#data-structures #system-design #java #database #distributed-systems

A normal index maps documents to words. An inverted index maps words to documents. That reversal is why search is fast.

< [Newer posts] :: [Older posts] >