Posts for: #Database

Inverted Indexes: How Search Actually Works

2026-03-05sohilladhani

A normal index maps documents to words. An inverted index maps words to documents. That reversal is why search is fast.

Optimistic vs Pessimistic Concurrency: Locks vs Versions

2026-02-27sohilladhani

#distributed-systems #database #system-design #java #mysql

Two users update the same row. Pessimistic locking blocks one until the other finishes. Optimistic locking lets both try and fails the loser. Choosing wrong kills either throughput or correctness.

[Read more]

Two-Phase Commit: The Original Distributed Transaction

2026-02-26sohilladhani

#distributed-systems #system-design #architecture #java #database

Two-phase commit guarantees atomicity across multiple databases. It also blocks everything if the coordinator dies. Here’s why microservices moved on.

[Read more]

Distributed ID Generation: Snowflake and Friends

2026-02-21sohilladhani

#distributed-systems #system-design #architecture #java #database

Auto-increment IDs break the moment you have more than one database. Snowflake IDs, UUIDs, and database sequences each solve this differently.

[Read more]

Social Graphs at Scale: Storing Relationships in MySQL

2026-02-19sohilladhani

#mysql #database #system-design #architecture #performance

A follows table with two columns seems trivial. Until you need to query it from both directions, across shards, for millions of users.

[Read more]

Cursor-Based Pagination: Why Offset Breaks at Scale

2026-02-15sohilladhani

#mysql #database #performance #system-design #java

OFFSET 50000 makes MySQL scan 50,000 rows just to skip them. Cursor pagination stays fast no matter how deep you go.

[Read more]

Read Replicas: Hidden Consistency Traps

2026-02-12sohilladhani

#mysql #database #replication #consistency #system-design

You added read replicas to scale reads. Now users update their profile and see the old version. Welcome to replica lag.

[Read more]

Database Migrations Without Downtime

2026-02-09sohilladhani

#mysql #database #system-design #deployment #architecture

ALTER TABLE on a 2M row table locks it for minutes. Your users see errors. Here’s how expand-contract and shadow writes let you migrate without downtime.

[Read more]

Connection Pooling: Why Opening Connections Is Expensive

2026-01-31sohilladhani

#performance #database #connection-pooling #java #system-design

The hidden cost of database connections. How connection pools work, why they matter, and how to size them without guessing.

[Read more]

The CAP Theorem: The Cliché I Tried to Avoid

2026-01-21sohilladhani

#distributed-systems #cap-theorem #database #system-design #architecture

Why the CAP Theorem is the most misunderstood rule in system design. Addressing the ‘Pick 2’ lie and how it sets the stage for consensus algorithms.

[Read more]

Materialized Views: The Read Optimization Pattern

2026-01-18sohilladhani

#distributed-systems #database #performance #cqrs #system-design

Why standard views are just aliases and how materialized views act as an ‘in-database cache’ to solve the cross-shard query problem.

[Read more]

Change Data Capture: Streaming Database Changes

2026-01-14sohilladhani

#database #cdc #streaming #event-driven #system-design

How to capture and stream database changes in real-time. CDC patterns, implementation approaches, and when to use it instead of application-level events.

[Read more]

Database Sharding: Splitting Data Across Machines

2026-01-12sohilladhani

#distributed-systems #database #sharding #partitioning #system-design

How to partition database across multiple servers. Hash-based vs range-based sharding, rebalancing strategies, and the complexity that comes with it.

[Read more]

Bloom Filters: Definitely Not Here

2026-01-03sohilladhani

#database #bloom-filters #data-structures #system-design

Bloom filters skip unnecessary disk reads in LSM trees by saying ‘definitely not here’ with zero false negatives. Learn how Cassandra and RocksDB use them.

[Read more]

Compaction Strategies: Cleaning Up After LSM Trees

2026-01-02sohilladhani

#database #lsm-trees #compaction #system-design

LSM trees create SSTables fast but need compaction. Learn size-tiered vs leveled compaction strategies and the write vs read amplification tradeoff.

[Read more]

LSM Trees vs B-Trees: Write Fast or Read Fast

2026-01-01sohilladhani

#database #data-structures #storage #system-design

LSM Trees vs B-Trees: the write-fast or read-fast tradeoff. Learn when to use B-trees (MySQL) vs LSM trees (Cassandra) based on your database workload.

[Read more]

Write-Ahead Logging: How Databases Survive Crashes

2025-12-31sohilladhani

#database #durability #wal #system-design

How do databases survive crashes and ensure durability? Learn how Write-Ahead Logging (WAL) uses sequential writes to guarantee data persistence without killing performance.

[Read more]

Query Execution Plans: Reading EXPLAIN Like a Map

2025-12-26sohilladhani

#database #mysql #performance #explain

Stop staring at EXPLAIN output confused. Learn to read MySQL execution plans like a map and find the root cause of slow queries in seconds, not hours.

[Read more]

Secondary Indexes in Distributed Databases

2025-12-25sohilladhani

#distributed-systems #database #partitioning #system-design

Querying partitioned databases by non-partition keys? Learn the tradeoffs between local and global secondary indexes in distributed systems.

[Read more]

The Hidden Cost of JOINs

2025-12-24sohilladhani

#database #performance #sql #system-design

Every JOIN multiplies query complexity. Learn the three JOIN strategies databases use and when denormalization beats JOIN performance by 30x.

[Read more]

Indexing Strategies That Actually Work

2025-12-23sohilladhani

#database #indexing #performance #system-design

More indexes don’t mean faster queries. Learn when to add, remove, and optimize database indexes. Real examples of 7x performance gains through strategic indexing.

[Read more]

The Query Optimization Framework

2025-12-21sohilladhani

#database #performance #optimization #system-design

Stop guessing at performance problems. Learn the 5-step systematic framework for debugging slow queries that helped reduce query times from 2+ seconds to 30ms.

[Read more]