LSM-Trees vs. B-Trees: Choosing Your Write Path

Tue, 02 Jun 2026 08:30:00 -0700

Underneath nearly every database is one of two storage engines: a B-tree or an LSM-tree. The choice shapes write throughput, read latency, space usage, and how the system behaves under pressure. It’s worth understanding why the big distributed stores — Cassandra, RocksDB, HBase, ScyllaDB — overwhelmingly chose LSM-trees.

B-trees: update in place

A B-tree keeps data sorted in fixed-size pages and mutates pages in place. To update a key, you find its page and overwrite it. Reads are excellent — typically a handful of page lookups — and the structure has powered relational databases for decades.

Storage on Distributed Data Insights

LSM-Trees vs. B-Trees: Choosing Your Write Path

B-trees: update in place