<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Fundamentals on Distributed Data Insights</title><link>http://ddinsights.net/tags/fundamentals/</link><description>Recent content in Fundamentals on Distributed Data Insights</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sun, 07 Jun 2026 10:00:00 -0700</lastBuildDate><atom:link href="http://ddinsights.net/tags/fundamentals/index.xml" rel="self" type="application/rss+xml"/><item><title>Why Distributed Data Is Hard (and Worth It)</title><link>http://ddinsights.net/posts/why-distributed-data-is-hard/</link><pubDate>Sun, 07 Jun 2026 10:00:00 -0700</pubDate><guid>http://ddinsights.net/posts/why-distributed-data-is-hard/</guid><description>&lt;p>Every distributed data system is an answer to the same uncomfortable question:
&lt;em>what do you do when part of your system fails but the rest keeps running?&lt;/em> On a
single machine, a crash takes everything down together — clean, if catastrophic.
Across a network, failure is &lt;strong>partial&lt;/strong>, &lt;strong>ambiguous&lt;/strong>, and &lt;strong>constant&lt;/strong>.&lt;/p>
&lt;h2 id="the-three-things-the-network-takes-away">The three things the network takes away&lt;/h2>
&lt;p>When you split state across machines, you lose three guarantees you took for
granted on a single box:&lt;/p></description></item><item><title>LSM-Trees vs. B-Trees: Choosing Your Write Path</title><link>http://ddinsights.net/posts/lsm-trees-vs-btrees/</link><pubDate>Tue, 02 Jun 2026 08:30:00 -0700</pubDate><guid>http://ddinsights.net/posts/lsm-trees-vs-btrees/</guid><description>&lt;p>Underneath nearly every database is one of two storage engines: a &lt;strong>B-tree&lt;/strong> or an
&lt;strong>LSM-tree&lt;/strong>. The choice shapes write throughput, read latency, space usage, and
how the system behaves under pressure. It&amp;rsquo;s worth understanding &lt;em>why&lt;/em> the big
distributed stores — Cassandra, RocksDB, HBase, ScyllaDB — overwhelmingly chose
LSM-trees.&lt;/p>
&lt;h2 id="b-trees-update-in-place">B-trees: update in place&lt;/h2>
&lt;p>A B-tree keeps data sorted in fixed-size pages and &lt;strong>mutates pages in place&lt;/strong>. To
update a key, you find its page and overwrite it. Reads are excellent — typically
a handful of page lookups — and the structure has powered relational databases for
decades.&lt;/p></description></item></channel></rss>