About
About Dikang Gu and Distributed Data Insights — a blog on distributed data systems.
Contents
About me
I’m Dikang Gu, a distributed systems engineer who has spent his career building the data infrastructure behind some of the largest consumer platforms in the world.
I’m currently a Technical Director at Roblox, where I lead the OLTP database and storage platform that provides data storage and serving for Roblox’s hundreds of millions of monthly users and millions of creators. The work centers on fault-tolerant distributed storage, smart partitioning, rock-solid replication, and squeezing every drop of performance out of planet-scale systems.
Before Roblox, I spent many years at Meta (Facebook / Instagram) on the infrastructure teams. As tech lead of Instagram’s storage team, I helped scale Apache Cassandra to thousands of nodes and led the development of Rocksandra — a pluggable RocksDB storage engine for Cassandra — which we later open-sourced. I’ve also worked on optimal data placement at global scale (Akkio) and presented this work at venues including Cassandra Summit, OSCON, F8, and DataStax Accelerate.
Earlier in my career I worked on virtualization at VMware, storage at EMC, and did an internship at Google. I hold an M.S. in Computer Science from Shanghai Jiao Tong University and a B.S. in Computer Science from Nanjing University of Aeronautics and Astronautics. I’m based in the San Francisco Bay Area.
About this blog
Distributed Data Insights is where I write down the patterns, failure modes, and trade-offs that only really become clear once you’ve operated these systems in production:
- Storage & indexing — LSM-trees, B-trees, write/read amplification, compaction.
- Replication & consistency — quorums, leader election, linearizability vs. eventual consistency.
- Streaming & pipelines — exactly-once semantics, backpressure, watermarks.
- Partitioning & scale — sharding strategies, hot keys, rebalancing.
- Operations — observability, capacity planning, and post-incident lessons.
The goal is practical depth: enough theory to reason from first principles, and enough real-world detail to actually be useful on Monday morning.
Get in touch
Find me on LinkedIn or GitHub. Want to suggest a topic or point out something I got wrong? Open an issue or PR on the site repository.