Culture
Roblox Return to Service - A 73-Hour Downtime Postmortem 18 minutes read.
Great read by the team at Roblox, sharing what they learned from a 73-hour outage. Running production operations at that scale and complexity is incredibly challenging. I'm sure it was tough to figure out what was going on and to stay calm while restoring the service after so many hours, so kudos to them for being transparent and sharing things as they are. What would you take from it into how we build our systems?
How to Design a House to Last 1000 Years (Part I) 6 minutes read.
I know what you're thinking: how does this post relate to building software companies? Reading Brian Potter's post made me think about building companies that can sustainably provide value for 1000 years. Is that possible? Can a software company be designed to withstand discrete destructive events or decay processes? What would the fundamental design of such a company look like if this were the year 3022?