Issue 146 — March 17, 2017
Featured
How Discord Indexes Billions of Messages — Discord is a chat service supporting millions of users. How could they add quick searching? Here’s how they set it up with Elasticsearch.
Discord story
Timescale: An Open Source Time-Series Database — SQL made scalable for time-series data. It’s Postgres compatible and optimized for fast ingest & complex queries.
Timescale
Real-Time Deduping At Scale with Redis — Deduping at scale is a hard problem and Tapjoy share a lot of details of how they use Redis to do it on over 2 million analytics messages per minute arriving in an ‘at-least-once’ style.
Tapjoy Engineering story
Linode is the SPEEDIEST SSD host for your DB - simple, reliable & powerful. — MySQL? MariaDB? PostgreSQL? Whatever DB you use, run it on a Linode, the most scalable, reliable and fastest servers in the cloud. Use promo code DB20 for a 20ドル credit and get started!
linode sponsored
Miller: Like awk, sed, cut, join, and sort for CSV, TSV, and JSON — Provides Unix tool-esque functionality but around key-value pair data. Written in C too, so no dependencies.
John Kerl tools
Hoodie: Uber's Incremental Processing Framework on Hadoop — Uber Engineering recently built and open sourced Hoodie, an incremental processing framework to power Uber’s data pipelines at low latency.
Uber Engineering tools
Has Hadoop Failed Us? — Alex Woodie brings together several people from across the industry to question if the Hadoop dream of unifying data and compute in a distributed manner has become too expensive and complex.
Datanami opinion
Don’t Migrate Databases Automatically — Migrating your DB and deploying app code at the same time can create a race condition that could take down your app.
Philip I. Thomas opinion
In brief
Amazon Web Services, Inc. news
Fauna news
Database Performance Monitoring Buyer’s Guide — This guide is designed to aid when evaluating database monitoring solutions for your unique environment.
VividCortex tutorial sponsored
James Katz tutorial
Ben Brumm tutorial
An Antique Store Data Model — Vertabelo’s fine data modelling guides continue to get more niche :-)
Vertabelo tutorial
Payal Singh opinion
Paris Kasidiaris opinion
SQL Source Control: track each change to your SQL Server database — Get a full history in your source control system. See who made changes, what they did and why. See how.
Red Gate tools sponsored
Flyway: An Open Source Database Migrations Tool — Works with all major databases.
Boxfuse tools
TeamSQL: A Multi-Platform SQL Client (in Private Beta) — Looks promising. You can join a waiting list if interested.
Tapholic tools
dataviz.tools tools