Issue 134 — December 16, 2016
Featured
CrateDB Tackles Machine Analytics with Scale-Out SQL Database — CrateDB, a new open source database, combines the natural scalability of NoSQL solutions with the SQL interface of standard relational databases. GitHub repo here.
Alex Woodie news
Paying it Forward — How BigQuery’s Data Ingest Breaks Tech Norms — A look at Google BigQuery’s approach to batch data ingestion, why it’s so different to other approaches, and how that matters in practice.
Tino Tereshko story
TextQL: Execute SQL Against Structured Text Like CSV or TSV — Under the hood it pulls in a CSV to SQLite then runs queries against it. Written in Go.
Paul Bergeron tools
SQL Source Control: track each change to your SQL Server database — Get a full history in your source control system. See who made changes, what they did & why. See how
Red Gate sponsored
Richard Hipp on Why SQLite Succeeded as a Database — A talk with Richard Hipp, the creator of SQLite, about its history, where it came from, and why it succeeded as a database. 84 minutes.
The Changelog
Data Wrangling at Slack — The tools and technologies the data engineering team at Slack use to help people in the company quickly and easily answer questions about usage.
Ronnie Chen and Diana Pojar story
Trying Out Bloom Indexes in Postgres — Bloom filters are space-efficient probabilistic data structures that work around hashing data into bitfields. This post tests outs Postgres’ support for them as indexes.
Kaarel Moppel tutorial
How the Most Innovative SaaS Companies Use Data — How the most innovative SaaS companies out there use (and scale) their data practices. 39 minutes podcast, or a complete transcription to read.
Sarah Siwak
Jobs
Job Offers Without the Hassle - Try Hired — Sick of pushy recruiters, and dead end interviews? Try Hired to hear from top tier companies, and only talk to relevant companies.
Hired
In brief
MariaDB Takes On Teradata, Vertica with Column Store — MariaDB has added a column store to its popular relational database, enabling it to run a new class of petabyte-scale analytical queries.
Alex Woodie news
Matthew Groves tutorial
Tim Vaillancourt tutorial
Catalin Besleaga tutorial
Eric Maynard tutorial
Is your database a bottleneck? — Identify and eliminate bottlenecks w/ metrics from your dbs, load balancers, and everything in between.
Datadog tools sponsored
Laying the Foundation for a Data Team — How UK Fintech startup Monzo are building their data team.
Monzo story
Jeroen van der Heijden story
Can't Get Satisfaction from Hadoop? Try Analytic Ops — "an approach to continuous delivery of analytics results that requires close cross-functional collaboration"
Datanami opinion
The Admin Interface Framework — Generate the admin interface of your application. Simply run npm install -g lumber-cli then lumber generate to get started.
Lumber tools sponsored
sheets-api: Create A REST API Backed by Google Sheets — The API can be easily run on AWS Lambda with API Gateway.
DataFire Flows code