Issue 163 — July 14, 2017
BigchainDB 1.0: A Scalable Blockchain Database — BigchainDB is a decentralized database with blockchain characteristics, at scale. GitHub repo.
Tim Daubenschütz news
Delivering Billions of Messages Exactly Once — The single requirement of all data pipelines is that they cannot lose data, but de-duplicating messages can be the trickier part.
Segment story
A pipeline for continuous data migration with the SQLizer API — With data holed up in third-party apps everywhere, APIs are the sensible way to continuously migrate data to your own databases. The SQLizer API adds the essential convert-to-SQL step to your migration pipeline, all from your command line.
SQLizer sponsored
Do We Need Specialized Graph Databases? — A look through an academic paper where the authors ran a graph data management system benchmark suite over several popular graph databases with interesting results.
the morning paper story
Working with DynamoDB At Scale is Very Tricky — Or, as the author puts it, "You probably shouldn’t use DynamoDB". (Note: DynamoDB is AWS’s NoSQL database service.)
Jono MacDougall opinion
ORMs Should Update "Changed" Values, Not Just "Modified" Ones — How the SQL language and its implementations distinguish between changed values and modified values.
Lukas Eder opinion
Using SQL Within Google Sheets — Did you know Google’s Web-based spreadsheet tool includes a ‘query’ function for running SQL-like queries against your spreadsheet? This is a guide to using it.
Ben Collins tutorial
All About Indexes: The Very Basics — Well-placed and well-defined indexes will improve your database’s performance significantly. A popular article from last year.
Emil Drkušić tutorial
Datanami news
Apache Spark 2.2.0 Released — Structured Streaming, Spark’s stream processing engine, is now considered generally available (GA).
Apache Foundation news
Estimating CPU Per Query with Weighted Linear Regression — This eBook explains how a specialization of linear regression answers seemingly unsolvable database questions.
VividCortex sponsored
Eren Baydemir tutorial
Jack Christensen tutorial
Access all of the content from MongoDB World 2017 — Watch the keynote videos and download all of the breakout session presentations here.
MongoDB video sponsored
Datanami story
BuntDB: Embeddable, In-Memory Key/Value Database — A low-level, in-memory (but persists to disk), key/value store in pure Go, with geospatial support. Inspired by BoltDB.
Josh Baker code