#356 — May 28, 2021
Database Weekly
immudb v1.0: A 'Tamper-Evident' Database with Immutable History — SQL support has been added and so the creators now boast immudb is the ‘first immutable database with relational capabilities’. Written in Go and Apache licensed.
Codenotary Inc
Amazon Redshift ML Now Generally Available — Redshift is Amazon’s cloud data warehouse that lets you use SQL to query and combine potentially exabytes (if you have the $!) of structured and semi-structured data.. and the ML extension goes a step further in training machine learning models to generate insights from data in your data warehouse. Even better, you can do all of this with SQL.
Danilo Poccia (AWS)
Comparing Fauna and DynamoDB: Features & Pricing — Fauna and DynamoDB are both serverless databases, but their design goals, architecture, and use cases are very different. This post discusses where they shine, where they don’t, and how they offer fundamentally different value propositions.
Fauna sponsor
An Early Look at Postgres 14: Performance and Monitoring Improvements — Last week we featured the Postgres 14 beta 1 release, and now people are starting to chew over it and see where the underlying improvements are. Small, steady, and reliable improvements seem to be the order of the day.
Lukas Fittl
Rethinking Postgres in a Post-Server World — The increasing popularity of ‘serverless’ solutions is having an impact on databases with entire paradigms (e.g. the DynamoDB model) and new services (e.g. Fauna) being created or extended to support serverless use cases. Michael Rispoli shares his thoughts on how databases created in the pre-serverless era can fit into this new world.
Michael Rispoli
SQL Query Optimization: Understanding Some Key Principles — An interesting benchmark-led guide to how different SQL approaches scale upon different sized tables.
Ivan Borshchov
Why Are Time Series Databases Important Now? — Learn what time series is, what makes it different from stream processing, full-text search and other solutions.
INFLUXDATA sponsor
A Look at a Variety of 'Free' Cloud Database Services — In reality, some of these are enterprise-y options that have free developer or trial tiers, but still, it’s a reasonable roundup.
Nebojsa Radakovic
SQLite fdw 2.0.0 Released for PostgreSQL — A foreign data wrapper to work with SQLite from Postgres 9.6, 10, 11, 12, and 13. GitHub repo.
Toshiba Software Engineering
ClangQL: Query C++ Codebases using SQLite — An interesting experiment. clangd is a language server for parsing and processing C++ code and ClangQL then adds SQLite as an intermediary so you can use SQL to ‘query’ the codebase in question.
Francesco Bertolaccini
Jobs
DevOps Engineer at X-Team (Remote) — Join the most energizing community for developers and work on long-term projects for Riot Games, FOX, Sony, Coinbase, and more.
X-Team
Find Data Engineering Jobs with Hired — Take 5 minutes to build your free profile & start getting interviews for your next job. Companies on Hired are actively hiring right now.
Hired
🎧 Some podcast episodes for you:
Software Engineering Daily is one of our favorite podcasts and it seems to have been on a real database kick lately with a variety of new data oriented episodes in the last month alone:
- Data Exploration with a New Python Library with Doris Lee — Doris Jung-Lin Lee is a research assistant who spoke about Lux, a Python library for doing data exploration through 'visual discovery'.
- Data Management Systems and Artificial Intelligence with Arun Kumar — Arun Kumar is another academic working in the data management field, speaking here about the increasing role of artificial intelligent in data management systems.
- Firebolt: Data Warehouses with Eldad Farkash — Eldad is the co-founder of Firebolt, a cloud data warehouse company, and he went on to speak both about Firebolt and modern big data solutions generally.
- ClickHouse: Data Warehousing with Robert Hodges — Altinity, where Robert works, is a leading enterprise provider for ClickHouse, an increasingly popular open-source OLAP system.
- Data Mechanics: Data Engineering with Jean-Yves Stephan — A chat about big data engineering using Apache Spark and how the cloud-native Spark platform he works on offers some unique advantages to developers.
- Apache Hudi: Large Scale Data Systems with Vinoth Chandar — Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development.