Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

boringdata/duckhouse

Repository files navigation

Duckhouse: DuckDB + Iceberg + Flight

DuckDB has become well-known as a lightweight, portable, and fast OLAP database.

While it excels as an embedded engine, could we push its boundaries further?

Could we build an actual data platform centered around DuckDB?

This is the idea behind Duckhouse: Duckhouse Architecture

Check the full article here

Getting Started

Installing Dependencies

uv sync

Running the Flight Server

uv run iceberg_over_flight.py serve -w warehouse -p 8816

Ingest data

curl https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2023-01.parquet -o /tmp/yellow_tripdata_2023-01.parquet
uv run ingestion/ingestion.py

Run dbt

cd dbt_xorq_project
export PYTHONPATH="$PWD:$PYTHONPATH"
dbt run

Supported Operations

  • Reading and writing Iceberg tables with Flight Server
  • dbt run using Flight plugin
  • Filtering and column selection

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

AltStyle によって変換されたページ (->オリジナル) /