Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

DataBells/data_warehouse_sql

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

42 Commits

Repository files navigation

Modern Data Warehouse Project

πŸ“ Project Overview

This project implements a Modern Data Warehouse using PostgreSQL, structured with the Medallion Architecture (Bronze, Silver, and Gold layers). It integrates and analyzes structured data from CRM (Customer Relationship Management) and ERP (Enterprise Resource Planning) systems.


πŸ”Ή Key Features

Data Integration

  • Combining structured CRM and ERP data into a unified data warehouse.

ETL Pipelines

  • Extract, Transform, and Load (ETL) processes implemented using SQL to ensure data is cleaned, transformed, and loaded efficiently.

Data Modeling

  • Utilizes a Star Schema with fact and dimension tables for optimized query performance and analytical reporting.

Scalability

  • Designed to handle large datasets and support efficient analytical queries, ensuring scalability as data grows.

οΏ½ Architecture Overview

Medallion Architecture Layers

  1. Bronze Layer: Raw data ingestion from CRM and ERP systems.
  2. Silver Layer: Cleaned and transformed data, ready for analysis.
  3. Gold Layer: Aggregated and enriched data for business reporting and insights.

πŸ› οΈ Tools & Technologies

  • Database: PostgreSQL
  • ETL: SQL-based pipelines
  • Data Modeling: Star Schema
  • Data Sources: CRM & ERP systems

πŸ“‚ Project Structure



πŸš€ Getting Started

  1. Clone the Repository:
    git clone https://github.com/DataBells/data_warehouse_sql.git
    

πŸ› οΈ Set Up PostgreSQL

1. Install PostgreSQL

2. Create a Database

  • Open pgAdmin or the psql command-line tool.
  • Run the following SQL command to create a new database:
    CREATE DATABASE modern_data_warehouse;

3. Configure Connection Details

  • Update the connection details in the project configuration file (e.g., config.json or environment variables) with the following:
    {
     "host": "localhost",
     "port": 5432,
     "database": "modern_data_warehouse",
     "user": "your_username",
     "password": "your_password"
    }

πŸš€ Run ETL and Explore Data Models Pipelines

1. Navigate to the scripts Folder

  • Open the scripts/ folder in your project directory.

2. Execute SQL Scripts

  • Run the SQL scripts in the following order to load data into the Bronze, Silver, and Gold layers

πŸ“„ License

This project is licensed under the MIT License.

πŸ™Œ Contributing

Contributions are welcome! Please open an issue or submit a pull request for any improvements.

This markdown provides a structured overview of your Modern Data Warehouse project, including key features, architecture, tools, and setup instructions. Let me know if you need further customization! πŸš€

About

Domain: E-commerce; Building a data warehouse with sql including ETL, modeling and analytics

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors

AltStyle γ«γ‚ˆγ£γ¦ε€‰ζ›γ•γ‚ŒγŸγƒšγƒΌγ‚Έ (->γ‚ͺγƒͺγ‚ΈγƒŠγƒ«) /