Name	Name	Last commit message	Last commit date
Latest commit History 9 Commits
07-Python-Modules	07-Python-Modules
08-Python-File-Handling	08-Python-File-Handling
09-Python-Json	09-Python-Json
10-Python-Random	10-Python-Random
11-Python-Blocks	11-Python-Blocks
12-Python-Logging	12-Python-Logging
13-Python-Interview-Qs	13-Python-Interview-Qs
14-Project-Streamlit	14-Project-Streamlit
01-Python-Introduction.ipynb	01-Python-Introduction.ipynb
02-Python-Conditions.ipynb	02-Python-Conditions.ipynb
03-Python-Loops.ipynb	03-Python-Loops.ipynb
04-Python-Functions.ipynb	04-Python-Functions.ipynb
05-Python-Operators.ipynb	05-Python-Operators.ipynb
06-Python-Collection.ipynb	06-Python-Collection.ipynb
07-Python-Modules-Packages.ipynb	07-Python-Modules-Packages.ipynb
README.md	README.md

Python Data Engineer Learning Repository

Welcome to the Python Data Engineer learning repository! This repo contains a structured, practical set of Jupyter notebooks for learning core Python concepts, especially with a focus on data engineering. Each topic is covered with hands-on examples and explanations, and links are provided to the code for easy reference.

Note: This summary is based on the top-level files; for a full list of all tutorials and scripts, check the GitHub repository contents.

📚 Topics Covered

01. Python Introduction

Overview: Introduction to Python, variables, data types, and basic operations.
Key Concepts:
- Printing and string manipulation
- Variable assignment and naming
- Numeric, string, and boolean data types
- Type conversion, built-in functions, and string methods
- List basics and common list operations

02. Python Conditions

Overview: Mastering conditional statements for decision making.
Key Concepts:
- if, elif, else statements
- Comparison and logical operators
- Nested conditions and practical examples

03. Python Loops

Overview: Using loops to automate repetitive tasks.
Key Concepts:
- for and while loops
- Loop control (break, continue, pass)
- Looping through lists, strings, and dictionaries

04. Python Functions

Overview: Writing reusable blocks of code with functions.
Key Concepts:
- Defining and calling functions
- Parameters, return values, and scope
- Lambda functions and higher-order functions

05. Python Operators

Overview: Using operators to manipulate data.
Key Concepts:
- Arithmetic, assignment, comparison, logical, bitwise, and membership operators
- Precedence and associativity

06. Python Collections

Overview: Mastering data structures for efficient storage and retrieval.
Key Concepts:
- Lists, tuples, sets, dictionaries
- When and how to use each collection
- Real-world data engineering examples using collections

07. Python Modules & Packages

Overview: Organizing and reusing code with modules and packages.
Key Concepts:
- The difference between modules, packages, and libraries (with LEGO analogies)
- Importing and using built-in and external libraries (e.g., Pandas, NumPy, Matplotlib, Requests, Scikit-learn)
- Creating custom modules and packages

08. Files-CSV Directory

Overview: CSV File handling and manipulation for data storage and retrieval.
Key Concepts:
- Reading and writing text and CSV files
- Using csv file with Pandas library
- File and directory operations using os and shutil
- Handling file paths and exceptions
- Data extraction and ingestion from files

09. JSON Directory

Overview: Managing JSON data formats for configuration and data exchange.
Key Concepts:
- Reading and writing JSON files with Python’s json module
- Parsing and serializing complex JSON structures
- Real-world use cases: configuration files, API responses
- Data transformation between JSON and Python objects

10. Randoms Directory

Overview: Working with randomness, generating random numbers and data for testing and simulations.
Key Concepts:
- Using Python’s random module for numbers, choices, and shuffling
- Generating random data for data engineering tasks
- Introduction to the faker library for synthetic data creation
- Practical examples: random sampling, data anonymization

11. Blocks Directory

Overview: Code blocks and reusable scripts for modular data engineering workflows.
Key Concepts:
- Encapsulating logic in code blocks (functions, scripts)
- Organizing reusable code for ETL pipelines
- Example templates for batch processing and automation

12. Logging Directory

Overview: Logging and monitoring data engineering processes.
Key Concepts:
- Using Python’s logging module for event tracking
- Setting up log formats, levels, and handlers
- Best practices for error handling and process monitoring
- Writing logs to files and integrating with external tools

📎 How to Use This Repo

Browse Notebooks: Start with the Jupyter notebooks in the main directory for a structured learning path.
Explore Directories: Check out the additional folders for more scripts and data.
Try the Code: Run the notebooks locally or in an online Jupyter environment.
Contribute: Pull requests to add new topics or improve examples are welcome!

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gkdevops/python-data-engineer

Folders and files

Latest commit

History

Repository files navigation

Python Data Engineer Learning Repository

📚 Topics Covered

01. Python Introduction

02. Python Conditions

03. Python Loops

04. Python Functions

05. Python Operators

06. Python Collections

07. Python Modules & Packages

08. Files-CSV Directory

09. JSON Directory

10. Randoms Directory

11. Blocks Directory

12. Logging Directory

📎 How to Use This Repo

🔗 Explore More

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages