I specialize in transforming complex data into actionable insights that drive business strategies. With extensive experience in data analytics, visualization, and predictive modeling, I bring a unique approach to analyzing data trends and solving problems using tools like Python, SQL, Power BI, and Excel.
Data Cleaning Data Preprocessing EDA Visualization Predictive Modeling Machine Learning Storytelling Data Transformation Data Profiling Data Modeling
- Data Cleaning & Preparation: Ensuring data quality for accurate insights.
- Exploratory Data Analysis (EDA): Deriving key trends from raw data.
- Data Transformation: Processing and converting raw data into a usable format.
- Data Profiling: Assessing data quality and ensuring it meets analysis standards.
- Data Modeling: Structuring datasets for analysis and machine learning tasks.
- Data Visualization: Creating dynamic dashboards using Power BI, Tableau, and Excel.
- Predictive Modeling: Using machine learning to forecast trends and outcomes.
- Storytelling with Data: Communicating insights effectively through reports and presentations.
| Languages & Tools | Expertise |
|---|---|
| Python | Pandas, NumPy, Scikit-Learn, Matplotlib, Seaborn |
| SQL | PostgreSQL, MySQL |
| Excel | Advanced Functions, Power Query |
| Power BI | DAX, Visualizations |
| Jupyter Notebook | Data Analysis |
| Google Analytics | Traffic Analysis, Conversion Tracking |
| MS Word | Documentation, Reports |
| MS PowerPoint | Presentations, Data Storytelling |
| MS Outlook | Communication, Email Management |
| GitHub | Version Control, Collaboration |
π Exploring the 2024 T20 World Cup!
Dive into a detailed Jupyter Notebook showcasing player performances π, match stats π, and team insights from the 2024 T20 World Cup.
Jupyter
Data Cleaning
Visualization
π Powering data-driven insights with Python
This repository showcases data analysis projects using Python libraries like Pandas, NumPy, Matplotlib, and Seaborn. Projects include medical data visualization, time series analysis, and more.
Python
Pandas
Scikit-learn
Data Analysis
π₯ Your next favorite movie, predicted!
A Python-based project that suggests movies based on user input. Using a pre-calculated similarity matrix of 4,800+ movies, it ranks and displays the top 30 recommendations.
Scikit-learn
Machine Learning
π Predicting sales trends using data science
Using a dataset of 8,500+ entries, this project develops a predictive model for sales based on product and store attributes.
Python
Sales Forecasting
Data Science
ποΈ NYC Airbnb Listings Analysis
Data analysis of NYC Airbnb listings focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews.
Power BI
Business Intelligence
π Discovering the secret behind top-selling pizzas!
Using SQL to analyze 20,000+ pizza sales data and uncover patterns in customer preferences, top-selling pizzas, and peak order times.
SQL
Data Analysis
π Data Analytics for a social media client
In this job simulation, I analyzed datasets for a social media client, providing insights through visual presentations.
Excel
Forage
π Cracking the leaked password database
Part of the Goldman Sachs Software Engineering virtual internship, I solved a password database challenge and produced a detailed analysis.
Security
Software Engineering
π Analyzing customer transaction data
Through the Quantium Data Analytics Job Simulation, I analyzed customer transaction data and provided strategic insights for business decisions.
Jupyter
Customer Analytics
π Driving informed decisions with data
This project focuses on data visualization, creating impactful visuals to communicate business insights effectively for TATA.
Data Visualization
Forage
π¬ Exploring Netflix Data
A comprehensive analysis of Netflixβs data using PostgreSQL, focusing on database normalization, reducing redundancy, and optimizing query performance.
Excel
Python
Tableau
π Analyzing Blinkitβs sales and customer satisfaction
A Power BI dashboard analyzing Blinkitβs total sales, item visibility, and customer ratings.
Power BI
DAX
π Predicting used car prices
Predicts car prices with 92% accuracy using a Random Forest model. It takes you through data cleaning, feature engineering, and model evaluation.
Python
Machine Learning
Regression
- Advanced Data Visualization: Enhancing skills in Power BI and Tableau.
- Time Series Forecasting: Predicting future trends using advanced models.
- Data Science in Business: Applying machine learning techniques to real-world business challenges.
- Enhancing my knowledge of machine learning models.
- Best practices for business reporting and data storytelling.
- Advanced SQL and database optimization techniques.