Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@Mahnoor-data
Mahnoor-data
Follow

Mahnoor Zakir Mahnoor-data

Aspiring Data Analyst | Python • SQL • Power BI • Data Visualization • Excel • EDA • Statistic | Passionate about turning curiosity into actionable insights
  • Pakistan
  • 10:43 (UTC +05:00)

Block or report Mahnoor-data

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Mahnoor-data /README.md

Mahnoor Zakir

Data Science student building systems that solve real problems with data, NLP, and deep learning. Focused on production-ready projects with live demos and measurable impact.

Python Pandas NumPy TensorFlow PyTorch SQL Git

---

About

I am a final-year DataScience student at University of Engineering and Technology Peshawar, specializing in data science and natural language processing. My work bridges traditional data analytics with modern AI — from Power BI dashboards for business insights to RAG-based systems for sensitive domains like religious text retrieval.

I build projects end-to-end: data collection, cleaning, modeling, deployment, and documentation. Every repository includes a live demo, professional README, and reproducible code.


Technical Skills

Category Tools
Languages Python, SQL
Data & Analytics Pandas, NumPy, Matplotlib, Seaborn, Power BI, Excel
Machine Learning TensorFlow, PyTorch, Scikit-learn
NLP & LLMs HuggingFace Transformers, Sentence Transformers, LangChain, Groq
Vector & Retrieval ChromaDB, FAISS, semantic search, RAG pipelines
Deployment Hugging Face Spaces, Gradio, Streamlit
Data Engineering API integration, PDF extraction, data pipeline design
Generative AI

Featured Projects

QuranFiqah — RAG-Based Islamic Question Answering System

A production-deployed Retrieval-Augmented Generation system that answers Islamic jurisprudence questions using authenticated Quran, Hadith, and Tafseer sources. Constrains LLM output to retrieved content only — eliminating hallucination risks critical for religious guidance.

  • Architecture: multilingual-e5-large embeddings + ChromaDB vector search + Llama 3.3 70B constrained generation
  • Corpus: 36,606 documents (6,236 Quran verses, ~30,000 Hadith, 6,235 Tafseer entries)
  • Performance: 3-4s response time, 100% source citation accuracy, 0% hallucination rate by design
  • Live Demo: huggingface.co/spaces/noormrc123/fiqah-qa
  • Repository: github.com/Mahnoor-data/QuranFiqah

Key technical decisions:

  • Selected ChromaDB over Pinecone/Weaviate for zero-cost persistent storage
  • Implemented batch embedding (64 docs/batch) to optimize Colab GPU utilization
  • Designed strict system prompts with temperature=0.3 to prevent creative generation of religious rulings
  • Handled real-world data engineering challenges: API failures, PDF OCR limitations, duplicate ID resolution

Hinglish YouTube Sentiment Analysis

Deep learning comparison of RNN, LSTM, GRU, BiLSTM, and DistilBERT for sentiment classification of code-mixed Hinglish (Hindi-English) YouTube comments. Includes full data pipeline from collection to visualization.

  • Dataset: 24,000 comments from 12 videos, balanced to 9,552 samples
  • Models: RNN (65%), LSTM (72.1%), GRU (72.5%), BiLSTM (74%), DistilBERT (85%)
  • Key Finding: BiLSTM delivers best accuracy-time ratio (74% in 9.9s); DistilBERT dominates accuracy but costs 28x training time
  • Techniques: Automated RoBERTa labeling, stratified split, custom readability filters, professional matplotlib visualizations
  • Repository: github.com/Mahnoor-data/hinglish-sentiment

E-Commerce Sales Insights — Blinkit Dataset

End-to-end retail analytics project analyzing consumer behavior, outlet performance, and product trends.

  • Tools: Python (Pandas, NumPy), SQL, Power BI
  • Impact: Identified Low Fat products = 64% of sales; Medium outlets = 44% revenue engine
  • Deliverable: Interactive Power BI dashboard with drill-down capability
  • Repository: github.com/Mahnoor-data/E-Commerce-Sales-and-Insights

Customer Retention & Churn Analysis

Unified 7 relational tables into a customer model to identify churn drivers and retention opportunities.

  • Tools: Power BI (DAX), SQL
  • Impact: Discovered 104K churned customers causing 9.38% revenue loss; 147K at-risk customers identified
  • Key Drivers: Order cancellations and late deliveries
  • Deliverable: Power BI dashboard with churn risk segmentation
  • Repository: github.com/Mahnoor-data/Customer-Retention-and-Churn-Analysis

Marketing Campaign ROI & A/B Testing

Statistical evaluation of campaign effectiveness across channels and devices.

  • Tools: Python (Pandas, Statsmodels), Power BI
  • Methods: ANOVA, T-tests, CPA/ROAS/Profit Margin engineering
  • Finding: Mobile CTR > Desktop, but Desktop CVR and profitability stronger; Promos delivered highest ROAS (474%)
  • Deliverable: Power BI dashboard for channel/device performance optimization
  • Repository: github.com/Mahnoor-data/Marketing-Campaign-ROI-A-B-Testing

Education

BS Datascience — University of Engineering and Technology Peshawar, Pakistan


Contact


Last updated: June 2026

Popular repositories Loading

  1. hinglish-youtube-sentiment-analysis hinglish-youtube-sentiment-analysis Public

    Sentiment analysis of code-mixed Hinglish YouTube comments using RNN, LSTM, GRU, BiLSTM and DistilBERT

    Jupyter Notebook 1

  2. fiqah-rag-engine fiqah-rag-engine Public

    A hallucination-free Islamic Q&A system that only answers from verified Quran, Hadith, and Tafseer sources. Every claim is cited. Nothing is made up.

    Jupyter Notebook 1

  3. Mahnoor-data Mahnoor-data Public

    Aspiring Data Analyst | Skilled in Python, SQL & Power BI | Passionate about Data Visualization, Analytics & Turning Curiosity into Insights | Data Science student building projects to uncover pat...

  4. E-Commerce-Sales-and-Insights E-Commerce-Sales-and-Insights Public

    Data analysis project on Blinkit sales dataset using Python, SQL, and Power BI. Covers data cleaning, transformation, advanced SQL queries, and interactive Power BI dashboards to uncover consumer t...

  5. Marketing-Campaign-ROI-A-B-Testing Marketing-Campaign-ROI-A-B-Testing Public

    This project evaluates marketing campaign effectiveness using A/B testing and statistical analysis to optimize ROI and profitability. The analysis leveraged Python (Pandas, Statsmodels, ANOVA/T-tes...

    Python

  6. Customer-Retention-and-Churn-Analysis Customer-Retention-and-Churn-Analysis Public

    This project focuses on customer retention and churn analysis using Power BI (DAX, calculated columns, interactive dashboards) and Excel for preprocessing. The analysis identifies churned, at-risk,...

AltStyle によって変換されたページ (->オリジナル) /