nilotpal = { "name" : "Nilotpal Dhar", "role" : ["Deep learning", "Machine learning Engineer", "Open Source Developer"], "education" : "B.Tech CSBS — Academy of Technology, West Bengal (2023–2027)", "current" : "6th Semester ← still in college, already in production", "kaggle" : "Expert 🏆 — competition medals & high-ranking finishes", "pypi" : "datadiagnose — zero-dependency ML dataset diagnosis library", "location" : "West Bengal, India 🇮🇳", "stack" : ["Python", "PyTorch", "Scikit-Learn", "FastAPI", "SQL"], "live_apps" : "5 deployed ML projects — each with a public URL & live backend", "available" : True, "looking_for" : ["Full-Time Data Scientist / ML Engineer", "Freelance AI Consulting"], "contact" : "dharnilotpal31@gmail.com", }
I'm a Data Scientist and ML Engineer in my 6th semester of Computer Science & Business Systems — and I've already shipped more production ML systems than most graduates.
I hold Kaggle Expert status earned through competition medals and real-world dataset finishes. I've built and deployed 5 live ML applications covering fraud detection, medical imaging, rent prediction, churn analysis, and NLP recommendation — every single one has a live URL and a FastAPI backend on Render, not just a notebook.
I'm also the author of DataDiagnose — a Python library published on PyPI with zero external dependencies. It auto-diagnoses ML datasets, scores dataset health from 0–100, and recommends the right model type before training begins. Built from scratch using only the Python standard library. 140-test suite. MIT licensed.
My standard: Does it solve a real problem? Does it work reliably in production? If yes — ship it.
Every project below has a live URL — FastAPI backend on Render, frontend on Vercel or GitHub Pages.
| # | Project | What it does | Tech | Links |
|---|---|---|---|---|
| 1 | 🔬 DermSight PRO | ResNet-50 skin lesion classifier · 85%+ acc · 7 ISIC classes · HAM10000 dataset | PyTorch · FastAPI · ResNet-50 | GH Live |
| 2 | 🛡️ FraudGuard AI | Real-time credit card fraud detection microservice with live dashboard | LightGBM · FastAPI · Vercel | GH Live |
| 3 | 🏠 RentIQ | AI rent forecasting across 6 Indian metro cities using tuned XGBoost | XGBoost · FastAPI · GitHub Pages | GH Live |
| 4 | 📡 RevenueShield ML | Telecom churn predictor with real-time risk scoring web app | Random Forest · FastAPI | GH Live |
| 5 | 🎬 Movie Recommender | NLP content-based recommender · Streamlit dashboard · 5,000-movie corpus | NLP · Scikit-Learn · Streamlit | GH Live |
pip install datadiagnose
from datadiagnose import DataDiagnose import pandas as pd report = DataDiagnose(pd.read_csv("dataset.csv")).diagnose() # → health score 0–100, detected issues, model type recommendations
Detects: missing values · duplicates · class imbalance · data leakage · high cardinality · skewed features · outliers · constant columns
| Issuer | Certificate | Status |
|---|---|---|
| Oracle | Cloud Infrastructure Data Science Professional | Verify |
| Oracle | Cloud Infrastructure AI Foundations Associate | Verify |
| LinkedIn & Microsoft | Career Essentials in Data Analysis | Verify |
| IBM — Cognitive Class | Python 101 for Data Science | Verify |
| Udemy | NumPy · SciPy · Matplotlib · Pandas | Verify |
Languages
ML / Deep Learning / Data Science
PyTorch Scikit-learn Pandas NumPy Matplotlib Plotly SciPy
Deployment & APIs
FastAPI Render Vercel Streamlit GitHub Pages
Databases & Tools