Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@kid-sid
kid-sid
Follow

Sidhartha Mohanty kid-sid

💭
Let's build!
Data Scientist | AI Enthusiast | Open source contributor
  • Pune

Block or report kid-sid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kid-sid /README.md

Sidhartha Mohanty

Data Scientist · Deep Learning · NLP · MLOps

I build ML systems end-to-end — from raw data to deployed models. I believe the best way to understand something is to build it from scratch.

LinkedIn Medium Email


About me

I'm a Data Scientist based in India with a focus on building ML systems that work in production — not just in notebooks. My work spans the full lifecycle: data pipelines, model development, and deployment.

I write on Medium about neural networks, LLMs, and applied ML — always from first principles. If I can't build it from scratch, I don't feel like I truly understand it.

Currently interested in: production RAG systems, LLM fine-tuning, and scalable ML pipelines on Databricks.


What I work on

First-principles ML → Backprop, gradient descent, perceptrons — built by hand
Applied LLMs & RAG → LangChain, LlamaIndex, vector DBs, production pipelines
Data at scale → PySpark, Databricks, pipelines for large datasets
MLOps → FastAPI, Docker, model serving and monitoring

Tech stack

Area Tools
Languages Python · SQL · PySpark
ML / DL PyTorch · TensorFlow · Keras · Scikit-learn
NLP SpaCy · HuggingFace Transformers
LLM & RAG LangChain · LlamaIndex · Vector DBs
Data Pandas · NumPy · Databricks
Infra FastAPI · Docker · Git

Writing

I write about ML concepts by building them from scratch. No hand-waving — just code and math.

Article Topic
Creating a RAG application from scratch Retrieval-augmented generation, end-to-end
Gradient descent from scratch Optimization fundamentals
Backpropagation in neural networks from scratch How neural networks actually learn
Training a single perceptron from scratch Where deep learning begins

Read all posts on Medium


GitHub stats


Open to collaborations in applied ML and LLMs · Based in India

Popular repositories Loading

  1. claude-spellbook claude-spellbook Public

    A curated collection of skills, prompts, and workflows that extend Claude's capabilities — your personal grimoire for AI-powered development.

    Python 175 15

  2. codex-spellbook codex-spellbook Public

    A curated collection of skills, prompts, and workflows that extend Codex's capabilities — your personal grimoire for AI-powered development.

    PowerShell 14 1

  3. memory_map memory_map Public

    Persistent memory & conversation history MCP server for Claude Code. Saves project context as key-value pairs, auto-summarizes sessions via LLM call, and supports cross-project memory search and gl...

    Python 9 3

  4. kid-sid kid-sid Public

    Config files for my GitHub profile.

  5. EDA-Data-visualization-using-python EDA-Data-visualization-using-python Public

    This one is the small example of tfidf vectorization of text data using sklearn

    Jupyter Notebook

  6. tfidf-implementation-using-sklearn tfidf-implementation-using-sklearn Public

    This is a simple example of tfidf implementation using sklearn

AltStyle によって変換されたページ (->オリジナル) /