Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@sumanshusamarora
sumanshusamarora
Follow

Sumanshu Arora sumanshusamarora

πŸ’»
Developing...
Dynamic and results-driven data science leader with over 13 years of experience in the data industry. Proven expertise in solving complex business problems.

Organizations

@petnotch

Block or report sumanshusamarora

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sumanshusamarora /README.md

Hi, I'm Sumanshu Arora (Sam) πŸ‘‹

Senior Data Science & AI Strategy Leader | Solutionist | Tech Enthusiast

Email β€’ LinkedIn β€’ GitHub


πŸ§‘β€πŸ’Ό About Me

A strategic and impact-driven data science leader with 14+ years of experience delivering advanced analytics, AI, and ML solutions across diverse industries. I specialize in transforming complex business challenges into scalable, data-driven strategies that drive measurable outcomes.

  • Currently driving AI adoption through real-world GenAI applications.
  • Passionate about mentoring talent, enabling innovation, and building a culture of continuous learning.
  • Strong technical acumen combined with sharp product thinking.
  • A solutionist at heartβ€”tech is both my passion and hobby.

πŸ”₯ What I Do Best

  • AI/ML Product Development: Leading end-to-end delivery of scalable machine learning & AI products, from ideation to production.
  • Strategic Leadership: Driving vision, strategy, and execution of high-impact analytics and GenAI initiatives.
  • Cross-functional Collaboration: Partnering with engineering, product, and business teams to align data science solutions with strategic priorities.
  • Enterprise-Grade Engineering: Building robust pipelines, APIs, and microservices using Python, Django, FastAPI, and JavaScript frameworks.
  • Mentoring & Team Uplift: Shaping high-performing teams through coaching, capability building, and strong feedback culture.
  • Applied Generative AI: Building secure, production-grade GenAI applications using Langchain, LlamaIndex, and embedding-based retrieval.
  • Automation: Obsessed with automating processes for efficiency and scale.
  • Data Storytelling: Turning complex data into actionable recommendations for senior stakeholders and execs.

πŸ› οΈ Technical Toolbox

Languages & Frameworks:
Python, Django, FastAPI, JavaScript, Next.js, Streamlit

Data & MLOps:
SQL, NoSQL (MongoDB, DynamoDB), Snowflake, BigQuery, Delta Lake, Databricks, dbt, Apache Airflow, Mage.ai

Cloud & DevOps:
AWS, Azure, Docker, Kubernetes, Terraform

GenAI & LLM Ecosystem:
Ollama, Langchain, LlamaIndex, OpenAI API, ChromaDB, FAISS, SentenceTransformers

Machine Learning & Deep Learning:
scikit-learn, PyTorch, TensorFlow, XGBoost, LightGBM, Hugging Face, Transformers

Visualization & BI:
Plotly, Superset, Tableau, Metabase

APIs & Microservices:
RESTful APIs, FastAPI, Flask, Kafka

Version Control & CI/CD:
Git, GitHub, GitLab, Jenkins, GitHub Actions, GitLab CI, CircleCI


πŸš€ Projects & Open Source

  • mcp-server-templates: Owner and maintainer of mcp-server-templates – open-source templates for server automation and setup.
  • Mage.ai Contributor: Active contributor to Mage.ai, a modern data orchestration tool for building ETL pipelines.
  • AI-Powered RAG Chatbot: Designed and delivered a secure RAG-based analytics chatbot (OpenAI, Snowflake, Plotly, ChromaDB).
  • Account Lock Release LLM Agent: Automated customer account unlock using GPT-4.1, Slack bot, and Zendesk integration, driving major efficiency gains.
  • CollectIQ: Built a Django app on EKS to automate collections data refresh and prioritization, boosting recovery rates.
  • Debt Collection Strategy Models: Designed predictive models to optimize debt collection and referral strategies, significantly improving outcomes.

See more in my pinned and public repositories!


🏏 Beyond Tech

  • 🏏 Cricket enthusiast and team player on & off the field.
  • πŸ’‘ Tech is my hobby and my passionβ€”I love solving unsolved problems.
  • πŸ€– Always on the lookout for opportunities to automate and optimize.
  • πŸ‘¨β€πŸ« Dedicated mentor, passionate about building the next generation of data science leaders.

πŸ“« Let's Connect!


Sumanshu's GitHub stats

Most of my work is either private repositories or gitlab based so the above is just 10% of how much i code 😁


πŸ“ More About My Journey
  • Education: Bachelor of Technology (Information Technology), Lovely Professional University, India
  • Previous Leadership Roles: Zip Co, Foxtel, ANZ Bank, Westpac, NatWest, AON Hewitt
  • Specialties: GenAI, MLOps, Causal Inference, Data Storytelling, Agile Delivery, Cross-functional Influence

Pinned Loading

  1. Data-Everything/mcp-server-templates Data-Everything/mcp-server-templates Public

    A flexible platform that provides Docker & Kubernetes backends, a lightweight CLI (mcpt), and client utilities for seamless MCP integration. Spin up servers from templates, route requests through a...

    Python 12 2

AltStyle γ«γ‚ˆγ£γ¦ε€‰ζ›γ•γ‚ŒγŸγƒšγƒΌγ‚Έ (->γ‚ͺγƒͺγ‚ΈγƒŠγƒ«) /