Kamalakar Peta (@kamalakarpeta)

🎯
Focusing
Experienced Data Professional | Python, SQL, Azure Databricks, PySpark | Data Engineering Focus

kamalakarpeta/README.md

Hi, I'm Kamalakar 👋

Data & AI Platform Architect — Databricks Lakehouse · GenAI · Microsoft Fabric

I design and build governed lakehouse platforms that turn raw enterprise data into trusted products for analysts, data scientists, and BI consumers. Twelve years across the stack — from hand-rolled Python/SQL pipelines to medallion lakehouses on Databricks and Microsoft Fabric, with GenAI/RAG serving layers on top.

  • 🏗️ Currently: Asst. Director, Data Engineering @ Moody's — architecting Databricks & Fabric data platforms
  • 🔭 Focus: Databricks (Delta, Unity Catalog, Mosaic AI), Microsoft Fabric / OneLake, medallion architecture, data quality & governance
  • 🎓 B.Tech CSE (JNTU Anantapur) · MBA Finance (Sri Venkateswara University)
  • 📍 Bangalore, India · 📚 perpetual learner

🚀 Featured work — a Data & AI Platform Architect's journey

A progressive portfolio, from early pipelines to today's lakehouse + GenAI platforms:

| Year | Project | Stack |
|------|---------|-------|
| 2026 | Enterprise Lakehouse on Microsoft Fabric | Fabric · OneLake · Direct Lake · medallion |
| 2025 | Financial-Research RAG on Databricks | Mosaic AI Vector Search · MLflow · model serving |
| 2023 | Grant-Data Integration Pipeline | Databricks · PySpark · Delta · Great Expectations |
| 2022 | Customs & Trade Analytics Lakehouse | Databricks · Unity Catalog · medallion |
| 2021 | Platform-Usage Analytics | Azure (ADF · Synapse · ADLS) · Power BI |
| 2019 | Yield-Curve Outlier Detection | AWS · Streamlit · Terraform |
| 2018 | Predictive Error RCA Pipeline | Python · NLP feature engineering |
| 2016 | Market-Performance Feature Platform | Python · point-in-time datasets |
| 2014 | Structured-Finance Pricing Pipeline | Python · SQL |

🛠️ Skills

Lakehouse & Data Platforms

Databricks · Delta Lake · Unity Catalog · Microsoft Fabric · Apache Spark · PySpark

GenAI & ML

Mosaic AI · MLflow · RAG

Cloud

Azure · Azure Data Factory · Azure Synapse · AWS

Languages

Python · SQL

Orchestration, Quality & IaC

Apache Airflow · Great Expectations · Terraform

BI & Visualization

Power BI · Streamlit

Databases & Storage

MySQL · MongoDB

Dev & Ops

GitHub · Docker · VS Code · Jupyter


🤝 Connect with me

LinkedIn · Portfolio · X · Email

Ask Me Anything!

Popular repositories

  1. grant-data-integration-databricks-pipeline (Public)

    2023 · Automated grant-data integration on Databricks — PySpark, Delta & Great Expectations quality gates feeding downstream consumers.

    5

  2. kamalakarpeta.github.io (Public)

    Kamalakar Peta website

    Python 1

  3. kamalakarpeta (Public)

    Profile README — Data & AI Platform Architect · Databricks Lakehouse · GenAI · Microsoft Fabric

  4. customs-trade-analytics-databricks-pyspark (Public)

    2022 · Databricks + PySpark lakehouse (Delta, medallion, Unity Catalog) enriching the Orbis data product with customs/trade analytics.

  5. structured-finance-pricing-pipeline-python-sql (Public)

    2014 · Python + SQL data pipeline aggregating & normalizing market pricing for US structured finance — serving the pricing desk.

  6. market-performance-analytics-python-ml (Public)

    2016 · Market data engineering pipeline & feature platform (Python) — point-in-time datasets serving analysts & data scientists.
