Professional Summary

I am a results-driven Data Engineering and Analytics professional and Certified Project Management Professional (PMP®) with deep experience in healthcare and financial data ecosystems. I specialize in designing secure, scalable, and cost-optimized Big Data pipelines that transform raw data into actionable insights for decision-making, compliance, and strategic growth.

My expertise spans ETL/ELT development, data governance, and cloud integration across Azure, AWS, and Snowflake, leveraging tools such as Databricks, Airflow, and SQL to build automated and auditable workflows. I apply project management principles to every stage of development — from scope definition to delivery — ensuring projects meet quality, cost, and timeline objectives.

Combining strong technical acumen with leadership, stakeholder management, and agile execution skills, I bridge engineering precision with business strategy. Passionate about using data to improve healthcare quality, financial transparency, and operational efficiency, I focus on turning complex challenges into scalable, insight-driven solutions that deliver measurable impact.

Skills

Data Engineering & ETL

SQL / Python / PySpark / Databricks / Airflow / NiFi / Snowflake / dbt / Azure Data Factory (ADF) / AWS Glue / ETL Design & Optimization

Data Analytics & Visualization

Power BI / Tableau / Looker Studio / Advanced Excel / Statistical Analysis / Data Modeling / KPIs & Reporting Automation

Project Management & Leadership

PMP® Certified / Agile Scrum / SDLC / Stakeholder Management / Risk Mitigation / Delivery Planning / Governance

Tools & Platforms

Git / GitHub Actions / Docker / Linux Shell / PyCharm / DataGrip / Jupyter / Azure DevOps / ServiceNow / Jira

🌟 Featured Projects

FHIR ETL on Spark → Snowflake

End-to-end pipeline for FHIR JSON into curated Snowflake models. SCD2 for Member/Provider, DQ checks, lineage with dbt.

Impact: 2.5× faster loads; ~30% cost savings.

Healthcare Claims Anomaly Detection

ICD-10-CM codes plus provider signals, processed with Python/Spark for real-time anomaly scoring across fraud, waste, and abuse.

Impact: ↓ false positives ~18%; faster integrity reviews.

Finance Pricing & Margin Analytics

Airflow + Snowflake ELT; time-series KPIs & margin forecasts; exec dashboards for decision speed.

Impact: p95 report time 11m → 90s.

Cloud Technologies

Platforms and tooling used to build secure, scalable, and cloud-native data platforms — integrating Azure and AWS services for healthcare and finance analytics.

Azure

ADLS • ADF • Databricks • Synapse • Azure SQL • Functions • Key Vault • DevOps Pipelines

  • FHIR → Bronze/Silver/Gold on Databricks (PySpark) with Delta + Medallion.
  • ADF-based ELT to Synapse/Snowflake with dynamic, parameterized datasets.
  • Serverless Functions for micro-batch ingestion and DQ execution.
🔎 Featured 📄 View More

AWS

S3 • Glue • DMS • Lambda • Step Functions • Athena • EMR • IAM

  • CDC via DMS → S3 → Glue ETL with partitioning & compaction (Parquet).
  • Lambda + Step Functions orchestrating serverless workflows.
  • Athena views for ad-hoc analytics; cost-optimized lifecycle policies.
🔎 Featured 📄 View More
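
The partitioning step in the DMS → S3 → Glue bullet above can be illustrated with a small sketch. It assumes Hive-style `key=value` prefixes partitioned by event date; the `partition_prefix` helper and the table name are hypothetical and not taken from the project itself.

```python
from datetime import datetime


def partition_prefix(table: str, event_time: datetime) -> str:
    """Build a Hive-style date partition prefix (assumed layout, for illustration)."""
    return (
        f"{table}/year={event_time:%Y}"
        f"/month={event_time:%m}"
        f"/day={event_time:%d}/"
    )


if __name__ == "__main__":
    # Hypothetical table name; query engines such as Athena can prune on these keys.
    print(partition_prefix("claims", datetime(2025, 3, 7)))
```

Zero-padded month and day values keep prefixes lexicographically sortable, which is one common reason for this layout.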

📄 View All Cloud Projects

Healthcare Data Engineering & Analytics

Data platforms for FHIR, clinical, and claims ecosystems — enabling interoperability, care quality, and performance analytics.

FHIR ETL — Spark to Snowflake

Built an automated ETL framework for ingesting FHIR-compliant JSON data using PySpark and Snowflake Streams/Tasks. Implemented SCD2 for Member/Provider tables, dbt tests, and lineage tracking.

Impact: 2.5× faster loads, improved auditability, and ~30% cost optimization.

💻 Code 📊 Architecture
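
As a rough illustration of the SCD2 handling mentioned above, the sketch below applies a Type 2 change in plain Python. It is not the project's code: the `MemberVersion` fields, the tracked `plan` attribute, and the `apply_scd2` helper are all assumptions made for the example.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class MemberVersion:
    member_id: str
    plan: str                         # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date] = None   # None marks the current version


def apply_scd2(history: List[MemberVersion], member_id: str,
               plan: str, as_of: date) -> List[MemberVersion]:
    """Return a new history with the incoming record applied as SCD Type 2."""
    current = next(
        (v for v in history if v.member_id == member_id and v.valid_to is None),
        None,
    )
    if current is not None and current.plan == plan:
        return list(history)  # attribute unchanged: no new version needed
    # Close the open version (if any), then append the new current version.
    updated = [replace(v, valid_to=as_of) if v is current else v for v in history]
    updated.append(MemberVersion(member_id, plan, valid_from=as_of))
    return updated
```

In a warehouse the same logic would typically be expressed as a merge statement; the in-memory version here only shows the close-and-insert behaviour.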

Healthcare Claims — Anomaly Detection

Developed an anomaly detection pipeline using ICD-10-CM codes and utilization signals to flag potential fraud, waste, and abuse. Includes profiling, feature engineering, and BI dashboards for SIU/integrity teams.

Impact: Reduced false positives by ~18%; faster case triage and review.

💻 Code 📄 Case Study
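
One simple way to score utilization outliers is a robust z-score based on the median and the median absolute deviation (MAD). The sketch below is an assumption chosen for illustration, not the method used in the project; the function names, the 3.5 threshold, and the sample provider counts are hypothetical.

```python
from statistics import median
from typing import Dict, List


def robust_z_scores(values: List[float]) -> List[float]:
    """Score each value by its distance from the median, scaled by the MAD."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        return [0.0 for _ in values]  # no spread: nothing stands out
    # 0.6745 rescales the MAD so scores are comparable to standard z-scores.
    return [0.6745 * (v - med) / mad for v in values]


def flag_outliers(claims_per_provider: Dict[str, float],
                  threshold: float = 3.5) -> List[str]:
    """Return provider ids whose absolute robust z-score exceeds the threshold."""
    providers = list(claims_per_provider)
    scores = robust_z_scores([claims_per_provider[p] for p in providers])
    return [p for p, s in zip(providers, scores) if abs(s) > threshold]


if __name__ == "__main__":
    sample = {"A": 100, "B": 110, "C": 95, "D": 105, "E": 400}
    print(flag_outliers(sample))
```

Median-based scoring is less sensitive to the extreme values it is trying to detect than a mean and standard deviation would be.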

📄 View More Healthcare Projects

Finance Data Engineering & Analytics

Data platforms for pricing, profitability, and risk — integrating multi-source financial data for faster, sharper decision-making.

Pricing & Margin Analytics

Designed ELT pipelines that integrate product, sales, discount, and cost data to compute net price, margin, and KPIs. Snowflake marts power exec dashboards for segment, customer, and SKU-level profitability.

Impact: Report runtime improved from 11m → 90s; 40% faster KPI analysis.

💻 Code 📄 Case Study
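
The net price and margin arithmetic described above can be sketched as follows. The formula assumes a single percentage discount off list price and a flat unit cost; the `margin_kpis` helper and its parameter names are hypothetical.

```python
from typing import Dict


def margin_kpis(list_price: float, discount_pct: float,
                unit_cost: float, units: int) -> Dict[str, float]:
    """Compute net price, revenue, margin, and margin % for one line item."""
    net_price = list_price * (1 - discount_pct)   # price after discount
    revenue = net_price * units
    margin = (net_price - unit_cost) * units
    margin_pct = margin / revenue if revenue else 0.0
    return {
        "net_price": net_price,
        "revenue": revenue,
        "margin": margin,
        "margin_pct": margin_pct,
    }


if __name__ == "__main__":
    # Hypothetical line item: list price 100, 10% discount, unit cost 60, 10 units.
    print(margin_kpis(100.0, 0.10, 60.0, 10))
```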

Transaction Fraud Analytics

Built near-real-time fraud detection workflows leveraging event streams and anomaly detection in Snowflake and Python. Scoring features include transaction velocity, merchant risk, device fingerprint, and geo anomalies.

Impact: Reduced investigation time by ~35%; improved detection recall by ~15%.

💻 Code 📊 Dashboard
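
Transaction velocity, one of the features named above, is commonly defined as the number of transactions per card within a trailing time window. The sketch below assumes a 10-minute window and input sorted by timestamp; the `transaction_velocity` helper and the window length are illustrative assumptions.

```python
from collections import deque
from datetime import datetime, timedelta
from typing import Deque, Dict, Iterable, List, Tuple


def transaction_velocity(
    events: Iterable[Tuple[str, datetime]],
    window: timedelta = timedelta(minutes=10),
) -> List[Tuple[str, datetime, int]]:
    """For each (card_id, timestamp), count that card's transactions in the
    trailing window, including the current one. Input must be time-ordered."""
    recent: Dict[str, Deque[datetime]] = {}
    out = []
    for card_id, ts in events:
        q = recent.setdefault(card_id, deque())
        q.append(ts)
        # Evict timestamps that have fallen out of the trailing window.
        while q[0] <= ts - window:
            q.popleft()
        out.append((card_id, ts, len(q)))
    return out
```

Keeping one deque per card means each event is appended and evicted at most once, so the pass stays linear in the number of events.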

📄 View More Finance Projects

Supply Chain Data Engineering & Analytics

Data platforms and analytics for warehousing, transportation, and fulfillment — integrating WMS, TMS, and ERP data to improve inventory health, logistics cost, and OTIF performance.

Warehouse Inventory & Replenishment ETL

End-to-end ETL pipeline integrating WMS, ERP, and POS data into curated inventory models on Snowflake / Databricks. Calculates inventory positions, reorder points, and safety stock by SKU–location with dbt tests and lineage.

Impact: Designed to reduce stockouts and excess inventory through data-driven replenishment.

💻 Code 📄 Case Study
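
A textbook form of the reorder point and safety stock calculation is sketched below. It assumes normally distributed, independent daily demand and a fixed lead time, which may differ from the model used in the pipeline; the function names and the 95% default service level are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def safety_stock(daily_demand_std: float, lead_time_days: float,
                 service_level: float = 0.95) -> float:
    """Safety stock = z * sigma_daily * sqrt(lead time), with fixed lead time."""
    z = NormalDist().inv_cdf(service_level)  # z-value for the target service level
    return z * daily_demand_std * sqrt(lead_time_days)


def reorder_point(avg_daily_demand: float, daily_demand_std: float,
                  lead_time_days: float, service_level: float = 0.95) -> float:
    """Reorder point = expected demand over the lead time + safety stock."""
    expected = avg_daily_demand * lead_time_days
    return expected + safety_stock(daily_demand_std, lead_time_days, service_level)


if __name__ == "__main__":
    # Hypothetical SKU-location: 20 units/day, std 10, 4-day lead time.
    print(round(reorder_point(20, 10, 4), 1))
```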

Transportation Route & Cost Optimization

Ingests shipment, carrier, and telematics (GPS) data to build lane-level KPIs such as cost-per-mile, cost-per-drop, and transit reliability. Uses Spark & Snowflake to power interactive dashboards for carrier and route benchmarking.

Impact: Enables optimization of freight spend and routing decisions across the network.

💻 Code 📊 Dashboard
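
The lane-level KPIs listed above reduce to simple aggregations per origin-destination pair. The sketch below is a minimal in-memory version under assumed field names (`origin`, `destination`, `cost`, `miles`, `drops`, `on_time`); it treats transit reliability as the on-time share of shipments, which is one possible definition.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def lane_kpis(shipments: Iterable[dict]) -> Dict[Tuple[str, str], dict]:
    """Aggregate cost-per-mile, cost-per-drop, and on-time rate by lane."""
    totals = defaultdict(
        lambda: {"cost": 0.0, "miles": 0.0, "drops": 0, "on_time": 0, "n": 0}
    )
    for s in shipments:
        t = totals[(s["origin"], s["destination"])]
        t["cost"] += s["cost"]
        t["miles"] += s["miles"]
        t["drops"] += s["drops"]
        t["on_time"] += 1 if s["on_time"] else 0
        t["n"] += 1
    return {
        lane: {
            "cost_per_mile": t["cost"] / t["miles"] if t["miles"] else None,
            "cost_per_drop": t["cost"] / t["drops"] if t["drops"] else None,
            "on_time_rate": t["on_time"] / t["n"],
        }
        for lane, t in totals.items()
    }
```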

📄 View More Supply Chain Projects

Insurance Data Engineering & Analytics

Scalable platforms for policy, claims, and actuarial data — enabling pricing, underwriting, fraud, and regulatory reporting at scale.

Policy & Claims Data Vault

Designed a policy–claims data vault on Snowflake integrating policy admin, billing, and claims systems. Built hubs, links, and satellites to support 360° policy/insured view and downstream star schemas.

Impact: Simplified lineage and faster enablement of new actuarial & reporting use cases.

💻 Code 📊 Architecture
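
To make the hub, link, and satellite structure above concrete, the sketch below derives hash keys from business keys. The SHA-256 choice, the `||` delimiter, the upper-casing rule, and all column names are assumptions for illustration; real Data Vault implementations vary on each of these.

```python
import hashlib
from typing import Tuple


def hash_key(*business_keys: str) -> str:
    """Deterministic hash key from one or more normalized business keys."""
    normalized = "||".join(str(k).strip().upper() for k in business_keys)
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def build_vault_rows(policy_no: str, claim_no: str,
                     claim_status: str, load_ts: str) -> Tuple[dict, dict, dict, dict]:
    """Return (hub_policy, hub_claim, link_policy_claim, sat_claim) rows."""
    policy_hk, claim_hk = hash_key(policy_no), hash_key(claim_no)
    hub_policy = {"policy_hk": policy_hk, "policy_no": policy_no, "load_ts": load_ts}
    hub_claim = {"claim_hk": claim_hk, "claim_no": claim_no, "load_ts": load_ts}
    link = {
        "policy_claim_hk": hash_key(policy_no, claim_no),
        "policy_hk": policy_hk,
        "claim_hk": claim_hk,
        "load_ts": load_ts,
    }
    # hash_diff lets a loader detect attribute changes without comparing columns.
    sat_claim = {
        "claim_hk": claim_hk,
        "hash_diff": hash_key(claim_status),
        "claim_status": claim_status,
        "load_ts": load_ts,
    }
    return hub_policy, hub_claim, link, sat_claim
```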

Pricing & Underwriting Risk Analytics

Built ELT pipelines to create exposure, premium, and loss triangles by segment, product, and geography. Supports pricing adequacy, loss ratio, and retention analysis for underwriting and actuarial teams.

Impact: Reduced manual spreadsheet work and accelerated pricing review cycles.

💻 Code 📊 Dashboard
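
A cumulative loss triangle of the kind mentioned above can be built by bucketing payments by accident year and development lag. The sketch below assumes annual development periods and input tuples of `(accident_year, payment_year, amount)` with payment year never earlier than accident year; the function name and sample figures are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def cumulative_loss_triangle(
    payments: Iterable[Tuple[int, int, float]]
) -> Dict[int, Dict[int, float]]:
    """Return {accident_year: {development_lag: cumulative paid loss}}."""
    incremental = defaultdict(float)
    for accident_year, payment_year, amount in payments:
        incremental[(accident_year, payment_year - accident_year)] += amount

    triangle: Dict[int, Dict[int, float]] = {}
    for ay in sorted({ay for ay, _ in incremental}):
        max_lag = max(lag for a, lag in incremental if a == ay)
        running, row = 0.0, {}
        for lag in range(max_lag + 1):
            running += incremental.get((ay, lag), 0.0)  # missing lags add zero
            row[lag] = running
        triangle[ay] = row
    return triangle


if __name__ == "__main__":
    sample = [(2022, 2022, 100), (2022, 2023, 50), (2023, 2023, 120)]
    print(cumulative_loss_triangle(sample))
```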

📄 View More Insurance Projects

E-commerce Data Engineering & Analytics

Data foundations for traffic, product, and order analytics — powering conversion, personalization, and profitable growth.

Clickstream & Conversion Funnel Analytics

Ingests web events, sessions, and cart actions into a curated funnel model (visit → view → add-to-cart → checkout → purchase). Enables drop-off analysis by channel, device, and campaign.

Impact: Exposed high-drop funnels and improved conversion experiments.

💻 Code 📊 Funnel Dashboard
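
The funnel model above (visit → view → add-to-cart → checkout → purchase) can be sketched as a strict funnel in which a session counts toward a stage only if it also reached every earlier stage. The stage identifiers and helper names below are assumptions for illustration.

```python
from typing import Dict, Iterable, List, Optional, Tuple

STAGES = ["visit", "view", "add_to_cart", "checkout", "purchase"]


def funnel_counts(events: Iterable[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Count sessions reaching each stage, given (session_id, stage) events."""
    reached: Dict[str, set] = {}
    for session_id, stage in events:
        reached.setdefault(session_id, set()).add(stage)
    counts = []
    for i, stage in enumerate(STAGES):
        required = STAGES[: i + 1]  # strict funnel: all earlier stages required
        n = sum(1 for seen in reached.values() if all(s in seen for s in required))
        counts.append((stage, n))
    return counts


def drop_off_rates(counts: List[Tuple[str, int]]) -> Dict[str, Optional[float]]:
    """Share of sessions lost between each pair of consecutive stages."""
    rates = {}
    for (prev, n_prev), (cur, n_cur) in zip(counts, counts[1:]):
        rates[f"{prev}->{cur}"] = 1 - n_cur / n_prev if n_prev else None
    return rates
```

Grouping the input events by channel, device, or campaign before calling `funnel_counts` would give the segmented drop-off view described above.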

Product Recommendation & Personalization Mart

Built a user–item interaction mart combining orders, views, and wishlists for recommendation engines (collaborative filtering / “customers also bought”). Serves real-time feature sets to downstream models.

Impact: Framework ready for A/B testing of personalized recommendations.

💻 Code 📄 Feature Schema
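
A "customers also bought" list can be approximated with simple item co-occurrence counts over orders. The sketch below shows that baseline only; it is not a full collaborative filtering model, and the `also_bought` helper and sample orders are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations
from typing import Dict, Iterable, List


def also_bought(orders: Iterable[Iterable[str]], top_n: int = 3) -> Dict[str, List[str]]:
    """For each item, list the items most often purchased in the same order."""
    co_counts: Dict[str, Counter] = defaultdict(Counter)
    for items in orders:
        # De-duplicate within an order so repeated items are counted once.
        for a, b in combinations(sorted(set(items)), 2):
            co_counts[a][b] += 1
            co_counts[b][a] += 1
    return {
        item: [other for other, _ in counts.most_common(top_n)]
        for item, counts in co_counts.items()
    }


if __name__ == "__main__":
    sample_orders = [["A", "B"], ["A", "B", "C"], ["A", "C"], ["A", "B"]]
    print(also_bought(sample_orders))
```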

📄 View More E-commerce Projects

📂 Publications & Dashboards

📘 ICD-10-CM Bulk Loader & Profiling

Python notebook for bulk ICD-10-CM loading, validation, and profiling to support DQ and healthcare analytics pipelines.
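
As a small example of the validation and profiling step, the sketch below checks only the general shape of a dotted ICD-10-CM code (a letter, a digit, one alphanumeric character, then an optional dot with up to four more characters). It does not verify that a code exists in the official code set, and the pattern and `profile_codes` helper are assumptions rather than the notebook's actual logic.

```python
import re
from typing import Dict, Iterable

# Shape check only: e.g. "E11.9" or "S72.001A". Assumes the dotted format.
ICD10CM_PATTERN = re.compile(r"[A-Z][0-9][0-9A-Z](\.[0-9A-Z]{1,4})?")


def profile_codes(codes: Iterable[str]) -> Dict[str, object]:
    """Normalize codes, then report totals and any that fail the shape check."""
    cleaned = [c.strip().upper() for c in codes]
    invalid = [c for c in cleaned if not ICD10CM_PATTERN.fullmatch(c)]
    return {
        "total": len(cleaned),
        "distinct": len(set(cleaned)),
        "invalid": invalid,
    }


if __name__ == "__main__":
    print(profile_codes(["E11.9", " s72.001a ", "11.9", "E11.9"]))
```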

📊 Utilization & LOS Trends Dashboard

Interactive dashboard tracking utilization, LOS, and resource efficiency using Looker Studio with healthcare claims data.

📋 Certifications

PMP® — Project Management Professional

Project Management Institute

Credential ID: 3787028
Issued: Mar 2024 · Expires: Mar 2027

📄 View Certificate

DP-203 — Azure Data Engineer Associate

Microsoft

Credential ID: 2211773CB6CBA50F
Issued: Mar 2024 · Expires: Mar 2025

📄 View Certificate

SnowPro Core Certification

Snowflake

Credential ID: xxxxxxx
Issued: 2025


Databricks Data Engineer Associate

Databricks

Credential ID: xxxxxxx
Issued: 2025


AWS Certified Cloud Practitioner

Amazon Web Services

Credential ID: xxxxxxx
Issued: 2025


ITIL® 4 Foundation

AXELOS / PeopleCert

Credential ID: xxxxxxx
Issued: 2024


Professional Scrum Master™ I (PSM I)

Scrum.org

Credential ID: xxxxxxx
Issued: 2024


HIPAA Privacy & Security Awareness

Accountable

Certificate #: 7a473b29-7fce-4813-a462-b7207e139c46
Valid: Aug 2025 – Aug 2026

View Certificate

Discounted Cash Flow Valuation

365 Financial Analyst

Credential ID: 3787028
Issued: Dec 2025

📄 View Certificate

👷🏼‍♂️ Project Management Leadership

Certified Project Management Professional (PMP®) with hands-on leadership in agile and hybrid delivery models for healthcare and finance data programs. Skilled at aligning technical execution with strategic objectives, cross-functional coordination, and continuous delivery.

Contact

My approach blends engineering precision with strategic execution—bridging data pipelines, analytics, and project delivery to drive measurable outcomes across healthcare and finance domains.

Based in Boston, MA (02135) and on STEM OPT through March 2027, I’m open to opportunities in Data Engineering, Analytics, and Project Management where data can create meaningful impact.

Thank you for visiting my portfolio. I appreciate your time and interest in my work. Let's stay in touch: I would be glad to hear your thoughts and answer any questions about my work and experience.

© 2025 Pawan Jadhav — Data Engineering & Analytics Portfolio