Professional Summary

I am a results-driven Data Engineering and Analytics professional and Certified Project Management Professional (PMP®) with deep experience in healthcare and financial data ecosystems. I specialize in designing secure, scalable, and cost-optimized Big Data pipelines that transform raw data into actionable insights for decision-making, compliance, and strategic growth.

My expertise spans ETL/ELT development, data governance, and cloud integration across Azure, AWS, and Snowflake, leveraging tools such as Databricks, Airflow, and SQL to build automated and auditable workflows. I apply project management principles to every stage of development — from scope definition to delivery — ensuring projects meet quality, cost, and timeline objectives.

Combining strong technical acumen with leadership, stakeholder management, and agile execution skills, I bridge engineering precision with business strategy. Passionate about using data to improve healthcare quality, financial transparency, and operational efficiency, I focus on turning complex challenges into scalable, insight-driven solutions that deliver measurable impact.

Skills

Data Engineering & ETL

SQL / Python / PySpark / Databricks / Airflow / NiFi / Snowflake / dbt / Azure Data Factory (ADF) / AWS Glue / ETL Design & Optimization

Data Analytics & Visualization

Power BI / Tableau / Looker Studio / Advanced Excel / Statistical Analysis / Data Modeling / KPIs & Reporting Automation

Project Management & Leadership

PMP® Certified / Agile Scrum / SDLC / Stakeholder Management / Risk Mitigation / Delivery Planning / Governance

Tools & Platforms

Git / GitHub Actions / Docker / Linux Shell / PyCharm / DataGrip / Jupyter / Azure DevOps / ServiceNow / Jira

🌟 Featured Projects

FHIR ETL on Spark → Snowflake

End-to-end pipeline for FHIR JSON into curated Snowflake models. SCD2 for Member/Provider, DQ checks, lineage with dbt.

Impact: 2.5× faster loads; ~30% cost savings.

Healthcare Claims Anomaly Detection

ICD-10-CM + provider signals with Python/Spark for real-time anomaly scoring across fraud, waste, and abuse.

Impact: ↓ false positives ~18%; faster integrity reviews.

Finance Pricing & Margin Analytics

Airflow + Snowflake ELT; time-series KPIs & margin forecasts; exec dashboards for decision speed.

Impact: p95 report time 11m → 90s.

📂 Publications & Dashboards

📘 ICD-10-CM Bulk Loader & Profiling

Python notebook for bulk ICD-10-CM loading, validation, and profiling to support DQ and healthcare analytics pipelines.

📊 Utilization & LOS Trends Dashboard

Interactive dashboard tracking utilization, LOS, and resource efficiency using Looker Studio with healthcare claims data.

📋 Certifications

PMP® — Project Management Professional

Project Management Institute

Credential ID: 3787028
Issued: Mar 2024 · Expires: Mar 2027

DP-203 — Azure Data Engineer Associate

Microsoft

Credential ID: 2211773CB6CBA50F
Issued: Mar 2024 · Expires: Mar 2025

SnowPro Core Certification

Snowflake

Credential ID: xxxxxxx
Issued: 2025

Databricks Data Engineer Associate

Databricks

Credential ID: xxxxxxx
Issued: 2025

AWS Certified Cloud Practitioner

Amazon Web Services

Credential ID: xxxxxxx
Issued: 2025

ITIL® 4 Foundation

AXELOS / PeopleCert

Credential ID: xxxxxxx
Issued: 2024

Professional Scrum Master™ I (PSM I)

Scrum.org

Credential ID: xxxxxxx
Issued: 2024

HIPAA Privacy & Security Awareness

Accountable

Certificate #: 7a473b29-7fce-4813-a462-b7207e139c46
Valid: Aug 2025 – Aug 2026

👷🏼‍♂️ Project Management Leadership

Certified Project Management Professional (PMP®) with hands-on leadership in agile and hybrid delivery models for healthcare and finance data programs. Skilled at aligning technical execution with strategic objectives, cross-functional coordination, and continuous delivery.

Cloud Technologies

Platforms and tooling I use to build secure, scalable, cloud-native data systems, integrating Azure and AWS services for healthcare and finance analytics.

Azure

ADLS • ADF • Databricks • Synapse • Azure SQL • Functions • Key Vault • DevOps Pipelines

  • FHIR → Bronze/Silver/Gold (medallion architecture) on Databricks (PySpark) with Delta Lake.
  • ADF-based ELT to Synapse/Snowflake with dynamic, parameterized datasets.
  • Serverless Functions for micro-batch ingestion and DQ execution.
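
To illustrate the Bronze → Silver step in the first bullet, here is a minimal sketch in plain Python (standing in for PySpark so it runs anywhere). The function names `flatten_patient` and `to_silver` and the chosen output columns are hypothetical and are not taken from the project code; only the FHIR Patient field names (`id`, `name`, `gender`, `birthDate`) come from the FHIR standard.

```python
from typing import Any


def flatten_patient(resource: dict[str, Any]) -> dict[str, Any]:
    """Flatten one raw FHIR Patient resource (bronze) into a flat silver row."""
    names = resource.get("name") or [{}]
    # Prefer the name marked "official"; otherwise fall back to the first one.
    official = next((n for n in names if n.get("use") == "official"), names[0])
    return {
        "patient_id": resource.get("id"),
        "family_name": official.get("family"),
        "given_name": " ".join(official.get("given", [])) or None,
        "gender": resource.get("gender"),
        "birth_date": resource.get("birthDate"),
    }


def to_silver(bronze: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Keep Patient resources that carry an id and dedupe on id (last record wins)."""
    rows: dict[str, dict[str, Any]] = {}
    for res in bronze:
        if res.get("resourceType") == "Patient" and res.get("id"):
            rows[res["id"]] = flatten_patient(res)
    return list(rows.values())
```

In a Spark job the same logic would typically be expressed as column selections plus a dedupe on `patient_id`; the sketch only shows the shape of the transformation.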

AWS

S3 • Glue • DMS • Lambda • Step Functions • Athena • EMR • IAM

  • CDC via DMS → S3 → Glue ETL with partitioning & compaction (Parquet).
  • Lambda + Step Functions orchestrating serverless workflows.
  • Athena views for ad-hoc analytics; cost-optimized lifecycle policies.
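
As a small illustration of the partitioning mentioned in the first bullet, the sketch below builds Hive-style `key=value` object keys, a layout that Glue and Athena can treat as partitions. The function name `partition_key`, the table name, and the year/month/day scheme are assumptions for illustration, not the layout used in the actual pipelines.

```python
from datetime import datetime


def partition_key(table: str, event_time: datetime, filename: str) -> str:
    """Build a Hive-style partitioned object key (year=/month=/day=) for one file."""
    return (
        f"{table}/year={event_time:%Y}/month={event_time:%m}/"
        f"day={event_time:%d}/{filename}"
    )


# Example: a Parquet file for claims that arrived on 7 March 2025.
key = partition_key("claims", datetime(2025, 3, 7), "part-0000.parquet")
```

Partitioning by date lets queries that filter on a date range skip unrelated objects, which is one common way to keep scan costs down.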

Healthcare Data Engineering & Analytics

Data pipelines and analytics designed for clinical, claims, and regulatory ecosystems — ensuring interoperability, integrity, and actionable intelligence.

FHIR ETL — Spark to Snowflake

Built an automated ETL framework for ingesting FHIR-compliant JSON data using PySpark and Snowflake Streams/Tasks. Implemented SCD2 for Member/Provider tables, dbt tests, and lineage tracking.
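
For readers unfamiliar with SCD2 (slowly changing dimension, type 2), the sketch below shows the core idea in plain Python: when a tracked attribute changes, the current row is closed and a new version is appended, so history stays queryable. This is an illustration under assumptions, not the pipeline's code: `MemberRow`, `apply_scd2`, and the single tracked attribute `plan` are hypothetical names, and the real implementation runs on Snowflake rather than in-memory lists.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class MemberRow:
    member_id: str
    plan: str                        # the tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date] = None  # None marks the current version


def apply_scd2(history: list[MemberRow], incoming: dict[str, str],
               as_of: date) -> list[MemberRow]:
    """Return a new history: close changed current rows, append new versions."""
    current = {r.member_id: r for r in history if r.valid_to is None}
    result: list[MemberRow] = []
    for row in history:
        new_plan = incoming.get(row.member_id)
        if row.valid_to is None and new_plan is not None and new_plan != row.plan:
            result.append(replace(row, valid_to=as_of))  # close the old version
        else:
            result.append(row)
    for member_id, plan in incoming.items():
        cur = current.get(member_id)
        if cur is None or cur.plan != plan:
            result.append(MemberRow(member_id, plan, valid_from=as_of))
    return result
```

Re-running the same load produces no new rows, which keeps the step idempotent and makes reprocessing safe.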

Impact: 2.5× faster loads, improved auditability, and ~30% cost savings.

Claims Anomaly Detection

Developed an anomaly detection pipeline using ICD-10-CM data to flag potential fraud, waste, and abuse in claims. Integrated PySpark feature engineering, statistical modeling, and BI dashboards.
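
The statistical model is not specified here, so the sketch below uses one common choice as a stand-in: the modified z-score based on the median absolute deviation (MAD), with the conventional 3.5 cutoff. The function names, the per-provider billed amounts, and the threshold are all assumptions for illustration, not the project's actual features or model.

```python
from statistics import median


def robust_z_scores(values: list[float]) -> list[float]:
    """Modified z-score: 0.6745 * (x - median) / MAD. Returns zeros if MAD is 0."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        return [0.0 for _ in values]
    return [0.6745 * (v - med) / mad for v in values]


def flag_outliers(amounts_by_provider: dict[str, float],
                  threshold: float = 3.5) -> dict[str, float]:
    """Return providers whose absolute modified z-score exceeds the threshold."""
    providers = list(amounts_by_provider)
    scores = robust_z_scores([amounts_by_provider[p] for p in providers])
    return {p: round(s, 2) for p, s in zip(providers, scores) if abs(s) > threshold}


# Hypothetical average billed amount per provider for one diagnosis code.
flagged = flag_outliers({"A": 100, "B": 110, "C": 95, "D": 105, "E": 900})
```

A median-based score is used because a single extreme provider would inflate a mean and standard deviation and could mask itself; the median and MAD are far less sensitive to that.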

Impact: Reduced false positives by 18%; enabled near real-time integrity review.

Patient Utilization & LOS Dashboard

Designed a Looker Studio dashboard integrating EHR and claims data for patient utilization, LOS, and provider performance. Created semantic models and data marts on Snowflake for interactive insights.

Impact: Streamlined operational decision-making across ACO partner networks.

Finance Data Engineering & Analytics

Scalable data systems for financial insights — integrating multi-source data for pricing, profitability, and performance intelligence.

Pricing & Margin Analytics

Designed ETL pipelines for integrating product, sales, and cost data to calculate margins and KPIs. Built time-series marts in Snowflake and automated reports via Airflow.
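
As a worked example of the margin KPI, the sketch below aggregates revenue and cost by month and computes gross margin as (revenue - cost) / revenue. The function names and the `(month, revenue, cost)` row shape are hypothetical; the production marts are built in Snowflake and may define margin differently.

```python
from typing import Iterable, Optional


def gross_margin_pct(revenue: float, cost: float) -> Optional[float]:
    """Gross margin as a percentage of revenue; None when revenue is zero."""
    if revenue == 0:
        return None
    return (revenue - cost) / revenue * 100


def monthly_margins(
    rows: Iterable[tuple[str, float, float]]
) -> dict[str, Optional[float]]:
    """Sum revenue and cost per month, then compute margin on the monthly totals."""
    totals: dict[str, tuple[float, float]] = {}
    for month, revenue, cost in rows:
        rev, cst = totals.get(month, (0.0, 0.0))
        totals[month] = (rev + revenue, cst + cost)
    return {m: gross_margin_pct(r, c) for m, (r, c) in sorted(totals.items())}
```

Margin is computed on the monthly totals rather than by averaging per-row margins, so large and small transactions are weighted by their revenue.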

Impact: Report runtime improved from 11m → 90s; 40% faster KPI analysis.

Financial Forecasting Pipelines

Developed automated data ingestion and forecasting models using Snowflake Tasks and Python ARIMA for sales trend prediction. Integrated outputs into Power BI dashboards for business planning.

Impact: Improved forecast accuracy by 22%, supporting strategic pricing decisions.

Transaction Fraud Analytics

Built near-real-time fraud detection workflows leveraging event streams and anomaly detection in Snowflake with ML-based thresholds. Integrated dashboards to flag anomalies with root-cause drilldowns.

Impact: Reduced investigation time by 35%, improved detection recall by 15%.

Contact

My approach blends engineering precision with strategic execution, bridging data pipelines, analytics, and project delivery to drive measurable outcomes across healthcare and finance domains.

Based in Boston, MA (02135) and on STEM OPT through March 2027, I'm open to opportunities in Data Engineering, Analytics, and Project Management where data can create meaningful impact.

Thank you for visiting my portfolio. I appreciate your time and interest in my work. Let's stay in touch: I would be glad to hear your thoughts and answer any questions about my work and experience.

📄 Resume 📧 Email 🔗 LinkedIn 💻 GitHub 📞 +1 914-267-5356

© 2025 Pawan Jadhav — Data Engineering & Analytics Portfolio