FHIR ETL on Spark → Snowflake
End-to-end pipeline for FHIR JSON into curated Snowflake models. SCD2 for Member/Provider, DQ checks, lineage with dbt.
Impact: 2.5× faster loads; ~30% cost savings.
I am a results-driven Data Engineering and Analytics professional and Certified Project Management Professional (PMP®) with deep experience in healthcare and financial data ecosystems. I specialize in designing secure, scalable, and cost-optimized Big Data pipelines that transform raw data into actionable insights for decision-making, compliance, and strategic growth.
My expertise spans ETL/ELT development, data governance, and cloud integration across Azure, AWS, and Snowflake, leveraging tools such as Databricks, Airflow, and SQL to build automated and auditable workflows. I apply project management principles to every stage of development — from scope definition to delivery — ensuring projects meet quality, cost, and timeline objectives.
Combining strong technical acumen with leadership, stakeholder management, and agile execution skills, I bridge engineering precision with business strategy. Passionate about using data to improve healthcare quality, financial transparency, and operational efficiency, I focus on turning complex challenges into scalable, insight-driven solutions that deliver measurable impact.
SQL / Python / PySpark / Databricks / Airflow / NiFi / Snowflake / DBT / Azure ADF / AWS Glue / ETL Design & Optimization
Power BI / Tableau / Looker Studio / Advanced Excel / Statistical Analysis / Data Modeling / KPIs & Reporting Automation
PMP® Certified / Agile Scrum / SDLC Lifecycle / Stakeholder Management / Risk Mitigation / Delivery Planning / Governance
Git / GitHub Actions / Docker / Linux Shell / PyCharm / DataGrip / Jupyter / Azure DevOps / ServiceNow / Jira
End-to-end pipeline for FHIR JSON into curated Snowflake models. SCD2 for Member/Provider, DQ checks, lineage with dbt.
Impact: 2.5× faster loads; ~30% cost savings.
ICD-10 CM + provider signals with Python/Spark for real-time anomaly scoring across fraud, waste, abuse.
Impact: ↓ false positives ~18%; faster integrity reviews.
Airflow + Snowflake ELT; time-series KPIs & margin forecasts; exec dashboards for decision speed.
Impact: p95 report time 11m → 90s.
Project Management Institute
Credential ID: 3787028
Issued: Mar 2024 · Expires: Mar 2027
Microsoft
Credential ID: 2211773CB6CBA50F
Issued: Mar 2024 · Expires: Mar 2025
Accountable
Certificate #: 7a473b29-7fce-4813-a462-b7207e139c46
Valid: Aug 2025 – Aug 2026
Certified Project Management Professional (PMP®) with hands-on leadership in agile and hybrid delivery models for healthcare and finance data programs. Skilled at aligning technical execution with strategic objectives, cross-functional coordination, and continuous delivery.
Platforms and tooling used to build secure, scalable, and cloud-native data platforms — integrating Azure and AWS services for healthcare and finance analytics.
ADLS • ADF • Databricks • Synapse • Azure SQL • Functions • Key Vault • DevOps Pipelines
S3 • Glue • DMS • Lambda • Step Functions • Athena • EMR • IAM
Data pipelines and analytics designed for clinical, claims, and regulatory ecosystems — ensuring interoperability, integrity, and actionable intelligence.
Built an automated ETL framework for ingesting FHIR-compliant JSON data using PySpark and Snowflake Streams/Tasks. Implemented SCD2 for Member/Provider tables, dbt tests, and lineage tracking.
Impact: 2.5× faster loads, improved auditability, and 30% cost optimization.
Developed an anomaly detection pipeline using ICD-10 CM data to detect potential fraud, waste, and abuse in claims. Integrated PySpark feature engineering, statistical modeling, and BI dashboards.
Impact: Reduced false positives by 18%; enabled near real-time integrity review.
Designed a Looker Studio dashboard integrating EHR and claims data for patient utilization, LOS, and provider performance. Created semantic models and data marts on Snowflake for interactive insights.
Impact: Streamlined operational decision-making across ACO partner networks.
Scalable data systems for financial insights — integrating multi-source data for pricing, profitability, and performance intelligence.
Designed ETL pipelines for integrating product, sales, and cost data to calculate margins and KPIs. Built time-series marts in Snowflake and automated reports via Airflow.
Impact: Report runtime improved from 11m → 90s; 40% faster KPI analysis.
Developed automated data ingestion and forecasting models using Snowflake Tasks and Python ARIMA for sales trend prediction. Integrated outputs into Power BI dashboards for business planning.
Impact: Improved forecast accuracy by 22%, supporting strategic pricing decisions.
Built near-real-time fraud detection workflows leveraging event streams and anomaly detection in Snowflake with ML-based thresholds. Integrated dashboards to flag anomalies with root-cause drilldowns.
Impact: Reduced investigation time by 35%, improved detection recall by 15%.
My approach blends engineering precision with strategic execution—bridging data pipelines, analytics, and project delivery to drive measurable outcomes across healthcare and finance domains.
Based in Boston MA.(02135) and on STEM-OPT through March 2027, I’m open to opportunities in Data Engineering, Analytics, and Project Management where data can create meaningful impact.
Thank you for visiting my portfolio – I appreciate your time and interest in my work. Let's stay in touch. I would like to hear your thoughts and answer any questions you might have about my work and experience.