FHIR ETL on Spark → Snowflake
End-to-end pipeline for FHIR JSON into curated Snowflake models. SCD2 for Member/Provider, DQ checks, lineage with dbt.
Impact: 2.5× faster loads; ~30% cost savings.
I am a results-driven Data Engineering and Analytics professional and Certified Project Management Professional (PMP®) with deep experience in healthcare and financial data ecosystems. I specialize in designing secure, scalable, and cost-optimized Big Data pipelines that transform raw data into actionable insights for decision-making, compliance, and strategic growth.
My expertise spans ETL/ELT development, data governance, and cloud integration across Azure, AWS, and Snowflake, leveraging tools such as Databricks, Airflow, and SQL to build automated and auditable workflows. I apply project management principles to every stage of development — from scope definition to delivery — ensuring projects meet quality, cost, and timeline objectives.
Combining strong technical acumen with leadership, stakeholder management, and agile execution skills, I bridge engineering precision with business strategy. Passionate about using data to improve healthcare quality, financial transparency, and operational efficiency, I focus on turning complex challenges into scalable, insight-driven solutions that deliver measurable impact.
SQL / Python / PySpark / Databricks / Airflow / NiFi / Snowflake / DBT / Azure ADF / AWS Glue / ETL Design & Optimization
Power BI / Tableau / Looker Studio / Advanced Excel / Statistical Analysis / Data Modeling / KPIs & Reporting Automation
PMP® Certified / Agile Scrum / SDLC Lifecycle / Stakeholder Management / Risk Mitigation / Delivery Planning / Governance
Git / GitHub Actions / Docker / Linux Shell / PyCharm / DataGrip / Jupyter / Azure DevOps / ServiceNow / Jira
End-to-end pipeline for FHIR JSON into curated Snowflake models. SCD2 for Member/Provider, DQ checks, lineage with dbt.
Impact: 2.5× faster loads; ~30% cost savings.
ICD-10 CM + provider signals with Python/Spark for real-time anomaly scoring across fraud, waste, abuse.
Impact: ↓ false positives ~18%; faster integrity reviews.
Airflow + Snowflake ELT; time-series KPIs & margin forecasts; exec dashboards for decision speed.
Impact: p95 report time 11m → 90s.
Platforms and tooling used to build secure, scalable, and cloud-native data platforms — integrating Azure and AWS services for healthcare and finance analytics.
ADLS • ADF • Databricks • Synapse • Azure SQL • Functions • Key Vault • DevOps Pipelines
S3 • Glue • DMS • Lambda • Step Functions • Athena • EMR • IAM
Data platforms for FHIR, clinical, and claims ecosystems — enabling interoperability, care quality, and performance analytics.
Built an automated ETL framework for ingesting FHIR-compliant JSON data using PySpark and Snowflake Streams/Tasks. Implemented SCD2 for Member/Provider tables, dbt tests, and lineage tracking.
Impact: 2.5× faster loads, improved auditability, and ~30% cost optimization.
Developed an anomaly detection pipeline using ICD-10 CM and utilization signals to flag potential fraud, waste, and abuse. Includes profiling, feature engineering, and BI dashboards for SIU/integrity teams.
Impact: Reduced false positives by ~18%; faster case triage and review.
Data platforms for pricing, profitability, and risk — integrating multi-source financial data for faster, sharper decision-making.
Designed ELT pipelines that integrate product, sales, discount, and cost data to compute net price, margin, and KPIs. Snowflake marts power exec dashboards for segment, customer, and SKU-level profitability.
Impact: Report runtime improved from 11m → 90s; 40% faster KPI analysis.
Built near-real-time fraud detection workflows leveraging event streams and anomaly detection in Snowflake and Python. Features transaction velocity, merchant risk, device fingerprint, and geo anomalies for scoring.
Impact: Reduced investigation time by ~35%; improved detection recall by ~15%.
Data platforms and analytics for warehousing, transportation, and fulfillment — integrating WMS, TMS, and ERP data to improve inventory health, logistics cost, and OTIF performance.
End-to-end ETL pipeline integrating WMS, ERP, and POS data into curated inventory models on Snowflake / Databricks. Calculates inventory positions, reorder points, and safety stock by SKU–location with dbt tests and lineage.
Impact: Designed to reduce stockouts and excess inventory through data-driven replenishment.
Ingests shipment, carrier, and telematics (GPS) data to build lane-level KPIs such as cost-per-mile, cost-per-drop, and transit reliability. Uses Spark & Snowflake to power interactive dashboards for carrier and route benchmarking.
Impact: Enables optimization of freight spend and routing decisions across the network.
Scalable platforms for policy, claims, and actuarial data — enabling pricing, underwriting, fraud, and regulatory reporting at scale.
Designed a policy–claims data vault on Snowflake integrating policy admin, billing, and claims systems. Built hubs, links, and satellites to support 360° policy/insured view and downstream star schemas.
Impact: Simplified lineage and faster enablement of new actuarial & reporting use cases.
Built ELT pipelines to create exposure, premium, and loss triangles by segment, product, and geography. Supports pricing adequacy, loss ratio, and retention analysis for underwriting and actuarial teams.
Impact: Reduced manual spreadsheet work and accelerated pricing review cycles.
Data foundations for traffic, product, and order analytics — powering conversion, personalization, and profitable growth.
Ingests web events, sessions, and cart actions into a curated funnel model (visit → view → add-to-cart → checkout → purchase). Enables drop-off analysis by channel, device, and campaign.
Impact: Exposed high-drop funnels and improved conversion experiments.
Built a user–item interaction mart combining orders, views, and wishlists for recommendation engines (collaborative filtering / “customers also bought”). Serves real-time feature sets to downstream models.
Impact: Framework ready for A/B testing of personalized recommendations.
Project Management Institute
Credential ID: 3787028
Issued: Mar 2024 · Expires: Mar 2027
Microsoft
Credential ID: 2211773CB6CBA50F
Issued: Mar 2024 · Expires: Mar 2025
Accountable
Certificate #: 7a473b29-7fce-4813-a462-b7207e139c46
Valid: Aug 2025 – Aug 2026
356 Financial Analyst
Credential ID: 3787028
Issued: Dec 2025
Certified Project Management Professional (PMP®) with hands-on leadership in agile and hybrid delivery models for healthcare and finance data programs. Skilled at aligning technical execution with strategic objectives, cross-functional coordination, and continuous delivery.
My approach blends engineering precision with strategic execution—bridging data pipelines, analytics, and project delivery to drive measurable outcomes across healthcare and finance domains.
Based in Boston MA.(02135) and on STEM-OPT through March 2027, I’m open to opportunities in Data Engineering, Analytics, and Project Management where data can create meaningful impact.
Thank you for visiting my portfolio – I appreciate your time and interest in my work. Let's stay in touch. I would like to hear your thoughts and answer any questions you might have about my work and experience.