
DATA ENGINEER

Batch • Streaming • Reliability

Data Engineer with six months of internship experience at Intuit. I focus on how data moves from upstream sources to downstream consumers, with an emphasis on debugging, data quality, and reliability.

Spark • Airflow • Kafka • AWS • SQL • Python • dbt

Featured Projects

Real-Time Financial Data Pipeline

Kafka → Spark Streaming → Bronze/Silver/Gold → Analytics

  • Handled late-arriving + out-of-order events using event-time logic
  • Prevented duplicates with idempotent writes
  • Recovered safely via checkpoints
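
The event-time and idempotency ideas above can be illustrated with a minimal sketch in plain Python. This is not the project's Spark code: the `Event` fields, the `IdempotentSink` class, and the 60-second `allowed_lateness` are all hypothetical names and values chosen for illustration. The watermark trails the largest event time seen so far, and writes are keyed by `event_id` so that a replayed event overwrites itself instead of creating a duplicate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    event_id: str    # hypothetical unique key used for idempotent writes
    event_time: int  # seconds since epoch: when the event happened, not when it arrived
    amount: float


class IdempotentSink:
    """Keyed store: writing the same event_id twice still leaves one row."""

    def __init__(self):
        self.rows = {}

    def upsert(self, event):
        self.rows[event.event_id] = event


def process(events, sink, allowed_lateness=60):
    """Consume events in arrival order and return the ones dropped as too late.

    The watermark is (max event_time seen so far) - allowed_lateness.
    Out-of-order events at or above the watermark are still accepted.
    """
    max_event_time = None
    dropped = []
    for ev in events:
        if max_event_time is None or ev.event_time > max_event_time:
            max_event_time = ev.event_time
        watermark = max_event_time - allowed_lateness
        if ev.event_time < watermark:
            dropped.append(ev)  # too late: a real pipeline might route this to a dead-letter store
            continue
        sink.upsert(ev)         # replays overwrite the same key, so no duplicates
    return dropped
```

Because the sink is keyed, replaying a batch after a restart from a checkpoint leaves the stored rows unchanged, which is what makes recovery safe in this sketch.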

Customer 360 Data Platform

Multiple sources → Incremental ingestion → Spark → Warehouse

  • Unified CRM + Payments + Support into one customer model
  • Reduced runtime using incremental processing
  • Added dbt-style quality checks (nulls/uniqueness)
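
As a rough illustration of the null and uniqueness checks, here is a small plain-Python sketch. It mirrors the intent of dbt's `not_null` and `unique` tests but is not dbt itself; the function names and the `customer_id` / `email` columns in the usage note are assumptions made for the example.

```python
from collections import Counter


def check_not_null(rows, column):
    """Return the indexes of rows where `column` is missing or None."""
    return [i for i, row in enumerate(rows) if row.get(column) is None]


def check_unique(rows, column):
    """Return the non-null values of `column` that appear more than once."""
    counts = Counter(
        row.get(column) for row in rows if row.get(column) is not None
    )
    return [value for value, n in counts.items() if n > 1]
```

Each check returns the offending rows or values rather than a bare pass/fail, so a failing run points directly at what to inspect. For example, `check_unique(rows, "customer_id")` on a unified customer table would list any customer IDs that were not merged into a single record.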

Data Observability Framework

Quality checks → Metrics → Dashboards → Alerts

  • Detected “pipeline success but data wrong” issues
  • Added row-count and freshness anomaly checks
  • Built an alerting workflow for fast root-cause analysis
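
One simple way to catch "pipeline succeeded but the data is wrong" is sketched below in plain Python. It is only an illustration under stated assumptions: the z-score rule, the threshold of 3.0, the two-hour freshness limit, and both function names are hypothetical choices, not the framework's actual implementation.

```python
from datetime import datetime, timedelta
from statistics import mean, stdev


def row_count_is_anomalous(history, today, z_threshold=3.0):
    """Flag today's row count if it is far from the recent average.

    `history` is a list of recent daily row counts. With fewer than two
    points there is no spread to compare against, so nothing is flagged.
    """
    if len(history) < 2:
        return False
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        return today != mu
    return abs(today - mu) / sigma > z_threshold


def is_stale(last_loaded_at: datetime, now: datetime,
             max_age: timedelta = timedelta(hours=2)) -> bool:
    """Flag a table whose most recent load is older than `max_age`."""
    return now - last_loaded_at > max_age
```

A job that exits cleanly but writes 400 rows when recent days averaged about 1,000 would be flagged by the first check; a job that silently stopped running would be flagged by the second.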

Certifications

Skills

Core

Python, SQL, Spark, Airflow, Kafka

Cloud & Data

AWS (S3, Athena), Warehousing, Data Modeling, Quality

Experience

Data Engineer Intern — Intuit (6 months)

  • Worked on pipeline reliability, transformations, and debugging data issues
  • Improved data quality checks and investigated failures using logs/metrics
  • Collaborated with analytics/platform stakeholders

Contact

Reach out for Data Engineer opportunities or collaborations.