🚀 Data Engineer specializing in Big Data & Cloud
🎓 MTech (Cloud Computing) @ IIT Patna
⚡ Proven impact: 73% runtime reduction • Building high-performance data systems
🚀 5+ years in Data Engineering (BFSI, Life Sciences, Retail)
⚡ Reduced pipeline runtime by 73% (40h → 10.5h)
🏗️ Strong in PySpark, SQL, Airflow, AWS, Databricks
🧩 Experience in Medallion Architecture & SCD Type 3
🤝 Worked with American Express, Regeneron, Lloyds Banking Group
💡 I focus on building production-grade, scalable, high-performance data systems — not just pipelines.
PySpark Apache Spark Hadoop Python
AWS (S3, EC2) Databricks Iceberg
Airflow Jenkins GitHub Actions CI/CD
PostgreSQL Hive Impala
Python SQL Scala Bash
👉 https://bytesofdeepankar.hashnode.dev/
Design scalable distributed data pipelines
Optimize performance & reduce processing cost
Migrate legacy systems → modern cloud architectures
Build reliable orchestration workflows
Deliver clean, analytics-ready datasets
⚡ 73% runtime reduction on a critical pipeline
📈 20% performance improvement via Spark tuning
🧩 Built MDM pipelines using Medallion Architecture
🔁 Migrated Parquet → Iceberg (future-ready data lake)
👨‍💻 Led teams & mentored developers
I love trekking. I've explored the lower and middle Himalayas and am now moving toward high-altitude crossover passes.
Completed:
- Nag Tibba (The 1st one!)
- Kheerganga
- Tungnath-Chandrashila
- Kedarkantha
- Raghupur Fort (Shoja)
The Bucket List:
- Hampta Pass (Kullu to Spiti Crossover)
- Buran Ghati (For the legendary ice-slide)
- Kuari Pass (To see Nanda Devi up close)

