DEEPANKAR YADAV YadavDeepankar

👋 Hey, I'm Deepankar Yadav

🚀 Data Engineer specializing in Big Data & Cloud
🎓 MTech (Cloud Computing) @ IIT Patna
⚡ Proven impact: 73% runtime reduction • Building high-performance data systems

🧠 About Me

🚀 5+ years in Data Engineering (BFSI, Life Sciences, Retail)
⚡ Reduced pipeline runtime by 73% (40h → 10.5h)
🏗️ Strong in PySpark, SQL, Airflow, AWS, Databricks
🧩 Experience in Medallion Architecture & SCD Type 3
🤝 Worked with American Express, Regeneron, Lloyds Banking Group

💡 I focus on building production-grade, scalable, high-performance data systems — not just pipelines.

⚙️ Tech Stack

🚀 Data & Processing

PySpark Apache Spark Hadoop Python

☁️ Cloud & Platforms

AWS (S3, EC2) Databricks Iceberg

🔄 Orchestration & DevOps

Airflow Jenkins GitHub Actions CI/CD

🗄️ Storage & Query

PostgreSQL Hive Impala

💻 Languages

Python SQL Scala Bash

🌱 My Digital Garden

👉 https://bytesofdeepankar.hashnode.dev/

📊 What I Do

Design scalable distributed data pipelines
Optimize performance & reduce processing cost
Migrate legacy systems → modern cloud architectures
Build reliable orchestration workflows
Deliver clean, analytics-ready datasets

🏆 Key Highlights

⚡ 73% runtime optimization on critical pipeline
📈 20% performance improvement via Spark tuning
🧩 Built MDM pipelines using Medallion Architecture
🔁 Migrated Parquet → Iceberg (future-ready data lake)
👨‍💻 Led teams & mentored developers

🏔️ Beyond Code

I love trekking. I've explored the lower and middle Himalayas and am now moving toward high-altitude crossover passes.

Completed:
- Nag Tibba (The 1st one!)
- Kheerganga
- Tungnath-Chandrashila
- Kedarkantha
- Raghupur fort (Shoja)
The Bucket List:
- Hampta Pass (Kullu to Spiti Crossover)
- Buran Ghati (For the legendary ice-slide)
- Kuari Pass (To see Nanda Devi up close)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly