Pune District, Maharashtra, India
Data Engineer with 5+ years of experience building scalable, cloud-native data platforms and high-performance ETL/ELT pipelines across Azure and AWS ecosystems. Currently delivering production-grade solutions across the end-to-end data lifecycle, including data ingestion, transformation, orchestration, optimization, monitoring, and production support. Experienced in designing reliable and scalable architectures with a strong focus on performance, data quality, observability, and operational excellence. Key Skills: • Python, SQL, PySpark • Azure (ADF, ADLS Gen2, Key Vault, Synapse Analytics, Fabric, Logic Apps) • AWS (S3, Glue, Redshift, EMR, Lambda, Kinesis, Step Functions, SageMaker, Bedrock) • Data Processing - Apache Spark, Apache Kafka • Lakehouse Technologies - Iceberg, Hudi, Delta Lake • Data Modeling, Performance Optimization, Data Quality & SLA Management • Databricks - Unity Catalog, Lakeflow Declarative Pipelines, Data Asset Bundles • CI/CD & IaC - Git, GitHub, Bitbucket, Azure DevOps, Jenkins, Terraform Highlights: • Built scalable batch and near real-time data pipelines supporting analytics and reporting workloads. • Improved processing performance, reliability, and maintainability through optimized data models and transformations. • Contributed to production support, monitoring, incident resolution, and operational excellence for business-critical platforms. • Delivered cloud-native solutions following Bronze–Silver–Gold Medallion architecture with a strong focus on governance and scalability. • Built common frameworks for data quality, reconciliation, and operational monitoring to ensure data reliability and consistency. Certifications: 🏅 Microsoft Certified: Azure Data Engineer Associate 🏅 Microsoft Certified: Fabric Analytics Engineer Associate 🏅 Databricks Certified Data Engineer Associate
Client - Wesco International
Client - Reckitt Benckiser Group PLC
Client - Unilever PLC