Thane, Maharashtra, India
Data Engineer | ETL & Business Analytics | Big Data & Cloud Solutions

I am a Data Engineer with nearly 4 years of experience designing scalable, high-performance data solutions that power insights and optimize costs across the Maritime, Telecom, and Retail sectors. My expertise lies in building robust data pipelines, streamlining ETL/ELT processes, and delivering real-time analytics using Snowflake, dbt, AWS, Databricks, Azure, and Apache Spark.

Career Highlights

Fleet Management Limited (Oct 2025 to Present)
Infrastructure Modernization: Orchestrated the upgrade of Amazon EMR and Apache Airflow versions using AWS CloudFormation Templates (CFT), ensuring platform stability and performance.
Performance Optimization: Refined complex data pipelines, achieving a ~40% reduction in job execution time.
Data Quality Engineering: Developed automated validation scripts to enforce Null, Row count, and Primary Key (PK) integrity checks for Data Mart tables.
Storage Efficiency: Built a custom compaction script to optimize table structures, significantly reducing S3 storage overhead and improving query performance.
Ingestion Automation: Streamlined workflows by automating CSV-to-Table creation; engineered code that dynamically fetches S3 uploads, infers schemas, and generates corresponding tables.

Jio Platforms Limited (July 2022 to Oct 2025)
Associate Data Engineer: Contributed to large-scale data solutions within the telecom and retail ecosystems, improving system performance by 70-75% through custom data models.
Telecom Analytics: Optimized ETL pipelines for service request analytics, boosting First Time Resolution (FTR) tracking by 30%.
Real-Time Monitoring: Developed dashboards for SAP HANA migration latency, ensuring seamless transitions to Big Data platforms.
Tech Stack & Expertise
Big Data & Infrastructure: Spark (PySpark), Kafka, Airflow, Amazon EMR, Hive, HDFS, Apache Iceberg
Cloud Platforms: AWS (S3, CFT), Azure (Databricks, ADLS)
Programming & Databases: Python, SQL, SAP HANA
Data Visualization: Tableau, Power BI
At Fleet Management Limited, I lead infrastructure modernization by upgrading EMR and Airflow environments via AWS CloudFormation Templates, while optimizing ETL pipelines to achieve a roughly 40% reduction in job execution time. I have strengthened data reliability by implementing automated validation scripts for Data Marts, and I engineered custom compaction solutions to resolve S3 storage issues and improve query performance. Additionally, I streamlined data onboarding by developing an automated ingestion engine that dynamically maps schemas and creates tables from S3-hosted CSV files.
Worked as a Data Engineer in the retail and telecom domains, responsible for designing and orchestrating large-scale data pipelines, ETL processes, and analytics platforms using PySpark, Hadoop, and SQL.
Built data models based on business requirements and delivered complete solutions.