Prasad Badhan

AI Data Engineer @ FML | Ex-Jio | AWS | Python | PySpark | Big Data | LLM Integration

Thane, Maharashtra, India

About

💡 Data Engineer | 🛠 ETL & Business Analytics | 🌐 Big Data & Cloud Solutions

I am a Data Engineer with nearly 4 years of experience designing scalable, high-performance data solutions that power insights and optimize costs across the Maritime, Telecom, and Retail sectors. My expertise lies in building robust data pipelines, streamlining ETL/ELT processes, and delivering real-time analytics using Snowflake, dbt, AWS, Databricks, Azure, and Apache Spark.

🚀 Career Highlights

Fleet Management Limited (Oct 2025 – Present)
- Infrastructure Modernization: Orchestrated the upgrade of Amazon EMR and Apache Airflow versions using AWS CloudFormation Templates (CFT), ensuring platform stability and performance.
- Performance Optimization: Refined complex data pipelines, achieving a ~40% reduction in job execution time.
- Data Quality Engineering: Developed automated validation scripts to enforce null, row count, and primary key (PK) integrity checks for Data Mart tables.
- Storage Efficiency: Built a custom compaction script to optimize table structures, significantly reducing S3 storage overhead and improving query performance.
- Ingestion Automation: Streamlined workflows by automating CSV-to-table creation; engineered code that dynamically fetches S3 uploads, infers schemas, and generates the corresponding tables.

Jio Platforms Limited (Jul 2022 – Oct 2025)
- Associate Data Engineer: Contributed to large-scale data solutions within the telecom and retail ecosystems, improving system performance by 70–75% through custom data models.
- Telecom Analytics: Optimized ETL pipelines for service request analytics, boosting First Time Resolution (FTR) tracking by 30%.
- Real-Time Monitoring: Developed dashboards for SAP HANA migration latency, ensuring seamless transitions to Big Data platforms.

🛠 Tech Stack & Expertise
- Big Data & Infrastructure: Spark (PySpark), Kafka, Airflow, Amazon EMR, Hive, HDFS, Apache Iceberg
- Cloud Platforms: AWS (S3, CFT), Azure (Databricks, ADLS)
- Programming & Databases: Python, SQL, SAP HANA
- Data Visualization: Tableau, Power BI

Experience

Data Engineer at Fleet Management Limited
Oct 2025 - Present · 9 mos
At Fleet Management Limited, I lead infrastructure modernization, upgrading EMR and Airflow environments via AWS CloudFormation Templates and optimizing ETL pipelines to cut job execution time by roughly 40%. I have strengthened data reliability with automated validation scripts for Data Mart tables (null, row count, and primary key checks) and built a custom compaction script that reduces S3 storage overhead and improves query performance. I also streamlined data onboarding with an automated ingestion engine that infers schemas and creates tables from CSV files uploaded to S3.
Associate Data Engineer at Jio Platforms Limited (JPL)
Mar 2024 - Oct 2025 · 1 yr 8 mos
Worked as a Data Engineer in the retail and telecom domains, responsible for designing and orchestrating large-scale data pipelines, ETL processes, and analytics platforms using PySpark, Hadoop, and SQL.
Jio (Navi Mumbai, Maharashtra, India)
- Data Engineer
  Mar 2023 - Mar 2024 · 1 yr 1 mo
  Built data models based on business requirements and delivered complete solutions.
- Assistant Manager
  Jul 2022 - Mar 2023 · 9 mos
Team Lead at Vouchers Portal
Jul 2021 - Jun 2022 · 1 yr
Python & ML at TCR Innovation
Aug 2021 - Oct 2021 · 3 mos