Prasad Badhan

AI Data Engineer @ FML | Ex - Jio | AWS | Python | PySpark | Big Data | LLM Integration

Thane, Maharashtra, India

About

๐Ÿ’ก Data Engineer | ๐Ÿ›  ETL & Business Analytics | ๐ŸŒ Big Data & Cloud Solutions I am a Data Engineer with nearly 4 years of experience designing scalable, high-performance data solutions that power insights and optimize costs across the Maritime, Telecom, and Retail sectors. My expertise lies in building robust data pipelines, streamlining ETL/ELT processes, and delivering real-time analytics using Snowflake, Dbt, AWS, Databricks, Azure, and Apache Spark. ๐Ÿš€ Career Highlights Fleet Management Limited (Oct 2025 โ€“ Present) Infrastructure Modernization: Orchestrated the upgrade of Amazon EMR and Apache Airflow versions using AWS CloudFormation Templates (CFT), ensuring platform stability and performance. Performance Optimization: Refined complex data pipelines, achieving a ~40% reduction in job execution time. Data Quality Engineering: Developed automated validation scripts to enforce Null, Row count, and Primary Key (PK) integrity checks for Data Mart tables. Storage Efficiency: Built a custom compaction script to optimize table structures, significantly reducing S3 storage overhead and improving query performance. Ingestion Automation: Streamlined workflows by automating CSV-to-Table creation; engineered code that dynamically fetches S3 uploads, infers schemas, and generates corresponding tables. Jio Platforms Limited (July 2022 to Oct 2025) Associate Data Engineer: Contributed to large-scale data solutions within the telecom and retail ecosystems, improving system performance by 70โ€“75% through custom data models. Telecom Analytics: Optimized ETL pipelines for service request analytics, boosting First Time Resolution (FTR) tracking by 30%. Real-Time Monitoring: Developed dashboards for SAP HANA migration latency, ensuring seamless transitions to Big Data platforms. ๐Ÿ›  Tech Stack & Expertise Big Data & Infrastructure: Spark (PySpark), Kafka, Airflow, Amazon EMR, Hive, HDFS, Apache Iceberg Cloud Platforms: AWS (S3, CFT), Azure (Databricks, ADLS) Programming & Databases: Python, SQL, SAP HANA Data Visualization: Tableau, Power BI

Experience

  • Data Engineer at Fleet Management Limited
    Oct 2025 - Present ยท 9 mos

    At Fleet Management Limited, I lead infrastructure modernization by upgrading EMR and Airflow environments via AWS CloudFormation Templates, while optimizing ETL pipelines to achieve a 40% reduction in job execution time. I have strengthened data reliability by implementing automated validation scripts for Data Marts and engineered custom compaction solutions to resolve S3 storage issues and improve query performance. Additionally, I streamlined data onboarding by developing an automated ingestion engine that dynamically maps schemas and creates tables from S3-hosted CSV files.

  • Associate Data Engineer at Jio Platforms Limited (JPL)
    Mar 2024 - Oct 2025 ยท 1 yr 8 mos

    Worked as a Data Engineer in the retail and telecom domains, responsible for designing and orchestrating large-scale data pipelines, ETL processes, and analytics platforms using PySpark, Hadoop, and SQL.

  • Jio (Navi Mumbai, Maharashtra, India)
    • Data Engineer
      Mar 2023 - Mar 2024 ยท 1 yr 1 mo

      Make models based on business requirements and provide complete solution

    • Assistant Manager
      Jul 2022 - Mar 2023 ยท 9 mos

  • Team Lead at Vouchers Portal
    Jul 2021 - Jun 2022 ยท 1 yr

  • Python & ML at TCR Innovation
    Aug 2021 - Oct 2021 ยท 3 mos