Gurugram, Haryana, India
Senior Engineering Manager with 12 years of experience building and scaling data platforms, lakehouse architectures, and AI-powered products. Proven track record of leading high-performing engineering teams, defining platform strategy, and delivering reliable, cost-efficient infrastructure at petabyte scale. Built and scaled engineering teams, led the development of multi-petabyte data platforms integrating 100+ data sources, and delivered $500K+ in annual infrastructure savings through FinOps initiatives. Experienced in partnering with product, analytics, and business stakeholders to align engineering investments with organizational goals. Expertise includes Engineering Leadership, Data Platforms, Lakehouse Architecture (Iceberg), Distributed Systems, Cloud Infrastructure, FinOps, Data Engineering. Top 3% Stack Overflow contributor with 13,500+ reputation points.
Leading engineering efforts for AI products.
• Managed a Data Engineering team with 8 direct reports (including 3 senior engineers), driving delivery execution, talent development, and engineering excellence. • Owned data platform strategy and ingestion architecture for critical business and telemetry datasets, partnering with stakeholders to define SLAs, data contracts, and governance standards. • Led the development and operations of a multi-petabyte S3 + Iceberg lakehouse, integrating 100+ data sources while ensuring strong governance, compliance, and access controls. • Drove FinOps initiatives including S3 lifecycle optimization, ARM migration, and compute right-sizing, delivering $500K+ in annual cost savings; generated QBR engineering metrics for senior leadership.
• Built and scaled the Data Pipelines Engineering team in India from scratch through hiring, mentorship, and internship programs. • Managed lakehouse ingestion for 50+ data sources, implementing robust data quality controls and operational SLAs for batch and streaming workloads. • Implemented Medallion Architecture (Bronze/Silver/Gold) to improve data reliability, governance, and standardization. • Owned the Airflow platform supporting multiple analytics teams, driving standardized deployments, platform reliability, and operational efficiency at scale. • Defined and executed the data quality strategy, implementing controls for completeness, accuracy, schema drift detection, reconciliation, and automated reporting.
• Built ETL data pipelines using AWS EMR and AWS Batch. • Developed scalable, low-latency microservices on Kubernetes (EKS) for recommendation systems. • Collaborated with the Data Science team to productionize machine learning models in cloud environments.
• Designed a query federation layer using Spark and Drill for a big data warehouse product. • Built a blockchain-agnostic data tracking solution to improve data lineage. • Developed a metadata management module supporting RDBMS, Hive, and NoSQL datastores with full-text search using Solr.
• Developed a data blending module for complex data transformations and migrations. • Built Java-based REST APIs for a data warehouse web application using Spring. • Contributed to Kundera, an open-source polyglot ORM for NoSQL datastores.