Gurugram, Haryana, India
I turn data into decisions—and data pipelines into business growth. - With 3+ years of experience in building and optimizing large-scale data processing systems, I help organizations unlock the true value of their data through scalable ETL/ELT workflows, cloud-first architecture, and strong data governance. - Known for improving pipeline reliability and accelerating analytics delivery, I have contributed to mission-critical data solutions for enterprise clients including Google and Hewlett-Packard, ensuring high availability, accuracy, and performance at scale. 💡 Career Highlights ★ Improved pipeline reliability with automation — reducing failures & downtime by 30% in 24/7 ETL systems ★ Led migration of 10TB+ data with 99.9% integrity, boosting performance and accelerating insights delivery ★ Optimized ETL workflows in Vertica to reduce processing time and improve overall system efficiency by 7–7.5% ★ Proactive incident management ensuring minimal business disruption and faster RCA turnaround 🧩 Areas of Expertise • End-to-end Data Engineering • Cloud Data Pipelines — Azure & AWS • Data Integration & ETL (Talend, Informatica, ADF, KETL) • Big Data Processing — Spark, Hadoop • Data Quality, Automation & Monitoring • SQL & Python-based analytics workflows 🛠️ Technical Skillset • Tech Stack: Python, SQL, PySpark, Linux/Unix • Cloud: Azure (ADF, ADLS), AWS (S3) • Databases & DWH: Vertica, MySQL, Redshift, Synapse • Version Control: Git, GitHub
Key Highlights • Built & maintained 24/7 ETL pipelines for large-scale weather data • Automated validations + error workflows → 30% reduction in failures & downtime • Ensured pipeline stability by monitoring health using Flower & PLX • Improved data quality through proactive schema handling & issue resolution Tech: Python, PySpark, SQL, Databricks, ADF, ADLS, Talend, KETL
Orchestrated the seamless extraction and transformation of raw data from TERA Hive to CDR Vertica. Adeptly utilized Talend as the ETL tool. Achieved a notable 20% enhancement in sales pipeline data management. Proficiently wielded SQL for data manipulation and analysis. Executed expertise in using data visualization tools like Tableau and Power BI for impactful reporting. Developed a sophisticated predictive model using machine learning algorithms. Applied advanced feature engineering techniques to extract pertinent patterns and trends. Proactively managed and resolved intricate technical incidents, leading initiatives to swiftly identify root causes. Proficient in incident management processes, including incident identification, prioritization, and escalation.