Karan V Upadhyay

Data at DoorDash

San Francisco Bay Area

About

Seasoned Developer, Problem solver and an Engineer. Working from last 5 years in an overlapping role of Software engineer, Data Engineer and a little bit of Machine learning engineer. Main focus areas are ETL, Pipeline, Cloud Technologies, Software engineering life cycle and CI/CD. I am working as a DE at AWS where I get to implement ETL pipelines for huge datasets in AWS infrastructure where I learnt AWS proprietary Pipelines, internal tools and how product company operates. I have also worked for almost 5 years at MTX Group Inc. It was a small startup when I joined focused on creating a machine learning product and learnt about multiple roles in a company at early age where I learnt implementing ML models, Data Pipeline and Google cloud platform.

Experience

  • Data Engineer at DoorDash
    Apr 2026 - Present · 3 mos

    Data Engineer for New Verticals. Responsible for Merchant Reports.

  • Data Engineer at Meta
    Sep 2024 - Mar 2026 · 1 yr 7 mos

    • Designed telemetry analytics pipelines for Meta Ray-Ban smart glasses, processing data from over 5M devices. • Architected batch data pipelines using Spark, transforming high-volume telemetry streams into analytics-ready tables. • Developed engineering dashboards in Unidash, supporting reliability analytics across multiple telemetry domains.

  • Data Engineer at Amazon Web Services (AWS)
    Dec 2022 - Jul 2024 · 1 yr 8 mos

    Designed and developed multiple ETl pipelines with PySpark, Redshift, EMR, Airflow and AWS Glue. Was responsible for developing infrastructure with AWS CDK with typescript for CI/CD continues development. Got exposure to understand Apache Druid for sub second aggregation query results for 100s of GB of Data.

  • Software Engineer - Machine Learning / Data Engineer at MTX Group Inc
    Jan 2019 - Nov 2022 · 3 yrs 11 mos

    - Lead ETL project with 5 Engineers for a government agency to transfer data from Salesforce to Google Cloud Bigquery with help of GCP Data Fusion, Salesforce, Python, Spark, Ephemeral Clusters, Bigquery and IAM for Row level security. - Responsible for multiple data migration from Legacy data bases to Salesforce with help of SQL, Salesforce connected app, Python, Spark, Jupyter, Pandas, Numpy and excel. - Implemented multiple Dashboards for complex datasets in GCP Data Studio and Tablues including but not limit to Histograms, Tabular Data, World maps, Bubble Charts and density graphs. - Exposure to multiple Machine Learning models from Loan Approval probability with binary classification to Analyzing condition of infrastructure with images using CNN neural networks with help of PyTorch, Python, keras, scikit-learn, numpy, pandas and GCP.

  • Salesforce Technical Consultant at MTX Group Inc
    May 2018 - Aug 2018 · 4 mos

    MTX is a service based(Salesforce) company as well as product(Artificial Intelligence) based company. Salesforce Development : - Learnt Salesforce Lightning Development. - I was part of a 3 sprint project which includes development with help of Communities, Lightning component, APEX classes, Triggers, Custom Objects , Batch jobs and Data Loader. - Built a custom search functionality which finds all the records associated with chatter HashTags. Artificial Intelligence: - Created a Convolutional Neural Network to recognise Pneumonia from a data set of 5000 graphs with help of Pytorch Library. - With help of OpenCV library converted a Drone's video footage in to frames and identified faces from it and accurately measured emotion of a person