Aayush Chaudhry

Software Engineer at Google

San Francisco Bay Area

About

Experience

  • Google (Full-time · 4 yrs 2 mos)
    • Software Engineer III
      Nov 2025 - Present · 9 mos

      ML at Google Maps, Visual Lanes. Working on Roads Data Quality, improving bus lanes detection and lane connectivity on all road priorities by using SoTA Computer Vision modeling on Satellite, Aerial and StreetView imagery.

    • Software Engineer III
      Apr 2024 - Nov 2025 · 1 yr 8 mos

      ML at Google Search, Content Understanding - Lead quality efforts to improve scoring/relevance model for named entity recognition system for Google Search which annotates and scores knowledge graph entities on XXB indexed documents. - Fine-tuned a SOTA Gemini Encoder model which acts as a teacher model for a BERT based student model. Performed successful teacher distillation to student, preserving most of the teacher model improvements. - Led efforts to reduce serving latency for Lens query generation model. Experimented with various serving optimization to reduce serving latency by ~50% on XXXM traffic.

    • Software Engineer II
      Jun 2022 - Apr 2024 · 1 yr 11 mos

      Part of Knowledge Engine Quality team in Google Search, improved search ranking and relevance. Projects undertaken - Metrics correlation and forecasting: Did an impact analysis of Document Understanding system of Knowledge Graph on Search. Established a correlation between Document Understanding metrics and Search metrics to drive future launches to get an estimate of improvement in Search. - Short Video Ranking: Improved ranking of Short Videos being displayed on Search Result Page by using a similarity match between annotated/ contextual KG entities for a query and KG entities for retrieved documents. - Embedding based Document Understanding and Nearest Neighbor Vector Search: Created a system for annotating KG entities in a document which used a Dual Encoder architecture to create document and KG entities embeddings. The document embeddings were used to perform a nearest neighbor search on an index of all KG entities to retrieve the closest entities relevant to the document.

  • Software Development Engineer 1 at Amazon
    Feb 2022 - Jun 2022 · 5 mos

    • Worked on designing and implementing API and services for KYC process of sellers, assisting compliance team to verify documents submitted by sellers, thereby increasing velocity for seller registration. • Did impact analysis for legacy APIs for the initiative of sellers registering for various marketplaces. • Actively involved in On-Call tasks for bug fixing and unblocking registering sellers.

  • Walmart Global Tech India (Bangalore Urban, Karnataka, India)
    • Software Engineer II
      Jul 2021 - Feb 2022 · 8 mos

      • Developed data ingestion pipeline for external data to be processed and consumed by internal research orgs. • Contributed in development of a uniform data ingestion framework used org wide. Built an efficient config parser along with test driven development. • Developed an efficient tool for scraping and parsing open source data and making it available for downstream users

    • Software Engineer Intern
      Jan 2021 - Jul 2021 · 7 mos

      •Documented optimal client configurations for various databases in a polyglot cloud environment. •Parallelized CI/CD builds thereby reducing the build time by 3 times. •Developed and integrated the Cassandra endpoint in the framework. Built the ORM for Cassandra and wrote robust unit tests.

  • Software Engineer Intern at Walmart Global Tech India
    May 2020 - Jul 2020 · 3 mos

    • Automated ETL generation, table creation and table alteration by templatizing the respective YAML files. It resulted in reduction of number of lines of code by 10 times, reduction in touch points for code modification, easy debugging since the number of lines have been reduced significantly, could be used to validate legacy ETLs, less engineer hours being spent to modify the existing ETL • Built an ElasticSearch backend for the Global Data Organization. Designed and wrote the schema for the ElasticSearch indices. The indices were to be populated by parsing the respective Confluence pages. Created a queuing mechanism, using minimum API calls, to parse subsequent child pages when given a parent document, created an efficient insertion and updation mechanism of data into the ElasticSearch indices. • Designed an algorithm for efficiently validating the difference in data between two databases. Came up with a hashing based join operation to validate the data present in both the databases and coded it in PySpark. Wrote a multithreaded program in python for simultaneously running query on both the databases.

  • Data Science Intern at LYTIQ GmbH
    May 2019 - Jul 2019 · 3 mos

    • Worked as a Research developer, developing and deploying neural network models for different customer projects. • Worked on Protein Folding problem, understanding the problem and it’s complexities. Implemented a differential neuralnetwork in PyTorch and performed various ablation studies. • Worked on developing a Child Pornography Detection Pipeline for North Rhine - Westphalia government. The pipeline was a two staged process,first developed a model for pornography detection implemented in Fastai on PyTorch backend then implemented a pre-trained model for multiple face detection and age estimation. The out put from two models were combined to make a final prediction. • The prediction pipeline was then deployed as a Docker Container and shipped to the client.