Johannes Rausch

Deep Learning Engineer @ NVIDIA

Zurich, Zurich, Switzerland

About

I am working as Deep Learning Engineer at NVIDIA on research and engineering problems related to LLM efficiency optimization. I've done a PhD in Computer Science at ETH Zurich, specializing in hierarchical document parsing, and have a background in Machine Learning, Computer Vision, LLMs and Medical AI. I'm passionate about leveraging machine learning to tackle real-world challenges and contributing my technical and interpersonal expertise to drive advancements at the intersection of technology and industry.

Experience

  • Deep Learning Engineer at NVIDIA
    May 2024 - Present · 2 yrs 2 mos

    Research and engineering for LLM and VLM efficiency optimization.

  • Scientific Assistant & PhD Student at ETH Zürich
    Nov 2017 - Sep 2023 · 5 yrs 11 mos

    - Developed end-to-end ML systems for parsing of document renderings (e.g. PDF files, scans) that achieved SOTA performance by leveraging object detection methods, weak supervision, and novel large-scale datasets. [Python, TensorFlow, PyTorch]. - Developed multi-modal transformer models to enable full end-to-end text recognition from document images [PyTorch]. - Managed industry collaborations and supervised students in ML research projects, and served as teaching assistant at ETH. - PhD thesis: "Building end-to-end Systems for Hierarchical Document Parsing and OCR".

  • Research Intern at NVIDIA
    Jun 2022 - Feb 2023 · 9 mos

    - Contributed LLM decoder-based neural network for Optical Character Recognition (OCR) to open source library [PyTorch]. - Created large-scale dataset of rendered documents for training and evaluation of LLM-based document OCR systems [Python]. - Developed system for joint, end-to-end layout recognition and OCR on document images by leveraging a multi-modal transformer architecture for processing both images and text [PyTorch].

  • Visiting Researcher at Stanford University
    Aug 2018 - Sep 2018 · 2 mos

    - Demonstrated how to extract information from Wikipedia articles with limited manual labels by leveraging data programming and document structure with Fonduer.

  • Working Student at tacterion
    Dec 2016 - Mar 2017 · 4 mos

    - Development of a live visualization for novel tactile sensor technology.