Johannes Rausch

Deep Learning Engineer @ NVIDIA

Zurich, Zurich, Switzerland

About

I am working as Deep Learning Engineer at NVIDIA on research and engineering problems related to LLM efficiency optimization. I've done a PhD in Computer Science at ETH Zurich, specializing in hierarchical document parsing, and have a background in Machine Learning, Computer Vision, LLMs and Medical AI. I'm passionate about leveraging machine learning to tackle real-world challenges and contributing my technical and interpersonal expertise to drive advancements at the intersection of technology and industry.

Experience

Deep Learning Engineer at NVIDIA
May 2024 - Present · 2 yrs 2 mos
Research and engineering for LLM and VLM efficiency optimization.
Scientific Assistant & PhD Student at ETH Zürich
Nov 2017 - Sep 2023 · 5 yrs 11 mos
- Developed end-to-end ML systems for parsing of document renderings (e.g. PDF files, scans) that achieved SOTA performance by leveraging object detection methods, weak supervision, and novel large-scale datasets. [Python, TensorFlow, PyTorch]. - Developed multi-modal transformer models to enable full end-to-end text recognition from document images [PyTorch]. - Managed industry collaborations and supervised students in ML research projects, and served as teaching assistant at ETH. - PhD thesis: "Building end-to-end Systems for Hierarchical Document Parsing and OCR".
Research Intern at NVIDIA
Jun 2022 - Feb 2023 · 9 mos
- Contributed LLM decoder-based neural network for Optical Character Recognition (OCR) to open source library [PyTorch]. - Created large-scale dataset of rendered documents for training and evaluation of LLM-based document OCR systems [Python]. - Developed system for joint, end-to-end layout recognition and OCR on document images by leveraging a multi-modal transformer architecture for processing both images and text [PyTorch].
Visiting Researcher at Stanford University
Aug 2018 - Sep 2018 · 2 mos
- Demonstrated how to extract information from Wikipedia articles with limited manual labels by leveraging data programming and document structure with Fonduer.
Working Student at tacterion
Dec 2016 - Mar 2017 · 4 mos
- Development of a live visualization for novel tactile sensor technology.