David Greenberg

Building things hands-on

Port Washington, New York, United States

About

I am a deeply passionate engineer and technical leader who gravitates toward challenging problems in scaling systems, datacenter design, and statistical inference. Over the years, I have built numerous distributed scheduling systems to improve utilization of large clusters; in 2015, I published a book on this topic with O'Reilly Media. More recently, I have also begun working on quantitative research, statistical inference, and machine learning, while remaining deeply involved in software infrastructure, from networking to build systems to customizing datacenters.

Experience

  • Staff Engineer at Meta
    Jan 2022 - Present · 4 yrs 6 mos

    - Rewrote build system to unblock ML research usage of the Research Super Cluster’s storage system
    - Implemented flash storage backend for fast and reliable access to 2000PB (2 exabytes) of AI storage
    - Created API for large language models and synthesized research for efficiency optimizations

  • Senior Quant Developer at ExodusPoint Capital Management, LP
    2018 - 2021 · 3 yrs

    - Built out AWS environment, implementing NAT/VPN PrivateLink to firm networks, batch computation environment, dynamically allocated live & paper trading environments for testing complete stack in “prod” without risk
    - Implemented ML-based sentiment models with a $50mm book using Spark
    - Implemented caching, distributed, snapshotting filesystem for all production data for stable research
    - Created numerous research tools for signal construction and tradeability/factor analysis based on Pandas, Dask, and Spark
    - Improved existing technical models by engineering new features and improving training techniques
    - Migrated PM team to Bazel, integrating 3 disparate build systems into a unified polyglot build environment

  • Senior Software Engineer at Amazon
    Jul 2017 - Aug 2018 · 1 yr 2 mos

    - First senior engineer hire in Boulder site, mentored other engineers and established office culture
    - Integrated 7 systems created by teams across 3 other sites to define and launch new attribution product
    - Designed and implemented serverless realtime data pipeline on AWS handling 25k transactions per second

  • Architect at Independent Consultant
    Jan 2016 - Jul 2017 · 1 yr 7 mos

  • Two Sigma (Greater New York City Area)
    • Vice President
      Jan 2015 - Jan 2016 · 1 yr 1 mo

    • Software Developer
      Jul 2011 - Jan 2016 · 4 yrs 7 mos