Dhyey Mavani

Moonshotting AI with C-suite @ LinkedIn | Stanford | Amherst College | Featured in Business Insider || Author, Speaker & Researcher

San Francisco Bay Area

About

(+) ๐…๐จ๐ฅ๐ฅ๐จ๐ฐ for latest and diverse perspectives on ๐€๐ˆ, ๐„๐๐ฎ๐œ๐š๐ญ๐ข๐จ๐ง and ๐…๐ฎ๐ญ๐ฎ๐ซ๐ž ๐จ๐Ÿ ๐–๐จ๐ซ๐ค! For speaking engagements, media, and business inquiries: [email protected]. Dhyey is driven by complex problems at the intersection of mathematics, computer science, statistics, and economics. His liberal arts foundation equips him uniquely to build innovative solutions, leveraging strategic thinking and deep technical expertise. A strategic thinker at heart, Dhyey excels in dissecting intricate problems and devising actionable solutions. He believes in the power of collaboration, cherishing partnerships with professionals from diverse backgrounds to produce outstanding results. He has built AI infra, models, and agentic product capabilities at ๐‹๐ข๐ง๐ค๐ž๐๐ˆ๐ง after short but formative stints as CDN engineer at ๐€๐–๐’, and quantitative trader at ๐•๐š๐ฅ๐ค๐ฒ๐ซ๐ข๐ž. At LinkedIn, he delivered major LLM inference wins with speculative decoding, and significant GPU savings while pioneering LinkedIn's first Rust-based model serving engine for PyTorch Embedding Based Retrieval (EBR) models. He also integrated MCP into enterprise workflows that contributed to scalable model onboarding across Azure OpenAI and Kubernetes. He even proposed strategic ideas directly to CEO Ryan Roslansky, leading to cross-functional collaborations. He graduated from ๐€๐ฆ๐ก๐ž๐ซ๐ฌ๐ญ ๐‚๐จ๐ฅ๐ฅ๐ž๐ ๐ž with a ๐ญ๐ซ๐ข๐ฉ๐ฅ๐ž ๐ฆ๐š๐ฃ๐จ๐ซ in Computer Science, Mathematics, and Statistics, earning 2 ๐’๐ฎ๐ฆ๐ฆ๐š ๐‚๐ฎ๐ฆ ๐‹๐š๐ฎ๐๐ž theses, along with induction to ๐๐ก๐ข ๐๐ž๐ญ๐š ๐Š๐š๐ฉ๐ฉ๐š and ๐’๐ข๐ ๐ฆ๐š ๐—๐ข honor societies. His mathematical work includes building a Lean4-based machine-assisted proof framework for chip-firing and Graphical Riemannโ€“Roch alongside a Python package for simulations (chipfiring), while his statistics thesis led to โ€œccrvam,โ€ an open-source Python package for discrete copula modeling. He also completed six graduate-level math and statistics courses at ๐‚๐จ๐ฅ๐ฎ๐ฆ๐›๐ข๐š in one semester and continues advanced computational mathematics studies at ๐’๐ญ๐š๐ง๐Ÿ๐จ๐ซ๐. Outside the professional sphere, adventure calls out to Dhyey. Whether it's traversing new terrains, perfecting his archery shot, or engaging in a spirited badminton match, he's always up for a challenge. Other adventurous interests of Dhyey include jetskiing, ATV quad biking, and bungee jumping. **Opinions, posts, and views expressed here are solely Dhyey Mavani's own and do not reflect that of his employers.**

Experience

  • Generalist, Agentic AI MLE (Special projects with C-suite) at LinkedIn
    May 2024 - Present ยท 2 yrs 2 mos

    2025-2026: - Building the next generation of digital work AI agents, directly with LinkedIn's Chief AI Officer, President of Engineering, and LinkedIn's CEO. - Founded & Led a pod of 3 directly with Chief AI Officer, published a research paper on stability margins for LLM robustness (coined "Support Tokens"). - Youngest MLE working daily alongside Distinguished Engineers; Dabbling across the board: Influencing product, GTM, and governance across context, tools, and agent infra. - Promoted to Highly Confidential capacity, and given access to tented access to broader MSFT + OpenAI resources. - Secured override to traditional policies to build-my-own role after emailing the C-suite! 2025 H1: - Optimized LLM inference in Hiring Agent (LangGraph); Increased throughput by 4x; Slashed latency by 66% with speculative decoding. - Built first-ever MCP integrations with Azure OpenAI deployment infra to improve model onboarding and k8s quota management. - Built SOTA fine-grained observability into the production vLLM and SGLang based hybrid serving stack with a centralized dashboard. - Recruited, Interviewed and Mentored Masters & PhDs on AI assisted coding and LLM serving specifics. Summer 2024: - Boldly pitched 3 novel product ideas to the CEO, Ryan Roslansky as an intern, and that led to further discussions with product leaders across 3 business units for formulating them for the future B2B2C strategies at LinkedIn. - Built & integrated an end-to-end Rust-based low-latency inference engine for Torch. Developed first production-ready PyTorch EBR kNN model with custom-filtering kernels. - My work was being actively used to serve 5+ business use-cases leading to P99 inference latency reduction from 4 ms to 2 ms, and P99 tail data ingestion latency reduction from 70 ms to 7 ms. This also increased throughput by 4x, leading to GPU savings of ~ $XX million.

  • Applied Researcher โ€“ Computational Mathematics & Statistics at Amherst College
    Aug 2021 - Jun 2026 ยท 4 yrs 11 mos

    ccrvam - Python package on a novel discrete copula method (30k+ downloads in a few months) - summa cum laude honors chipfiring - Python pakage on an open graph / number theory problem (7k+ downloads and industry adoption) chip-firing-with-lean - First-ever formalization of chip firing dynamics in Lean4 (addressed a gap in Microsoft developed language); summa cum laude honors QuantileFlow - state of the art performance quantile calculations in service logs (beating Datadogโ€™s Python; 6k+ downloads) RBlocks - block-based programming interface for R; deployed in classrooms for accessibility; addressed gap in Googleโ€™s Blockly

  • Quantitative Trader at Valkyrie Trading
    2024 - 2024 ยท Less than a year

    Developed a Monte-Carlo simulation-based strategy for theo-value estimation and shadowed quants/traders in treasuries and equity options desks. Gained hands-on experience in the execution and risk management of delta and gamma hedging market-making strategies through intensive mock sessions.

  • Software Engineer at Amazon Web Services (AWS)
    2023 - 2023 ยท Less than a year

    Populated a metrics dashboard, pushed latency optimizations to production at AWS CloudFront; Proposed & documented use-case of AutoML to analyze change-propagation/caching service logs to predict future outages

  • Fellow (list with details in description) at Various Companies
    2022 - 2023 ยท 1 yr

    โžก Columbia x Citadel Trading Challenge based on Auction Theory (New York, Apr 2023) โžก Akuna Options 201 course (Virtual, Apr 2023) โžก Schonfeld Early Engagement Summit (New York, Apr 2023) โžก Amplify x Citadel Trading Challenge (New York, Feb 2023) โžก D.E. Shaw & Co. Connect Event (New York, Sep 2022) โžก D.E. Shaw & Co. Latitude Fellowship (New York, Aug 2022) โžก Citadel Securities Invitational Terminal (Virtual, Aug 2022) โžก Citadel Securities Invitational Datathon (Virtual, Jul 2022) โžก Hudson River Trading BIPOC Tech Summit (Virtual, Jun 2022) โžก Goldman Sachs Insights Program (Virtual, Apr 2022 - Jun 2022) โžก Jane Street SEE Quant Research & Trading (New York, May 2022) โžก Jane Street FOCUS Software Engineering (New York, Mar 2022) โžก SIG Freshman Discovery Program (Virtual, Apr 2022)