Ning Xue

ASIC Design, micro-Architecture, Performance, Silicon

San Jose, California, United States

About

ASIC & FPGA design for AI acceleration, GPU, Machine Learning/Deep Learning systems. Coursera deeplearning.ai specialization 5 course series, Stanford ML/CV courses

Experience

  • ASIC Design, MTS at Etched
    Apr 2026 - Present · 4 mos

    AI acceleration ASIC Design

  • Mobile GPU Micro-architect & Design at Samsung Semiconductor US
    Feb 2023 - Apr 2026 · 3 yrs 3 mos

    GPU micro-architecture, design, modeling, verification, physical design, PPA

  • Silicon Graphics Pipeline & Machine Learning Architect / Design at Facebook
    May 2020 - Jan 2023 · 2 yrs 9 mos

    Augmented Realty (AR) Graphic IP & GPU micro-architecture, design, verification, modeling, performance/PPA

  • Intel Corporation (Full-time · 4 yrs)
    • Principal Design, Intel AI (formerly Nervana Systems)
      May 2017 - May 2020 · 3 yrs 1 mo

      Neural Network Processor ASIC architecture/micro-architecture, design, verification and implementation, performance/power/area trade off analysis, low power, high speed (2 GHz+); GEMM, LLM, transformer, high performance computing systolic array, tensor, fused QKV, flashattention, tiling, KV compression, embedding, Norm, softmax, on-chip networks, memory, HBM, Tensorflow, Keras, Numpy, Python; VCS, Lint, CDC, formal verification, synthesis, Floorplan, STA, ECO, test plan, UVM, coverage, assertions, HLS, HW/SW co-design, FPGA emulation, system fast prototyping, performance modeling, C++/Python, lab bring up, firmware Deeplearning.ai 5-course specialization, Coursera Machine Learning course, Stanford CS231n CNN for Visual Recognition, CS224n Natural Language processing & CS229 (Machine Learning), MIT 60001 Python programming deeplearning.ai specialization course mentor

    • Sr. Manager, Programmable Hardware Engineering (Altera)
      Jun 2016 - Sep 2017 · 1 yr 4 mos

      High-speed transceiver subsystem silicon design, PCIe, 10G/25G, 40G/50G/100G Ethernet, Design Development, Project management, Product management, Customer support

  • Altera (13 yrs 9 mos)
    • Sr. Manager, IP Engineering
      Mar 2012 - Jun 2016 · 4 yrs 4 mos

      PCI Express Gen1/2/3, IO Virtualization, DSP (fixed point and floating point IEEE-754), 10G/40G/100G Ethernet, Interlaken IP definition, development, verification (OVM/UVM), physical implementation, applications in 20nm/14nm FPGAs, hardware emulation, HLS

    • Project lead/Manager/Sr Manager, IC design
      Oct 2002 - Mar 2012 · 9 yrs 6 mos

      Taped out 90/65/45/28/20nm FPGA families, high-speed I/O, DSP, FPGA control block, DDR IP and subsystems