Stanford, California, United States
We were born too late to explore the earth and too early to explore the galaxies. We were born just in time to solve robotics. I'm on a mission to make Physical AI a reality, one motor at a time. Twitter/X: https://twitter.com/DrJimFan Google Scholar: https://scholar.google.com/citations?user=sljtWIUAAAAJ&hl=en My email can be found here: https://jimfan.me
Solving Physical AI. Spearheading Project GR00T: foundation models and techniques for general-purpose robotics. Co-leading GEAR team.
I co-founded a new research team called "GEAR" (Generalist Embodied Agent Research) at NVIDIA. We believe in a future where every machine that moves will be autonomous, and robots and simulated agents will be as ubiquitous as iPhones. We are building the Foundation Agent — a generally capable AI that learns to act skillfully in many worlds, virtual and real. 2024 is the Year of Robotics, the Year of Gaming AI, and the Year of Simulation. We are setting out on a moon-landing mission, and getting there will spin off mountains of learnings and breakthroughs.
Lead of Voyager, Eureka, Prismer, VIMA, MineDojo, and other works on multimodal foundation models for AI agents, robotics, game AI, and simulation.
Ph.D. advised by Prof. Fei-Fei Li. Research in deep reinforcement learning, robotics, computer vision, and large-scale distributed learning.
Deep reinforcement learning.
I was the very first intern of OpenAI since its founding. Co-developed the OpenAI Universe Initiative. Coauthored World of Bits (ICML 2017), the first AI agent that learns to control the web browser through keyboard and mouse.
Research assistant advised by Prof. Yoshua Bengio and Prof. Aaron Courville on deep semi-supervised learning.