ETH and NVIDIA trained a humanoid robot on a single workstation. Here is why it matters.
Training a humanoid robot to walk has, until recently, meant one thing: you needed access to a GPU cluster. That assumption is what ETH Zurich and NVIDIA just tried to dismantle.
A team from ETH's Robotic Systems Lab and NVIDIA published a workflow showing that the Unitree H1, a full-size humanoid robot priced under $90,000, can learn to walk over rough terrain using a single DGX Spark workstation. The Spark is NVIDIA's compact desktop AI computer, roughly the size of a small stereo component, built around the GB10 Grace-Blackwell Superchip. It sells for $3,999, according to AIToolDiscovery. No cluster required.
The system runs Isaac Lab, NVIDIA's robotics simulation framework, at 65,000 simulation steps per second on a single machine, according to Semiconductor Engineering's walkthrough of the workflow. Training the H1 to a stable walking policy took 1,350 iterations. At iteration 50, the robot was falling almost immediately, joint actions noisy and uncoordinated. By iteration 1,350, it walked forward consistently, maintained balance on uneven ground, and recovered from small disturbances.
The workflow is worth understanding in detail, because it is not just a demonstration. It reflects a deliberate engineering choice. The training pipeline uses RSL-RL, a reinforcement learning library developed by Nikita Rudin and David Hoeller during their time at ETH Zurich and NVIDIA, with PPO (Proximal Policy Optimization) as the core algorithm. The system launches with 512 parallel simulation environments running simultaneously on the Blackwell GPU. Physics simulation and neural network policy updates share a unified memory space via NVLink-C2C, a high-bandwidth interconnect that eliminates the conventional PCIe bottleneck between CPU and GPU. In most reinforcement learning systems, tensors bounce back and forth across the bus on every training step. Here they do not move at all.
The implication is hardware democratization, at least for the training side of the problem. A university lab, an early-stage robotics startup, a factory automation team inside a mid-size manufacturer can all now replicate a workflow that, two years ago, required infrastructure only a large tech company or major research institution could afford. Compute represents 15 to 35 percent of a humanoid robot's total bill of materials, according to one recent analysis. If the training compute can shift from a multi-GPU cluster to a single workstation, that cost curve bends.
This matters against a specific backdrop. The Unitree H1 is already working. Chinese EV makers including BYD, XPeng, and Nio have deployed H1 robots in production lines for material handling and inspection tasks, as MIT Technology Review reported. The robot is not a research platform anymore. It is a real machine doing real factory work. What ETH and NVIDIA demonstrated is that getting a new skill onto that machine, teaching it a new terrain or a new gait, no longer requires a datacenter.
The gap between demonstration and deployment has always had two parts. The first is whether the robot can physically do the thing. The second is whether anyone who wants the robot to do the thing can afford to train it. This workflow attacks the second problem directly.
There are limits. The DGX Spark delivers roughly one petaFLOP of AI compute, which is sufficient for locomotion policy training but not for training large foundation models or generating synthetic data at scale. The 1X robotics team, which operates the NEO humanoid robot, uses NVIDIA Blackwell HGX B200 GPUs for its model training, a different weight class entirely. And the workflow as described trains a single skill, not a generalist policy. Getting a robot to learn multiple tasks simultaneously remains an open problem.
But the trajectory is clear. What ETH and NVIDIA showed is a proof of concept for a workflow, not a ceiling. Isaac Lab, RSL-RL, and the Arm-native Isaac Sim build are all publicly documented. The hardware is available at retail. A small team with a few thousand dollars of equipment and access to a research paper can now reproduce what used to require a cluster. That is the actual story.
The question to watch next is whether the training workflow generalizes beyond locomotion. Teaching a humanoid to walk on rough terrain is a milestone, but it is also the relatively solved part of the problem. Teaching a humanoid to reliably pick up objects, adapt to novel clutter, or respond to edge cases in a dynamic environment: those are the tasks that matter for factory and warehouse deployment, and they are where the compute requirements scale differently. If the single-workstation approach holds for manipulation skills as well as locomotion, the democratization story becomes much larger.
The ETH and NVIDIA blog post describing this workflow is on Semiconductor Engineering, with a companion learning path on Arm's developer site. The RSL-RL library is on GitHub.