. 2022 Feb 16;602(7897):414–419. doi: 10.1038/s41586-021-04301-9

Extended Data Table 2.

Simulation parameters for actuator, sensor and current diffusion models

Parameter values as identified from data. The action bias was fit on the power supply output voltage. Measurement noise is Gaussian additive noise and randomly sampled at each simulation time step. We use a fixed action bias with an additive random offset to account for non-ideal behaviour of power supply hardware. Current diffusion-parameter variations account for the uncontrolled operating conditions. Parameter variations are sampled at the beginning of each episode but kept constant during the episode. The samples are drawn from uniform (action bias) and log-uniform (current diffusion) distributions using the bounds in this table. For single-plasma training, R_p, β_p and q_A are varied, whereas in a multiple-plasmas training, we vary $σ_{∥}$ and I_OH. In the latter case, we sample an overall geometric mean offset of the two $σ_{∥}$ from a log-uniform distribution. We sample the log of the multiplicative difference between them from B_s (4,4), for which B_s is a scaled β distribution. We sample a single I_OH value for both coils. Parameters are sampled as absolute values unless explicitly indicated as scaling factors.