The fastest way to teach a neural network to hunt for new physics is also the easiest way to make it stop noticing when the universe breaks the rules it learned. That is the working message of a new paper from the Flatiron Institute and Princeton University, published through a SISSA Medialab release and surfacing this week on ScienceDaily (source).
The technique in question is transfer learning. Cosmology researchers cannot afford to simulate the universe from scratch every time they want to test a new theory, so they pretrain a neural network on cheap standard-model simulations and then fine-tune it on a smaller batch of expensive beyond-standard-model runs. The savings are real. On the Quijote simulation suite, the community benchmark built by Francisco Villaescusa-Navarro and collaborators, the team reports that transfer learning cut the number of costly simulations needed for some beyond-standard-model tasks by more than a factor of ten (arXiv:2510.19168).
The same paper, however, names the structural reason those savings come with a price tag. When the target simulation lives far from the training distribution, the pretrained network under-performs, and in the worst case it stops learning the new physics altogether. The authors call this "negative transfer," and the failure is not random. It tracks a real degeneracy in cosmological models: signatures of new physics such as massive neutrinos sit on top of an existing standard-model observable, the matter-clustering amplitude known as σ8, and a network that has been trained to treat σ8 variation as ordinary ΛCDM noise reads the new signal as familiar background.
What makes the finding land for cosmologists, rather than for a generic machine-learning audience, is that the σ8-versus-neutrino-mass degeneracy is not an artifact of the training pipeline. It is the same degeneracy that already complicates real cosmic-shear analyses in surveys such as Euclid, the Vera C. Rubin Observatory's LSST, and NASA's Roman Space Telescope, all of which will lean heavily on machine-learning inference to make sense of petascale data. A systematically ΛCDM-biased inference layer is exactly the wrong failure mode for discovery science, because the experiments most likely to surface a deviation are the ones whose targets sit farthest from the ΛCDM training set.
The authors are careful about what the result does and does not establish. The work is simulation-only. No real-survey data has been used, and the paper does not claim that current ML pipelines are missing new physics in actual observations. Instead, the team frames the negative-transfer effect as a concern to mitigate before the next generation of surveys goes live. The mitigation guidance is concrete. Pair transfer learning with targeted fresh simulations drawn from the regions where the source-target distance is largest. Add human-designed checks for the degeneracies the training set cannot break. Treat the catch as a measurement of where the next compute dollar should go, rather than a verdict on whether to use the technique at all.
That framing turns a cautionary tale into a design specification. The paper is also a useful reminder that negative transfer is a property of distributional distance, not a sign that a model is "overconfident" or "broken." When source and target overlap, transfer learning is fast. When they do not, the same architecture will silently reinforce the prior it was handed. The honest engineering response, which is the response the authors recommend, is to know in advance where the two diverge, and to spend the expensive simulations there.
For the Rubin, Euclid, and Roman teams now finalizing their analysis pipelines, the practical question is no longer whether machine learning belongs in cosmological inference. It is whether the inference layer is being told, in advance, which corners of parameter space it has never seen. A pipeline that knows the answer to that question can use transfer learning aggressively. One that does not will find exactly the physics it was trained to expect.