The most credible test yet of whether quantum computers large enough to attempt physics that defeats classical machines can do work physicists actually care about comes from a simulated particle collision — the other three IBM's
A simulated particle appearing inside a gate-based quantum computer is not a new fundamental discovery. But it may be the most honest test yet of whether utility-scale quantum hardware can do work physicists actually care about, rather than circuits designed to look fast.
That result, from Caltech's Roland Farrell and the University of Washington's Nikita Zemlevskiy using processor time donated under IBM's Quantum Credits program, sits inside a freshly published batch of four peer-reviewed and preprint outputs flowing through the same pipeline. The other three are algorithmic or scaling milestones. The honest read is which one moves the needle on physics, and which ones only move the engineering chart.
IBM's Quantum Credits program is a merit-based grant that gives academic groups free clock time on IBM's superconducting quantum machines. It is not a product launch, and "free compute" means donated processor access, not a metaphor. The four results in the industry roundup all came through that pipeline, and each one needs its own scientific merit rating.
The strongest of the four is the Farrell and Zemlevskiy W-state scattering simulation. A W-state is a specific multi-qubit entangled resource used in quantum networking and error correction. The team built a constant-depth preparation algorithm for localized particle "wavepackets" and used mid-circuit measurement with classical feedforward to simulate two particles colliding inside a gate-based quantum computer. A new particle emerged during the simulated collision. That is a physics question: high-energy scattering, where classical simulation scales badly. The result is on arXiv, not yet in a journal, but the problem choice is the right one for testing quantum utility on a problem physicists actually want solved.
Second tier: the 96-qubit mixed-state reconstruction by Benoît Vermersch and Matteo Votto, published in Physical Review Letters. They used randomized measurements and tensor-network methods to reconstruct noisy mixed quantum states at 96 qubits and validated global entanglement and entropy boundaries on IBM hardware. The peer-reviewed venue is meaningful. The caveat is that mixed-state reconstruction via tensor networks is an active literature with prior baselines on superconducting hardware. The result extends the scale; the "beyond classical" framing needs independent benchmark confirmation rather than reliance on the company-curated roundup.
Muhammad Ahsan's 103-qubit kagome lattice VQE is a materials-science scaling result. A kagome lattice is a frustrated magnetic structure: a pattern of atoms whose geometry prevents ordinary magnetic ordering and makes classical simulation hard. Ahsan combined a hardware-efficient ansatz with a new Hamiltonian calibration strategy and a subproblem VQE decomposition to compute the ground-state energy of a 103-qubit version. Frustrated magnetism is a legitimate target for quantum advantage claims. The work is in the ACM Digital Library, a peer-reviewed proceedings venue rather than a top-tier physics journal. The interesting question is whether the classical baseline (exact diagonalization and tensor-network methods on the same problem) has been run on equal footing.
The fourth result, on Hamiltonian formulations of lattice gauge theories aimed at the sign problem, is the most speculative. The sign problem is the central obstacle to using quantum Monte Carlo methods for certain many-body systems: the calculation collapses to noise because probability amplitudes become negative. A Hamiltonian reformulation can in principle sidestep the issue. The published output is a formulation, not a hardware demonstration. Treating this as "beyond classical" requires the demonstration, not the recipe.
IBM's framing in the roundup calls these results "beyond classical limits." The peer-reviewed and preprint literature is more cautious. Two of the four are arXiv-only; one is in Physical Review Letters; one is in an ACM proceedings volume. None of the source artifacts captured here include independent benchmark figures: wall-clock time, classical comparison cost, or fidelity against a known classical method. The honest ranking is that the W-state collision result is the most plausible candidate for genuine physics-relevant quantum utility; the mixed-state and kagome work are legitimate scaling milestones that need independent baseline comparison; and the sign-problem output is a method paper, not a result.
What to watch next: independent benchmark runs on the Vermersch 96-qubit reconstruction and the Ahsan 103-qubit kagome ground-state energy against tensor-network and exact-diagonalization baselines. If those comparisons land on equal footing, the four-thread story either consolidates into a real utility-scale moment, or stays a curated roundup.