A Stress Test for AI That Acts on Its Own: 45 Agentic Systems, One Benchmark | type0