Nucleus Wants One Container Runtime for Both AI Agents and Production Services

Nucleus Wants One Container Runtime for Both AI Agents and Production Services — type0 | type0

PREVIEWNucleus Wants One Container Runtime for Both AI Agents and Production Services · MD

Nucleus, a Rust-based OCI container runtime surfaced on Show HN this week, makes a specific bet: the ephemeral sandbox where an AI agent runs untrusted code and the long-lived NixOS service that fronts your API are not two different problems. They are the same isolation-and-reproducibility problem, applied to two workload classes, and they deserve one runtime, one pinned rootfs, one audit log.

The project's README on GitHub is explicit about that framing. The two worlds the runtime targets, agent sandboxes and NixOS production services, are not treated as separate feature lists stapled together. They are two configurations of the same machinery: cgroups v2, namespaces for PID, mount, network, UTS, IPC, user and cgroup (with an optional time namespace), capability drop by default, a seccomp allowlist with per-service JSON profiles that can be generated from traces, and Landlock LSM on Linux 5.13 and above. Optional gVisor integration produces a standard OCI bundle. None of that is novel by itself. The constructive claim is that the agent and the production service are not tradeoff axes on the same system. They are the same axis.

The project ships three operating modes, and the differences between them are where the claim gets concrete. The default agent mode is built for fast startup and short-lived execution, with workspaces configurable as bind-rw, bind-ro, or copy-in-out, default mount flags nosuid,nodev,noexec, and a Landlock policy that denies execution from /workspace unless --workspace-exec is passed. The strict-agent mode is a fail-closed version of the same: no production rootfs, no health checks, no sd_notify. It exists because the project expects some operators to want a hard line between "runs a possibly adversarial prompt" and "serves traffic." The production mode is the same isolation stack, plus declarative NixOS services with health checks, sd_notify, journald, deny-by-default egress policy (per-CIDR or per-domain via namespace iptables), externalized seccomp/caps/Landlock policies with SHA-256 pinning, rootfs attestation, mount flag verification post-setup, kernel lockdown assertions, structured audit logs, optional OpenTelemetry export, and in-memory secret staging at /run/secrets with volatile buffer zeroing. Production mode also rejects writable executable workspaces.

The Nix integration is the second half of the claim. A Docker image, at the level of filesystem contents, is whatever the build happened to produce. Nucleus starts from a Nix closure: pinned store paths, reproducible by content hash, declaratively defined in a flake. The project ships a NixOS module and a nucleus.lib.mkRootfs helper for minimal service closures, so the same code path that defines a service also defines the rootfs it runs against. Networking is intentionally narrow: none, host, or bridge, and no CNI plugins. The reasoning, again, is that everything you need is in the closure, and the rest is network policy. If you want a sidecar, you write it in the same Nix expression.

The performance story is the part the project itself flags as noisy. The README claims a 12-millisecond cold start versus Docker's roughly 500 milliseconds, and a PostgreSQL 18 benchmark on Linux 6.18 x86_64, native runtime, bind-mounted pgdata, and --network host. On a SELECT-only workload, the README reports 105,965 transactions per second at 0.075 ms latency for Nucleus versus 100,222 TPS at 0.080 ms for bare metal. On TPC-B, 1,757 TPS at 4.55 ms versus 1,490 TPS at 5.38 ms. The README notes that occasional wins over bare metal are "likely benchmark noise" rather than a guaranteed speedup, and it characterizes the configuration as specific to the test setup. Any reader evaluating the numbers should treat them as the project's own bench, not as independent measurement.

The strongest objection to the unifying-claim framing is one the project makes for you. The README states its non-goals plainly: no OCI image, layer, or registry model; no persistent storage beyond explicit --volume binds; no macOS, Windows, BSD, or 32-bit support; and it is not a drop-in Docker replacement. The dual-workload claim is constructive, not evangelical. The runtime assumes you have already chosen Nix as your packaging layer. The reproducibility story is real if and only if you buy into the Nix store model. If you are running a Kubernetes cluster with multi-architecture images and a private registry, this is not the substrate you would bolt onto it. It is a different substrate.

There is also the question of where the project actually sits today. The repository on GitHub had 26 stars and 152 commits as of June 9, 2026. It is dual-licensed MIT and Apache-2.0, written in Rust with a Nix flake, and registered on crates.io as nucleus-container. There is no security audit linked from the README, no production-user quote, no third-party benchmark, and no hydration of the Show HN thread in the materials in hand. The README's install command, nix run github:wiggum-cc/nucleus, points to a different GitHub org than the repository URL (sig-id/nucleus), and that discrepancy has not been resolved in the sources available here. The piece is built on the project's own documentation, and the article is a fair characterization of what that documentation claims, not an independent verification.

What a reader should take from Nucleus right now is the design, not the numbers. It is an early-stage attempt to argue that agent infrastructure and service infrastructure should converge onto one declarative, pinned, audit-logged, fail-closed default. If the project can land an external security review, reproduce the bench on a workload that resembles real agent traffic, and resolve the install-command inconsistency, it becomes a candidate substrate for that thesis. If it cannot, the README remains a precise design document for a runtime the broader ecosystem has not yet adopted. Watch the commit history on the repository and any future HN thread for signs of either path.

Nucleus Wants One Container Runtime for Both AI Agents and Production Services

Sources