AI Models Can Audit Computer-Use Agents — But Disagree on Complex Tasks — Markdown | type0 | type0