AI Models Can Audit Computer-Use Agents, But Disagree on Complex Tasks | type0