Measuring AI progress has often meant testing scientific information or logical reasoning – however whereas the…