In a report, Alibaba's AI research team described how an AI agent secretly bypassed security barriers and tried to gain access to its own training GPUs to mine cryptocurrency without "any explicit instruction". The behaviour came to light after Alibaba Cloud's managed firewall "flagged a burst of security-policy violations", the team stated. The researchers attributed the behaviour to reinforcement learning optimisation.