Google unveils plan to protect itself if its AI agents act harmfully

Google has introduced the AI Control Roadmap, a framework for building and managing the advanced AI deployed within the company. Treating agents as potential "insider threats", the system uses trusted AI supervisors to monitor behaviour and detect risks. If a supervisor sees the agent trying to do something harmful, they can step in to block the action before damage occurs.

Load More