AI agents — AI systems designed to complete digital tasks without much supervision — may be everywhere, but they’re not exactly ready for primetime. Over the last year, they’ve been caught slandering people, deleting user emails, and wiping out entire hard drives.
Most recently, a free-spirited AI agent was caught moonlighting as a crypto miner — a behavior which startled its keepers, Axios reported.
Called ROME, the AI agent was being run as part of a research project by an AI lab affiliated with Chinese online retail giant Alibaba. In their ensuing research paper, the researchers describe the agent’s strange side-hustle as a set of “unsafe behaviors” that “arose without any explicit instruction and, more troublingly, outside the bounds of the intended sandbox.”
Early one morning as the experiment was going on, the researchers became aware of unusual activity on their network, not through the AI’s indication, but from security alerts.
“The alerts were severe and heterogeneous, including attempts to probe or access internal-network resources and traffic patterns consistent with cryptomining-related activity,” they write. “We initially treated this as a conventional security incident… However, the violations recurred intermittently with no clear temporal pattern across multiple runs.”
By tracking the time at which the security incidents occurred, the researchers were able to trace it back to the guilty party. “In the corresponding model logs, we observed the agent proactively initiating the relevant tool calls and code-execution steps that led to these network actions,” the research team explained.
Essentially, the agent had deviated from its original tasks into the wider world of cryptocurrency on its own volition, silently diverting computing resources away from its training tasks and toward mining. To complete its mission, ROME went so far as to dig out a “reverse SSH tunnel,” what Axios describes as a hidden backdoor to an unauthorized computer.
Thankfully, the AI agent was caught by researchers and charged with much stricter guidelines, so no real-world damage was done. Still, the bizarre foray into crypto shows just how unpredictable AI agents can be — which, given how popular they seem to have become throughout the corporate world, isn’t exactly reassuring.
More on AI agents: New AI Agent Logs Directly Into College Platform Canvas to Do Your Homework for You