A new study has revealed that the large language models (LLMs) can behave unpredictably when given autonomous access to digital tools.
Large language models with autonomous tool access showed unpredictable, risky behaviors in tests Researchers gave AI agents email, Discord, code execution in a sealed lab to stress-test their limits ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results