A new benchmark pitting AI against previously unseen maths problems shows systems still fall short of top human expertise.
Agents can handle work requiring judgment and unstructured information, not just the clean rules-based tasks RPA was designed ...