[June 5, 2025] We have released the exploration code for collecting subtask instructions in OmniBench, as well as the evaluation script used to evaluate virtual agents. Overview of OmniBench, a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results