Skip to content

Add Online-Mind2Web benchmark harness (packages/bench)#39

Closed
jarugupj wants to merge 5 commits into
mainfrom
phani/cua-bench-runner
Closed

Add Online-Mind2Web benchmark harness (packages/bench)#39
jarugupj wants to merge 5 commits into
mainfrom
phani/cua-bench-runner

Cap benchmark retries and count permanent failures in accuracy

0cddee0
Select commit
Loading
Failed to load commit list.