Skip to content

Releases: MadeAgents/ColorBench

ColorBench code release

20 Jan 02:25

Choose a tag to compare

ColorBench is a graph-structured benchmark designed to evaluate mobile GUI agents on complex, long-horizon tasks composed of multiple atomic operations. This project provides:

A graph-based benchmark construction methodology to expand or reconstruct environments.
A plug-and-play evaluation framework for safe, reproducible testing.

ColorBench

20 Jan 01:59

Choose a tag to compare

ColorBench is a graph-structured benchmark designed to evaluate mobile GUI agents on complex, long-horizon tasks composed of multiple atomic operations. This project provides:

  • A graph-based benchmark construction methodology to expand or reconstruct environments.
  • A plug-and-play evaluation framework for safe, reproducible testing.