We know percentiles often help found the outlier such as node failure or bug.
But for test normal system, can we judge that a run is better than another run by only look at the metric percentiles ? or What is the your suggest to found which run or which benchmark is better using grafana, not using benckmark-compare repo?
And what was the initial purpose of the fio percentiles in the below link?
Question is from the fio dashboard.json https://github.com/cloud-bulldozer/arsenal/blob/master/fio-distributed/grafana/6.3.0/dashboard.json.