Luca Collorone, Stefano D'Arrigo, Massimiliano Pappa, Guido Maria D'Amely di Melendugno, Giovanni Ficarra, Fabio Galasso
We introduce the novel task of Crowd Volume Estimation (CVE), defined as the process of estimating the collective body volume of crowds using only RGB images. Besides event management and public safety, CVE can be instrumental in approximating body weight, unlocking weight sensitive applications such as infrastructure stress assessment, and assuring even weight balance. We propose the first benchmark for CVE, comprising ANTHROPOS-V, a synthetic photorealistic video dataset featuring crowds in diverse urban environments. Its annotations include each person's volume, SMPL shape parameters, and keypoints. Also, we explore metrics pertinent to CVE, define baseline models adapted from Human Mesh Recovery and Crowd Counting domains, and propose a CVE specific methodology that surpasses baselines. Although synthetic, the weights and heights of individuals are aligned with the real-world population distribution across genders, and they transfer to the downstream task of CVE from real images.
The ANTHROPOS-V dataset is a synthetic, photorealistic video dataset containing crowds in diverse urban environments. The dataset is designed to reflect realistic weight and height distributions across genders, ensuring transferability to real-world scenarios.
This work includes:
- Baseline models adapted from Human Mesh Recovery and Crowd Counting.
- A CVE-specific methodology that outperforms the baselines, namely STEERER-V.
- Evaluation metrics pertinent to the Crowd Volume Estimation (CVE) task.
For more details, please refer to the paper and the supplementary material provided.
This repository uses a Conda environment to manage dependencies. Follow the steps below to set up your environment.
-
Clone the Repository
git clone https://github.com/colloroneluca/Crowd-Volume-Estimation.git cd Crowd-Volume-Estimation -
Install Environment
conda env create -f environment.yml conda activate STEERERV
-
Download Dataset
cd Crowd-Volume-Estimation mkdir ProcessedDataDownload the dataset from this link and place it under the folder
ProcessedData. Your ProcessedData should look like:anthopos_v ├── images │ ├── train_set │ │ └── part_{1,2,3} │ │ ├── seq_0 │ │ └── ... │ │ │ ├── test_set │ │ └── part_1 │ │ ├── seq_0 │ │ └── ... │ │ │ └── val_set │ └── part_1 │ ├── seq_0 │ └── ... │ ├── jsons │ ├── train_set │ │ └── part_{1,2,3} │ │ ├── seq_0 │ │ └── ... │ │ │ ├── test_set │ │ └── part_1 │ │ ├── seq_0 │ │ └── ... │ │ │ └── val_set │ └── part_1 │ ├── seq_0 │ └── ... │ │ └── txts ├── train.txt ├── test.txt (Validation) └── test2.txt (Test)mkdir PretrainedModels
Download the pretrained weights from this link and place it under
PretrainedModels.
- Train
cd STEERER-V python tools/train_cc.py --cfg configs/STEERER_V.py - Test
cd STEERER-V python tools/test_cc.py --cfg configs/STEERER_V_Test.py --checkpoint=../PretrainedModels/Anthropos-STEERER-V.pth
If you find this work useful, please consider citing our paper:
@InProceedings{Collorone_2025_WACV,
author = {Collorone, Luca and Darrigo, Stefano and Pappa, Massimiliano and di Melendugno, Guido M. Damely and Ficarra, Giovanni and Galasso, Fabio},
title = {ANTHROPOS-V: Benchmarking the Novel Task of Crowd Volume Estimation},
booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
month = {February},
year = {2025},
pages = {5284-5294}
}
This code is primarily based on the work from the STEERER original repository. We extend our gratitude to Matteo Fabbri for providing the materials essential for the initial setup of the dataset generation.
