MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
This work was presented at the 19th IEEE International Conference on Automatic Face and Gesture Recognition (FG2025), where it received the Best Student Paper Award.
Authors: Trung Thanh Nguyen, Yasutomo Kawanishi, Vijay John, Takahiro Komamizu, Ichiro Ide
This repository contains the implementation of MultiTSF on the MultiSensor-Home dataset.
- Download dataset: https://huggingface.co/datasets/thanhhff/MultiSensor-Home1/
A simple way to download the dataset:
```bash
# Make sure hf CLI is installed: pip install -U "huggingface_hub[cli]"
hf download thanhhff/MultiSensor-Home1 --repo-type=dataset --local-dir dataset
```
The Python code was developed and tested in the environment specified in `requirements.txt`.
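For example, a typical setup with a virtual environment (assuming Python 3 and `pip` are available):

```bash
# Create an isolated environment and install the pinned dependencies.
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```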
Experiments on the MultiSensor-Home dataset were conducted on four NVIDIA A100 GPUs, each with 32 GB of memory.
You can adjust the `batch_size` parameter in the code to accommodate GPUs with less memory.
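For instance, to locate where `batch_size` is set (a generic search, not a documented entry point of this repository):

```bash
# Find every place the batch size is configured in the Python sources.
grep -rn "batch_size" --include="*.py" .
```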
Download the MultiSensor-Home dataset and place it in the `dataset/MultiSensor-Home` directory.
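For example, the CLI command above can target the expected directory directly (a sketch, assuming the dataset files should sit at the top level of `dataset/MultiSensor-Home`):

```bash
# Download the dataset straight into the directory the code expects.
hf download thanhhff/MultiSensor-Home1 --repo-type=dataset --local-dir dataset/MultiSensor-Home
```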
To train the model, execute the following command:
```bash
bash ./scripts/train.sh
```
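To pin the run to specific GPUs, the standard CUDA device mask can be used (assuming the script uses all GPUs visible to the process):

```bash
# Restrict training to four GPUs, matching the setup described above.
CUDA_VISIBLE_DEVICES=0,1,2,3 bash ./scripts/train.sh
```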
To perform inference, use the following command:
```bash
bash ./scripts/infer.sh
```
If you use the MultiSensor-Home dataset or this code, please cite:

```bibtex
@inproceedings{nguyen2025multisensor,
  author    = {Trung Thanh Nguyen and Yasutomo Kawanishi and Vijay John and Takahiro Komamizu and Ichiro Ide},
  title     = {MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion},
  booktitle = {Proceedings of the 19th IEEE International Conference on Automatic Face and Gesture Recognition},
  year      = {2025},
  note      = {Best Student Paper Award}
}
```
This work was partly supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI Grant Numbers JP21H03519 and JP24H00733.