Skip to content

ViT implementation for fusion between Neuromorphic Vision Sensor data and Visible Light Sensor frames

Notifications You must be signed in to change notification settings

HamadRemeithi/EViFT

Repository files navigation

EViFT - Event-Visible frame Fusion Transformer

Platform

Python >= 3.9
Pytorch >= 1.9.1

Training Dataset

MS-COCO 2014 (T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick. Microsoft coco: Common objects in context. In ECCV, 2014. 3-5.) is utilized to train our auto-encoder network.

KAIST (S. Hwang, J. Park, N. Kim, Y. Choi, I. So Kweon, Multispectral pedestrian detection: Benchmark dataset and baseline, in: Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 1037–1045.) is utilized to train the RFN modules.

Acknowledgement

This codebase is still in progress and is being built on top of Image-Fusion-Transformer by Vibashan.

About

ViT implementation for fusion between Neuromorphic Vision Sensor data and Visible Light Sensor frames

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages