[🎥 ICCV2025] ESSENTIAL: Episodic and Semantic Memory Integration for Video Class-Incremental Learning
Jongseo Lee¹\*, Kyungho Bae²\*, Kyle Min³, Gyeong-Moon Park⁴†, Jinwoo Choi¹†
* Equal contribution, † Corresponding author
This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
- Accepted at ICCV 2025 (Highlight Presentation)
- Proposes ESSENTIAL, a framework inspired by human memory that integrates episodic and semantic memory for video class-incremental learning (VCIL).
- Achieves a favorable trade-off between memory efficiency and recognition performance compared to prior VCIL methods.
- Code release includes training, evaluation, and visualization tools.
We recommend using conda to create a clean environment.
The code has been tested with Python 3.8, PyTorch 2.0.1, and CUDA 11.7.
```shell
conda create -n ESSENTIAL python=3.8 -y
conda activate ESSENTIAL
conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.7 -c pytorch -c nvidia
pip install -r requirements.txt
```

We provide the annotation files for ESSENTIAL on the Hugging Face Hub.
Please download the annotation files from the link above.
Place the downloaded files under the ./data/ directory as follows:
```
ESSENTIAL/
└── data/
    ├── clip_temporal.pth
    ├── TCD/
    │   └── ...
    ├── vCLIMB/
    │   └── ...
    └── ...
```
- The benchmark datasets (e.g., Kinetics-400) should be downloaded separately.
We provide training scripts for two representative benchmarks:
- TCD (Something-Something V2 based): `scripts/ssv2_final.sh`
- vCLIMB (UCF101 based): `scripts/ucf_final.sh`
These scripts contain the recommended hyperparameters and configurations for each benchmark.
For other datasets, please adapt the script by changing the following arguments:
- `--data_set`: the dataset name (e.g., `SSV2`, `UCF101`, `Kinetics400`)
- `--anno_path`: the path to the corresponding annotation file (e.g., `ESSENTIAL/data/TCD/...pkl`)
- `--num_tasks`: the number of incremental tasks for the experiment
By modifying these options, the same framework can be applied to various datasets under different class-incremental learning scenarios.
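As a concrete illustration, a run on a different dataset might look like the following. Note that this is a sketch: the entry point `main.py` and the annotation filename are placeholders, not names taken from this repository — please copy the actual command and remaining hyperparameters from `scripts/ssv2_final.sh` or `scripts/ucf_final.sh`.

```shell
# Hypothetical adaptation to Kinetics-400 with 10 incremental tasks.
# "main.py" and the annotation path below are placeholders — take the
# real entry point and file names from the provided training scripts.
python main.py \
  --data_set Kinetics400 \
  --anno_path ./data/vCLIMB/kinetics400_annotation.pkl \
  --num_tasks 10
```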
To evaluate a trained model, specify the path to the experiment folder using:
- `--fine_tune_path`: path to the folder containing the trained checkpoints
For evaluation-only mode, enable the following flags:
- `--no_training`: disable further training
- `--no_rehearsal`: disable rehearsal during evaluation
With these options, ESSENTIAL will load the trained checkpoints and report performance without performing additional training or rehearsal.
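Put together, a minimal evaluation-only command could look like the sketch below. As above, `main.py` and the experiment folder name are assumptions for illustration; substitute the repository's actual entry point and your own checkpoint directory.

```shell
# Evaluation-only mode: load trained checkpoints and report performance
# without additional training or rehearsal. "main.py" and
# "./experiments/ssv2_run1" are placeholders for this example.
python main.py \
  --data_set SSV2 \
  --fine_tune_path ./experiments/ssv2_run1 \
  --no_training \
  --no_rehearsal
```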
For the convenience of follow-up research and reproducibility, ESSENTIAL is implemented to store tokens in memory during training and evaluation rather than writing them directly to files.
This design choice makes it easier for others to adapt the codebase for new experiments and extend it to different research directions.
