Currently, the library only supports Resnet_Glove, CLIP, and CLIP_Slowfast features for the ActivityNet and Charades datasets. However, several papers use other feature types, such as I3D (for Charades-STA) and C3D (for ActivityNet), as noted in Table 2 of the EaTR paper (link to paper). Expanding support to include these features would align with a broader range of existing research.
Additionally, I noticed that for ActivityNet, many works report results on the val_2 split, while here only the val split is provided, which I believe corresponds to val_1. Could you clarify this aspect? How can I access to val_2 split?
Thank you for your efforts in creating this unified framework—it’s much appreciated!