Description
pumpp features currently rely on low-level librosa implementations, but we could also have wrappers for pre-trained feature extractors like openl3 and vggish (the latter as implemented by openmic).
There are some details to work out in terms of standardizing the parameters (hop size, etc.).
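
As a starting point for discussion, here is a rough sketch of how the hop-size question could be handled. It assumes the wrapper keeps a sample-based `sr` / `hop_length` pair on its public side and converts to seconds for models that expect a hop in seconds (openl3's `hop_size`, for example). Everything in it is hypothetical: `PretrainedFeature`, `embed_fn`, and `dummy_embed` are illustrative names, not existing pumpp, openl3, or openmic API, and the dummy embedder only stands in for a real model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

# Assumed signature for a pretrained model: (audio, sr, hop in seconds) -> frames
EmbedFn = Callable[[Sequence[float], int, float], List[List[float]]]


@dataclass
class PretrainedFeature:
    """Hypothetical wrapper (not part of pumpp) around a pretrained embedder."""

    name: str
    sr: int
    hop_length: int  # hop in samples, so it lines up with sample-based features
    embed_fn: EmbedFn

    @property
    def hop_seconds(self) -> float:
        # Convert the sample-based hop to seconds for models that expect that unit.
        return self.hop_length / self.sr

    def transform_audio(self, y: Sequence[float]) -> Dict[str, List[List[float]]]:
        # Delegate to the model and return the frames under this feature's name.
        frames = self.embed_fn(y, self.sr, self.hop_seconds)
        return {self.name: frames}


def dummy_embed(y: Sequence[float], sr: int, hop_seconds: float) -> List[List[float]]:
    """Stand-in for a real model: one 1-D 'embedding' (the chunk mean) per hop."""
    hop = max(1, round(hop_seconds * sr))
    frames = []
    for start in range(0, len(y), hop):
        chunk = y[start:start + hop]
        frames.append([sum(chunk) / len(chunk)])
    return frames


# One second of silence at 48 kHz with a 4800-sample (0.1 s) hop.
feature = PretrainedFeature(name="dummy", sr=48000, hop_length=4800, embed_fn=dummy_embed)
result = feature.transform_audio([0.0] * 48000)
```

Models with a fixed internal frame rate would not fit this conversion directly and would probably need resampling or interpolation of the output frames, which is one of the details still to be decided.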