According to the instructions provided here:
Note 2: To test with a custom audio, you need to replace the video_name/video_name.wav and deepspeech feature video_name/deepfeature32/video_name.npy. The output length will depend on the shortest length of the audio and driven poses. Refer to here for more details.
I have copied a custom audio file with a 16 kHz sampling rate into the video folder, which now looks like this:
video_processed/00014
├── 00014.wav
├── deepfeature32
├── latent_evp_25
└── poseimg
Starting from that layout, how do I generate the missing files to get to this one?
video_processed/00014
├── 00014.wav
├── deepfeature32
│   └── 00014.npy
├── latent_evp_25
│   └── 00014.npy
└── poseimg
    └── 00014.npy.gz
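To make the gap concrete, here is a minimal sketch that lists which files of the target layout are still missing and reports the sampling rate of the audio. It does not generate the DeepSpeech features or any other file. The function names (`expected_files`, `missing_files`, `wav_sample_rate`) are hypothetical helpers written for this post, not part of the repository, and the expected paths are simply copied from the tree above.

```python
import wave
from pathlib import Path


def expected_files(video_dir):
    """Return the paths that the target layout above contains for video_dir."""
    root = Path(video_dir)
    name = root.name  # folder name doubles as the file stem, e.g. "00014"
    return [
        root / f"{name}.wav",
        root / "deepfeature32" / f"{name}.npy",
        root / "latent_evp_25" / f"{name}.npy",
        root / "poseimg" / f"{name}.npy.gz",
    ]


def missing_files(video_dir):
    """Return the expected paths that do not exist yet."""
    return [p for p in expected_files(video_dir) if not p.is_file()]


def wav_sample_rate(wav_path):
    """Return the sampling rate in Hz (the stdlib wave module reads PCM WAV only)."""
    with wave.open(str(wav_path), "rb") as wav:
        return wav.getframerate()


if __name__ == "__main__":
    video_dir = Path("video_processed/00014")
    for path in missing_files(video_dir):
        print("missing:", path)
    wav_path = video_dir / "00014.wav"
    if wav_path.is_file():
        print("sample rate (Hz):", wav_sample_rate(wav_path))
```

With only the audio copied in, this should report the three feature files as missing, which are exactly the files I do not know how to produce.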