Validate that the length of the audio and the length of the text are in a plausible range relative to each other. The accepted range has to be an interval rather than a single value, since the ratio depends on speech rate etc.
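A minimal sketch of such a check, assuming the audio duration is known in seconds and the text is a plain string. The function name `length_ratio_ok` and the default bounds of 5 to 25 characters per second are hypothetical placeholders, not values from this note; they would need tuning per language and speaker.

```python
def length_ratio_ok(audio_seconds: float, text: str,
                    min_cps: float = 5.0, max_cps: float = 25.0) -> bool:
    """Return True if the characters-per-second rate lies in [min_cps, max_cps].

    min_cps and max_cps are assumed placeholder bounds; the interval has to be
    wide enough to cover slow and fast speakers.
    """
    if audio_seconds <= 0:
        return False  # no usable audio
    n_chars = len(text.strip())
    if n_chars == 0:
        return False  # no usable text
    cps = n_chars / audio_seconds
    return min_cps <= cps <= max_cps
```

Pairs that fall outside the interval could be flagged for manual review rather than dropped outright, since an unusually slow or fast speaker may still be a valid sample.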