LiveAV-Summarization Implementaion of BiModal Transformer to generate real time captions using audio and video feed. Click here to view example.