CASIA-IVA-Lab
Popular repositories
- ChatBridge Public
ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.
Repositories
- ChatSearch Public
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
- COSA Public
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
- VALOR Public
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
- SC-Tune Public
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
- VAST Public
[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
People
This organization has no public members.