SVLD: The Social Vision and Language Dataset
The social vision and language dataset is a large-scale multimodal dataset designed for research into social contextual learning. Read the paper
The social vision and language dataset is a large-scale multimodal dataset designed for research into social contextual learning. Read the paper