[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
representation-learning
action-recognition
audioset
dcase
ucf101
hmdb51
sound-classification
self-supervised-learning
kinetics-datasets
kinetics400
esc50
-
Updated
Jul 11, 2023 - Python