**Visual-Spatial and Temporal Perception**, **ICCV 2023** Spatio-temporal focal modulation, an efficient network for video recognition.
Jul 13, 2023
**MICCAI 2023** Frequency domain adversarial training for robust medical segmentation.
May 25, 2023
**Vision-Language Model**, **CVPR 2023** Adapting vision-language foundation models like CLIP for video recognition.
Feb 27, 2023