**Vision-Language Model**, **NeurIPS 2023** Test-Time Alignment of Foundational Models for Zero-shot.
Nov 12, 2023
**Vision-Language Model**, **ICCV 2023** Self-regularization for foundational vision-language models during fine-tuning.
Jul 13, 2023
**Visual-Spatial and Temporal Perception**, **ICCV 2023** An efficient spatio-temporal focal modulation network for video recognition.
Jul 13, 2023