**Multi-modal Large Language Model**, **CVPR 2024** VLM for remote sensing dialogue and analysis.
Jan 12, 2024
**Text-to-Image Model**, **ICLR 2024** Leaverging LLM to generate complex scenes in Zero-Shot.
Jan 12, 2024
**Vision-Language Model**, **AAAI 2024**, **Oral**, **Top 9.5%** Self-structural Alignment of Foundational Models for Zero-Shot.
Dec 12, 2023
**Vision-Language Model**, **NeurIPS 2023** Test-Time Alignment of Foundational Models for Zero-shot.
Nov 12, 2023
**Vision-Language Model**, **ICCV 2023** Self-regularization for foundational vision-language models during fine-tuning.
Jul 13, 2023
**Vision-Language Model**, **ICCV 2023** Face anti-spoofing by adapting foundational vision-language models like CLIP.
Jul 13, 2023
**Visual-Spatial and Temporal Perception**, **ICCV 2023** Spatio-temporal focal modulation for video recognition is an efficient network.
Jul 13, 2023
**MICCAI 2023** Frequency domain adversarial training for robust medical segmentation.
May 25, 2023
**Vision-Language Model**, **CVPR 2023** Adapting vision language Foundational models like CLIP for video recognition.
Feb 27, 2023
**Self-Learning**, **CVPR 2023** Novel class discovery through prompting.
Feb 27, 2023