Yunkai Dang
I am a first-year Ph.D. student at the School of Intelligence Science and Technology, Nanjing University, advised by Prof. Wenbin Li and co-supervised by Prof. Yuekun Yang.
My research focuses on multimodal foundation models, with particular interests in high-resolution vision–language modeling, remote sensing foundation models, and uncertainty quantification and calibration.
If you have any questions, please feel free to contact me at yunkaidang1@gmail.com.
news
| Dec 24, 2025 | I have created my own personal academic homepage and hope for good luck in the future. |
|---|
selected publications
-
FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensinghttps://arxiv.org/pdf/2512.24022, 2025 -
A Benchmark for Ultra-High-Resolution Remote Sensing MLLMsarXiv preprint arXiv:2512.17319, 2025 -
Exploring response uncertainty in mllms: An empirical evaluation under misleading scenariosIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing(EMNLP Main), 2025 -
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive SurveyarXiv preprint arXiv:2412.02104, 2024 -
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V TrustworthinessarXiv e-prints, 2024 -
Multi-level correlation network for few-shot image classificationIn IEEE International Conference on Multimedia and Expo (ICME), 2023 -
FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?arXiv preprint arXiv:2307.04114, 2023