I specialize in Multimodal Large Language Models (MLLMs), including multimodal understanding and reasoning.
I also lead projects in multiple domains, including Robust-R1 (AAAI 2026 Oral), Hawk (NeurIPS 2024), IUF (ECCV 2024), Film Removal (CVPR 2024), and EPCE-HDR (ECAI 2024 Oral). You can find more information at jqt.me.
If you're interested in learning more about my work, please feel free to reach out.


