I am a Principal Researcher at Microsoft Azure AI, primarily working on Vision-and-Language Multimodal Intelligence, a research area at the nexus of computer vision and natural language processing that covers topics such as Vision-and-Language Pre-training (VLP), Visual Question Answering (VQA), image captioning, and video-text modeling. I also have broad interests in other machine learning topics, such as sparse neural networks, adversarial training, and self-supervised visual representation learning. I received my Ph.D. degree from Duke University in Spring 2018. Before that, I received my Master's and B.Sc. degrees from Peking University in 2013 and 2010, respectively. My Ph.D. advisor was Lawrence Carin. I can be reached at zhe.gan@microsoft.com.

I am serving (or have served) as an Area Chair for NeurIPS 2021/2020/2019, ICML 2021, ICLR 2021, ACL 2021, and AAAI 2022, and as a Senior Program Committee (SPC) member for AAAI 2021/2020. I received the AAAI-20 Outstanding SPC Award.


Research Highlights: