2014-2019 Ph.D Candidate Northwestern Polytechnical University
2010-2014 B.S. Honors College, Northwestern Polytechnical University
Work Experience
2020-Now Assistant Professor, Renmin University of China
2019-2020 Research Scientist, Baidu Research
RESEARCH INTERESTS
Machine Multimodal Perception and Learning: Mining and exploring the potential problems and methods of multimodal messages (such as image, sound, touch etc.) in the direction of machine perception, reasoning and understanding, then equipping the machines with “multisensory cognitive ability”.
Prospective Students/Staffs
Curious about things surrounding, self-driven, aiming to do interesting, meaningful and valuable research
PUBLICATIONS
2024
Enhancing Multi-modal Cooperation via Fine-grained Modality Valuation
Yake Wei , Ruoxuan Feng , Zihe Wang , Di Hu
Computer Vision and Pattern Recognition(CVPR)
Quantifying and Enhancing Multi-modal Robustness with Modality Preference
Zequn Yang , Yake Wei , Ce Liang , Di Hu
The Twelfth International Conference on Learning Representations (ICLR)
SphereDiffusion: Spherical Geometry-aware Distortion Resilient Diffusion Model
Tao Wu , Xuewei Li , Zhongang Qi , Di Hu , Xintao Wang , Ying Shan , Xi Li
The 38th Annual AAAI Conference on Artificial Intelligence
Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer
Yaoting Wang* , Weisong Liu* , Guangyao Li , Jian Ding , Di Hu , Xi Li
The 38th Annual AAAI Conference on Artificial Intelligence