Education

  • 2010-2019: Northwestern Polytechnical University, bachelor's through doctoral studies

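Work Experience

  • 2020-present: Gaoling School of Artificial Intelligence, Renmin University of China, Tenure-Track Assistant Professor
  • 2019-2020: Baidu Research, Artificial Intelligence Researcher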

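Research Interests

Machine multimodal perception and learning: taking the brain's multi-channel perception as a starting point, we identify and study open problems and methods concerning multimodal information (such as images, sound, and touch) in machine perception, reasoning, and understanding, with the aim of giving machines "multisensory cognitive abilities".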

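Requirements for Students

Curious about the objective world, self-motivated, and hardworking, with the goal of doing research that is interesting, humane, and valuable.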

Courses Taught

  • Undergraduate course: "Artificial Intelligence and Python Programming"
  • Graduate course: "Pattern Recognition and Computer Vision"

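Research Projects

  • Tencent AI Lab Rhino-Bird Focused Research Program (2021): Research on multi-speaker tracking and diarization methods in dynamic audiovisual scenes

Publications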

2021
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
Yapeng Tian, Di Hu*, Chenliang Xu*
CVPR

Unsupervised Multi-Source Domain Adaptation for Person Re-Identification
Zechen Bai, Zhigang Wang, Jian Wang, Di Hu*, Errui Ding*
CVPR

Temporal Relational Modeling with Self-Supervision for Action Segmentation
Dong Wang, Di Hu*, Xingjian Li, Dejing Dou
AAAI

2020
Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching
Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou
NeurIPS

A Two-Stage Framework for Multiple Sound-Source Localization
Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2020.

Co-Learn Sounding Object Visual Grounding and Visually Indicated Sound Separation in A Cycle
Yapeng Tian, Di Hu, Chenliang Xu
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2020.

Does Ambient Sound Help? - Audiovisual Crowd Counting
Di Hu, Lichao Mou, Qingzhong Wang, Junyu Gao, Yuansheng Hua, Dejing Dou, and Xiaoxiang Zhu
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2020.

Heterogeneous Scene Analysis via Self-supervised Audiovisual Learning
Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, and Dejing Dou
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2020.

Multiple Sound Sources Localization from Coarse to Fine
Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, and Weiyao Lin
In Proceedings of the European Conference on Computer Vision (ECCV), 2020.

Cross-Task Transfer for Multimodal Aerial Scene Recognition
Di Hu, Xuhong Li, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, and Dejing Dou
In Proceedings of the European Conference on Computer Vision (ECCV), 2020.

2019
Dense Multimodal Fusion for Hierarchically Joint Representation
Di Hu, Chengze Wang, Feiping Nie, and Xuelong Li
In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019.

Listen to the Image
Di Hu, Dong Wang, Feiping Nie, Qi Wang, and Xuelong Li
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. (CCF A)

Deep Multimodal Clustering for Unsupervised Audiovisual Learning
Di Hu, Feiping Nie, and Xuelong Li
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. (CCF A)

Deep Linear Discriminant Analysis Hashing
Di Hu, Feiping Nie, and Xuelong Li
Sci Sin Inform, 2019. (CCF A)

2018
Deep Binary Reconstruction for Cross-modal Hashing
Di Hu, Feiping Nie, and Xuelong Li
IEEE Trans. Multimedia (TMM), 2018.

Discrete Spectral Hashing for Efficient Similarity Retrieval
Di Hu, Feiping Nie, and Xuelong Li
IEEE Trans. Image Processing (TIP), 2018. (CCF A)

2017
Large Graph Hashing with Spectral Rotation
Xuelong Li, Di Hu, and Feiping Nie
In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2017. (CCF A)

Deep Binary Reconstruction for Cross-modal Hashing
Xuelong Li, Di Hu, and Feiping Nie
In Proceedings of the ACM Conference on Multimedia (ACMMM), 2017. (CCF A)

Image2song: Song Retrieval via Bridging Image Content and Lyric Words
Xuelong Li, Di Hu, and Xiaoqiang Lu
In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017. (CCF A)

2016
Temporal Multimodal Learning in Audiovisual Speech Recognition
Di Hu, Xuelong Li, and Xiaoqiang Lu
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. (CCF A)

Multimodal Learning via Exploring Deep Semantic Similarity
Di Hu, Xiaoqiang Lu, and Xuelong Li
In Proceedings of the ACM Conference on Multimedia (ACMMM), 2016. (CCF A)

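Honors and Awards

  • 2020.9 Received the Chinese Association for Artificial Intelligence (CAAI) Outstanding Doctoral Dissertation Award
  • 2019.8 Selected for Baidu's "AIDU" global top AI talent program
  • 2019.8 Received the ACM XI’AN Outstanding Doctoral Dissertation Award (2 recipients in total)
  • 2019.5 Selected for the CVPR Doctoral Consortium (4 participants from mainland China in total)
  • 2018.7 Awarded a China Scholarship Council scholarship for joint doctoral training at Carnegie Mellon University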

Professional Service

  • Journal reviewer: TIP, TKDE, TMM, Neurocomputing
  • Conference program committee member: NeurIPS 2020-2021; CVPR 2018, 2020, 2021; ICCV 2019, 2021; ECCV 2020; ICML 2021; AAAI 2018, 2020, 2021; ACCV 2018, 2020
  • Co-organizer:
      • CVPR 2021 Tutorial on Audio-visual Scene Understanding
      • WACV 2021 Tutorial on Audio-visual Scene Understanding
      • ICDM 2019 Tutorial on Automated Deep Learning: Theory, Algorithms, Platforms, and Applications

Contact

Phone: --

Email: dihu[at]ruc.edu.cn

Homepage: https://dtaoo.github.io/

Office: Room 2102, Wenhua Building, No. 59 Zhongguancun Street, Haidian District, Beijing