CV
Education
- B.S. in Department of Electronic and Information Engineering, Tongji University, 2017-2021
- Major in: Automation
- Awards: The First Prize Scholarship (2018, 2019, 2021),
- M.S. in Robotics and Artifical Intelligence Lab (RAIL Lab), Tongji University, 2021-2024
- Topic: Large Multimodal Model, Visual Object Navigation, Artifical Intelligence
- Supervisor: Prof. Chengju Liu
- Awards: The First Prize Scholarship (2022)
Work Experience
- Summer 2023: Research Intern
- SenseTime
- Duties included: Applying Large Multimodal Model (LMM) to generate instruction automatically for REC tasks
- Supervisor: Prof. Yibing Song
- Summer 2020: Research Intern
- Institute of Automation, Chinese Academy of Sciences.
- Duties included: Explorating differentiable NAS neural architecture search methods
- Supervisor: Prof. Dongbin Zhao
Research Field
- Instruction Tuning for Large Multimodal Model (LMM)
- Soft Prompt Learning for Visual-Language Pretraining
- Visual Object Navigation based on Reinforcement Learning
- Skeleton Action Recognition based on Graph Nerual Network (GCN)
Publications
Dang, Ronghao, et al. "Bionic Body Wave Control for an Eel-Like Robot Based on Segmented Soft Actuator Array." 2021 40th Chinese Control Conference (CCC). IEEE, 2021.
Dang, Ronghao, et al. "Channel attention and multi-scale graph neural networks for skeleton-based action recognition." AI Communications Preprint (2022): 1-19
Dang, Ronghao, et al. "Unbiased Directed Object Attention Graph for Object Navigation." Proceedings of the 30th ACM International Conference on Multimedia. 2022.
L. Wang, Z. He, R. Dang, H. Chen, C. Liu and Q. Chen, "RES-StS: Referring Expression Speaker via Self-training with Scorer for Goal-Oriented Vision-Language Navigation," in IEEE Transactions on Circuits and Systems for Video Technology.
Zeng, Qinyang, et al. "DL-PCN: Differential learning and parallel convolutional network for action recognition." AI Communications Preprint (2023): 1-15.
Wang, Liuyi, et al. "A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation." arXiv preprint arXiv:2305.03602 (2023).
Dang, Ronghao, et al. "Multiple thinking achieving meta-ability decoupling for object navigation." arXiv preprint arXiv:2302.01520 (2023).
Dang, Ronghao, et al. "Search for or Navigate to? Dual Adaptive Thinking for Object Navigation." arXiv preprint arXiv:2208.00553 (2022).
Service and leadership
Reviewer of the following conferences and journals:
- ACMMM, ICML, IJCAI
- TCSVT, AI COMMUNICATIONS