I am a Senior Research Engineer at DiDi Autonomous Driving, working on Embodied AI and Vision-Language-Action (VLA) systems for robotic platforms that support autonomous vehicle operations. Previously, I was a Postdoctoral Fellow at the ETH AI Center, collaborating with Prof. Siyu Tang and Prof. Christian Holz, where I worked on human motion capture from egocentric videos and wearable IMUs. I received my Ph.D. in 2024 from Xiamen University, supervised by Prof. Cheng Wang and Prof. Chenglu Wen, focusing on 3D computer vision, 4D human motion capture, and human–scene interaction reconstruction.
My goal is to enable embodied agents to understand, reason about, and interact with the physical world for real-world robotic and autonomous systems.
News!
- 02/2025: ClimbingCap was accepted by CVPR 2025 (🪄Highlight (13.4%)) .
- 09/2024: HiSC4D was accepted by TPAMI.
- 07/2024: HmPEAR was accepted to ACM MM2024.
- 03/2024: RELI11D was accepted to CVPR2024.
- 04/2023: SLOPER4D dataset V1.0 released!
- 02/2023: Two papers were accepted to CVPR2023.
- 03/2022: One paper was accepted to CVPR2022.