# **Sen Yang's Personal Website** ## About Me - **Computer Vision Researcher** - Research Interests - Multimodal Large Language Models - Autonomous Driving - Human Pose Estimation ## Education - **Ph.D.**: Southeast University (2019.5-2023.3) - Master: Southeast University (2017.9-2019.1) - Bachelor: Jilin University (2013.9-2017.7) ## Work Experience - **Baidu VIS Senior R&D Engineer** (2023.7-Present) - Tencent TPG Intern (2021.12-2022.8) - Megvii Intern (2021.1-2021.10) ## Research Publications - **Autonomous Driving** - TopoSD - MGMapNet - HisTrackMap - **Multimodal Large Models** - Vision Remember - MomentSeg - EM-KD - **Pose Estimation** - TransPose - SimCC - TokenPose - Capturing the motion of every joint - Detecting and grouping keypoints ## Technical stack - **Multimodal Large Models** - Tasks: multilingual understanding, video understanding, referseg, visual reasoning and action - Training Techniques: SFT, Autoregressive Models, RL - Token Compression, Large-scale Distributed Training - **Autonomous Driving Perception** - BEV Visual Mapping, Temporal Modeling - Multimodal Fusion: Vision + Map Structured Data - Navigation Map Integration, Probabilistic Planning, Topology prediciton - **Deep Learning Frameworks** - PyTorch, Python, C++ - Transformer Models, GPU/Ascend NPU Development ## Contact Information - Email: yangsenius@gmail.com - Blog: senyang-ml.github.io - Google Scholar Profile