I'm a graduate student in Computer Science & Technology at Fudan University (2025β2028), advised by Prof. Siyu Zhu in the Fudan Generative Vision Lab.
My research applies generative models to autonomous driving β masked-diffusion vision-language-action (VLA) frameworks and discrete flow matching for motion planning. I'm interested in making driving policies more controllable, efficient, and reliable, and in building open-source systems that turn these ideas into things people can run.
WAM-Diff
A masked-diffusion VLA framework with mixture-of-experts and online reinforcement learning for end-to-end autonomous driving.
WAM-Flow
Parallel coarse-to-fine motion planning via discrete flow matching, generating accurate driving trajectories in few inference steps.