Peking University · CS PhD Candidate
Hongcheng Wang
I am a fourth-year CS PhD candidate at Peking University. My supervisor is Prof. Hao Dong.
My current research interests are organized into two parts:
- Chain-of-thought Generation: I study reinforcement-learning post-training for large models, especially how to stably train the <think> segment so that it better guides the <answer>.
- Human-centered Robot Decision Making: I take a Cognitive Behavioral Theory perspective to enable robots to understand human cognition (e.g., latent demands and preferences) and behavior (e.g., observable habits and norms), so that they can better serve people.