Hao Liang



Hao Liang Hao Liang is a Research Associate at King's College London. He received his PhD from The Chinese University of Hong Kong (CUHK), Shenzhen, under the supervision of Zhi-Quan (Tom) Luo. His research lies at the intersection of statistical and computational efficiency in decision-making algorithms, with a particular focus on risk awareness, safety, and multi-agent systems. He is also interested in exploring the applications of these algorithms in various domains, including large language models (LLMs), energy system, and robotics.



Email: haoliang1 at link.cuhk.edu.cn
[CV] [Google Scholar] [LinkedIn]

News

October 2025: Our paper, "Why GRPO Needs Normalization: A Local-Curvature Perspective on Adaptive Gradients" will be presented at NeurIPS 2025 Workshop on Efficient Reasoning
👉 TL;DR: We reveal that GRPO’s normalization acts as an adaptive gradient mechanism aligned with local curvature, accelerating and stabilizing LLM reasoning training.
September 2025: Our paper, "Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems" is accepted by NeurIPS 2025 (Spotlight)
👉 TL;DR: We develop GSAC, a causality-aware framework enabling provable scalability and fast cross-domain adaptation in large networked systems.
August 2024: Our paper, "An Economic and Low-carbon Dispatch Algorithm for Microgrids with Electric Vehicles" is awarded the best oral presentation at PSET 2024
July 2024: My paper, "Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds" is accepted by JMLR
👉 TL;DR: We bridge distributional and risk-sensitive RL under entropic risk measures, achieving near-optimal regret with computationally efficient DRL algorithms.
July 2024: Present "Bridging Distributional and Risk-Sensitive Reinforcement Learning: Balancing Statistical, Computational, and Risk Considerations" at ICML 2024 FoRLaC Workshop
March 2024: Deliver a talk "Efficient Risk-aware Decision-making: A Distributional Perspective" at Vector Institute
March 2024: Deliver a talk "A Distribution Optimization Framework for Confidence Bounds of Risk Measures" at the Informs Optimization Society (IOS) Conference
Janurary 2024: My paper, "Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures" is accepted at AISTATS 2024
July 2023: Present "A Distribution Optimization Framework for Confidence Bounds of Risk Measures" at ICML 2023

I am serving as Local Organizing Chair of the 7th International Conference on Distributed Artificial Intelligence (DAI 2025), which will be held on November 21–24, 2025 in London, UK.
The conference will bring together researchers and practitioners to discuss the latest advances in distributed AI and multi-agent systems.
We warmly welcome the community to join us in London!


Selected Papers

Why GRPO Needs Normalization: A Local-Curvature Perspective on Adaptive Gradients NeurIPS 2025 Workshop on Efficient Reasoning
Cheng Ge*, Heqi Yin*, Hao Liang†, Jiawei Zhang†
Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems NeurIPS 2025 (Spotlight)
Hao Liang*, Shuqing Shi*, Yudi Zhang, Biwei Huang, Yali Du
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds [link] JMLR
Hao Liang, Zhi-Quan Luo
Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures [link] AISTATS 2024
Hao Liang, Zhi-Quan Luo
A Distribution Optimization Framework for Confidence Bounds of Risk Measures [link] ICML 2023
Hao Liang, Zhi-Quan Luo