Hanyuan Jiang

I try to understand and harness general, robust, and emergent intelligence.

My research focuses on frontier deep learning and reinforcement learning architecture, as well as their application in LLM. Recently, I am working on multi-agent RL, meta RL (learning to learn), multi-step reasoning, and efficient training & inference (e.g., TTT). I am also interested in game theory and mechanism design.

Until recently, I researched stress-testing and red-teaming with Anthropic. I also contributed to various in-house projects and collaborated with ByteDance, Qwen, and Citadel Securities.

I love Go and Chess. I earned a 7th Dan and Candidate Master (CM). Some of my informal thoughts can be found here, which represent my thinking at some particular stage (though I am constantly learning, and my perspectives evolve).

Recent Updates

Apr 22, 2026 News I serve as a reviewer for NeurIPS 2026 and ICML 2026.

Mar 2, 2026 Blog Humanity’s Thousand-Year Alignment Experiment Feb 26, 2026 Blog The Rating You See Is Pricing Tomorrow Feb 22, 2026 Blog Research Interests Feb 20, 2026 Blog On Learning, Longing, and All the Ideas We Cannot Name