刘奇,东北大学机器人科学与工程学院特聘副研究员。2024年获哈尔滨工业大学(深圳)控制科学与工程专业博士学位(导师:李衍杰教授楼云江教授),2019年获哈尔滨工业大学控制科学与工程专业硕士学位(导师:遆晓光教授马杰教授)。研究方向为深度强化学习算法、具身智能和大模型等。在国内外著名学术期刊和会议上发表论文20余篇,其中第一作者或通讯作者论文10余篇。担任多个国际期刊和会议 (IEEE TNNLS/TITS/TCYB/TIV, ICRA, IROS等) 审稿人。

Qi Liu is currently an Associate Researcher at the Faculty of Robot Science and Engineering, Northeastern University, China. He received the Ph.D. degree in Control Science and Engineering from Harbin Institute of Technology (Shenzhen) in 2024 (advised by Prof. Yanjie Li and Prof. Yunjiang Lou), and the M.S. degree in Control Science and Engineering from Harbin Institute of Technology in 2019 (advised by Prof. Xiaoguang Di and Prof. Jie Ma). His research focuses on deep reinforcement learning algorithms, embodied intelligence, and large language models. He has published over 20 papers in renowned international journals and conferences, including over 10 papers as the first or corresponding author. He also serves as a reviewer for several prestigious international journals and conferences (e.g., IEEE TNNLS/TITS/TCYB/TIV, ICRA, IROS).

研究方向

主要研究方向为深度强化学习算法和具身智能:

  • 深度强化学习算法:如价值函数估计、智能体探索、安全强化学习等
  • 多智能体深度强化学习算法:如多智能体协作、信用分配等
  • 大语言模型对齐:如安全价值对齐、人类反馈强化学习等
  • 具身智能:基于学习(深度强化学习、模仿学习和大模型等)的各类智能机器人控制、决策和协作,如:四足机械狗的步态和技能学习、机械臂和多指灵巧手的操作、双足人形机器人全身控制、轮式机器人和无人驾驶导航。

欢迎对上述方向感兴趣的学生(本硕博)和工业界朋友联系,进行科研(可远程)和项目合作。
也欢迎对深度强化学习在其他领域的应用(如:智能电网、推荐系统、自动股票交易等)感兴趣的学生和工业界朋友联系,本人也做过相关的研究。
邮箱:liuqi@mail.neu.edu.cn,手机和微信号:13713517967。

Research Interests

His primary research interests include deep reinforcement learning algorithms and embodied intelligence:

  • Deep Reinforcement Learning Algorithms: Value function estimation, agent exploration, safe reinforcement learning, etc.
  • Multi-Agent Deep Reinforcement Learning Algorithms: Multi-agent cooperation, credit assignment, etc.
  • Large Language Model Alignment: Safe value alignment, reinforcement learning from human feedback, etc.
  • Embodied Intelligence: Learning-based (deep reinforcement learning, imitation learning, and large language models) control, decision-making, and collaboration for various intelligent robots, such as gait and skill learning for quadruped robots, manipulation with robotic arms and multi-fingered dexterous hands, whole-body control of bipedal humanoid robots, and navigation for wheeled robots and autonomous vehicles.

Students (undergraduate, master's, and Ph.D.) and industry professionals interested in these research directions are warmly welcome to reach out about research and project collaborations (remote collaboration is available).
Students and industry professionals interested in applying deep reinforcement learning to other fields (e.g., smart grids, recommendation systems, and automated stock trading) are also encouraged to contact me, as I have conducted related research in these areas.
Email: liuqi@mail.neu.edu.cn

研究经历

星动纪元   北京   08/2024 - 11/2024
研究方向:具身智能研究实习生,基于深度强化学习的机械臂和5指灵巧手控制。
Mentor:陈建宇,清华大学交叉信息研究院助理教授。

智谱AI   北京   03/2024 - 06/2024
研究方向:大语言模型(Large Language Model, LLM) 研究实习生,研究基于大模型的代码生成。
Mentor:牛艺霖,智谱AI大模型对齐(RLHF)组负责人。

粤港澳大湾区数字经济研究院(IDEA)   深圳   08/2023 - 02/2024
研究方向:LLM研究实习生,研究LLM对齐(Reinforcement Learning from Human Feedback, RLHF)及其改进。
Mentor:张家兴,IDEA研究院讲席科学家。

地平线机器人   上海自动驾驶研发中心   07/2018 - 12/2018
研究方向:无人驾驶研究实习生,研究室外无人车定位与建图算法(SLAM)。
Mentor:徐斌峰,地平线机器人算法工程师。

Research Experience

ROBOTERA, Inc.   Beijing   08/2024 - 11/2024
Topic: Embodied Intelligence Research Intern @Algorithm Team, Research on the control of robot arms and dexterous hands based on deep reinforcement learning.
Mentor: Jianyu Chen, Assistant Professor at Institute for Interdisciplinary Information Sciences, Tsinghua University.

Zhipu AI, Inc.   Beijing   03/2024 - 06/2024
Topic: LLM Research Intern @LLM RLHF Team, Research on LLM-based code generation and its improvement.
Mentor: Yilin Niu, Leader of LLM RLHF Team at Zhipu AI.

International Digital Economy Academy (IDEA)   Shenzhen   08/2023 - 02/2024
Topic: LLM Research Intern @LLM RLHF Team, Research on LLM Alignment (RLHF) and its improvement.
Mentor: Jiaxing Zhang, Chair Professor at IDEA.

Horizon Robotics, Inc.   Shanghai Research Center   07/2018 - 12/2018
Topic: Autonomous Driving Research Intern @SLAM Team, Research on localization and mapping for outdoor autonomous vehicles.
Mentor: Binfeng Xu, Algorithm Engineer at Horizon Robotics, Inc.

Selected Publications (# co-first author, * corresponding author)

  • Qi Liu, Yanjie Li, Xiongtao Shi, Ke Lin, Yuecheng Liu, Yunjiang Lou. Distributional Policy Gradient With Distributional Value Function. IEEE Transactions on Neural Networks and Learning Systems, 2024. (JCR Q1, CAS Zone 1, IF: 10.4, Top journal)
  • Qi Liu, Yanjie Li, Yuecheng Liu, Ke Lin, Jianqi Gao, Yunjiang Lou. Data Efficient Deep Reinforcement Learning With Action-Ranked Temporal Difference Learning. IEEE Transactions on Emerging Topics in Computational Intelligence, 2024. (JCR Q1, CAS Zone 2, IF: 5.3)
  • Qi Liu, Yanjie Li, Shiyu Chen, Ke Lin, Xiongtao Shi, Yunjiang Lou. Distributional Reinforcement Learning With Epistemic and Aleatoric Uncertainty Estimation. Information Sciences, 2023. (JCR Q1, CAS Zone 1, IF: 8.1, Top journal)
  • Zheng Zhang#, Qi Liu#, Yanjie Li, Ke Lin, Linyu Li. Safe Reinforcement Learning in Autonomous Driving With Epistemic Uncertainty Estimation. IEEE Transactions on Intelligent Transportation Systems, 2024. (JCR Q1, CAS Zone 1, IF: 8.5, Top journal)
  • Pengbin Chen#, Qi Liu#, Yanjie Li, Shuaikang Ma. An Environmental-Complexity-Based Navigation Method Based on Hierarchical Deep Reinforcement Learning. 2024 IEEE International Conference on Robotics and Automation (ICRA), 2024. (Top robotics conference, CSRankings)
  • Jianqi Gao, Xizheng Pang, Qi Liu*, Yanjie Li*. Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation. 2025 IEEE International Conference on Robotics and Automation (ICRA), 2025. (Top robotics conference, CSRankings)
  • Qi Liu, Xiaoguang Di, Binfeng Xu. Autonomous Vehicle Self-localization in Urban Environments based on 3D Curvature Feature points – Monte Carlo Localization. Robotica, 2022. (JCR Q3, CAS Zone 3, IF: 2.7)
  • Qi Liu#, Jingxiang Guo#, Zhongjian Qiao, Pengbin Chen, Jinxuan Zhu, Yanjie Li. Logarithmic Function Matters Policy Gradient Deep Reinforcement Learning. The Sixth International Conference on Distributed Artificial Intelligence (DAI), 2024.
  • Linyu Li#, Qi Liu#, Yanjie Li, Yongjin Mu, and Zheng Zhang. A Risk-sensitive Automatic Stock Trading Strategy Based on Deep Reinforcement Learning and Transformer. 2024 IEEE 20th International Conference on Automation Science and Engineering (CASE), 2024.
  • Qi Liu, Yanjie Li, Yuecheng Liu, Meiling Chen, Shaohua Lv and Yunhong Xu. Exploration via Distributional Reinforcement Learning with Epistemic and Aleatoric Uncertainty Estimation. 2021 IEEE 17th International Conference on Automation Science and Engineering (CASE), 2021.
  • Qi Liu, Yanjie Li, Lintao Liu. A 3D Simulation Environment and Navigation Approach for Robot Navigation via Deep Reinforcement Learning in Dense Pedestrian Environment. 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), 2020.
  • Qi Liu#, Jingxiang Guo#, Sixi Lin, Shuaikang Ma, Jinxuan Zhu, Yanjie Li. MASQ: Multi-Agent Reinforcement Learning for Single Quadruped Robot Locomotion. arXiv preprint arXiv:2408.13759
  • Qi Liu, Jianqi Gao, et al. Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective. arXiv preprint arXiv:2408.13750. Submitted to IROS 2025. (Top robotics conference, CSRankings)
  • Dongjie Zhu, Zhuo Yang, Tianhang Wu, Luzhou Ge, Xuesong Li, Qi Liu*, Xiang Li*. Dynamic Legged Ball Manipulation on Rugged Terrains with Hierarchical Reinforcement Learning. Submitted to IROS 2025. (Top robotics conference, CSRankings)
  • Under review: IEEE TNNLS, IEEE TETC, IROS 2025 (2 papers), KBS

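Embodied Intelligence Examples (Selected)

  • Quadruped robot soccer-ball control on complex terrain based on hierarchical reinforcement learning
  • Control of a single quadruped robot based on multi-agent deep reinforcement learning
  • Robotic arm and multi-fingered dexterous hand manipulation based on deep reinforcement learning
  • Bipedal humanoid robot control
  • Mapless robot navigation based on hierarchical safe deep reinforcement learning
  • Robot navigation in dense pedestrian environments based on deep reinforcement learning

  • Autonomous parking in complex scenes based on trajectory planning
  • Autonomous driving based on safe deep reinforcement learning
  • Task assignment and path planning in intelligent warehouses based on multi-agent deep reinforcement learning
  • LiDAR-based localization and mapping for autonomous vehicles in urban environments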

Academic Service

Reviewer for several international journals and conferences:

  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
  • IEEE Transactions on Intelligent Transportation Systems (TITS)
  • IEEE Transactions on Cybernetics (TCYB)
  • IEEE Transactions on Intelligent Vehicles (TIV)
  • IEEE International Conference on Robotics and Automation (ICRA)
  • IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

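Honors and Awards (Selected)

  • Outstanding Doctoral Graduate, university level, 2024
  • Outstanding Student Leader, university level, 2021
  • National Encouragement Scholarship, provincial level, 2016
  • National Scholarship, national level, 2015
  • National Scholarship, national level, 2014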

Contact

  • Email: liuqi@mail.neu.edu.cn, liuqi8827@gmail.com
  • Mobile and WeChat: 13713517967
  • Office: Room 410, Block B, Architecture Building, Northeastern University (Hunnan Campus)
