Pretrain soft q-learning with imperfect demonstrations Xiaoqin Zhang, Yunfei Li, Huimin Ma, Xiong Luo May 9, 2019 PDF Cite Yunfei Li PhD student My research interests include reinforcement learning and robotics.