【1】RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Rui Yang · Chenjia Bai · Xiaoteng Ma · Zhaoran Wang · Chongjie Zhang · Lei Han
PPT: https://nips.cc/media/neurips-2022/Slides/53767.pdf
【2】Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief
Kaiyang Guo · Shao Yunfeng · Yanhui Geng
【3】A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu · Li Jiang · Li Jianxiong · Xianyuan Zhan
【4】A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu · Li Jiang · Li Jianxiong · Xianyuan Zhan
【5】Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu · Haixu Wu · Zihan Qiu · Jianmin Wang · Mingsheng Long
【6】LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning
Xi Chen · Ali Ghadirzadeh · Tianhe Yu · Jianhao Wang · Alex Yuan Gao · Wenzhe Li · Liang Bin · Chelsea Finn · Chongjie Zhang
【7】LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning
Xi Chen · Ali Ghadirzadeh · Tianhe Yu · Jianhao Wang · Alex Yuan Gao · Wenzhe Li · Liang Bin · Chelsea Finn · Chongjie Zhang