Publications
|
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu, Yiding Chen, Gokul Swamy, Kianté Brantley, Wen Sun
ICLR, 2025
[Code]
|
Computationally Efficient RL under Linear Bellman Completeness for Deterministic
Dynamics
Runzhe Wu*, Ayush Sekhari*, Akshay Krishnamurthy, Wen Sun
ICLR, 2025 (Oral)
[Talk at RL Theory Seminars]
|
Making RL with Preference-based Feedback Efficient via Randomization
Runzhe Wu, Wen Sun
ICLR, 2024
|
Contextual Bandits and Imitation Learning via Preference-Based Active Queries
(Alphabetical order) Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
NeurIPS, 2023 (Also at the ILHF & MFPL Workshops @ ICML, 2023)
[Code]
|
Selective Sampling and Imitation Learning via Online Regression
(Alphabetical order) Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
NeurIPS, 2023 (Also at the ILHF Workshop @ ICML, 2023)
|
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun
NeurIPS, 2023
[Code]
|
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu, Masatoshi Uehara, Wen Sun
ICML, 2023
[Code]
|
MALib: A parallel framework for population-based multi-agent reinforcement learning
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang,
Weinan Zhang, Jun Wang
JMLR, 2023
[Website]   [Code]
|
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value
Iteration
Runzhe Wu, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang
NeurIPS, 2021
|
Website template from here.
|
|