Dongsu Lee

Hello! I am a PhD student at the University of Texas at Austin, where I’m supervised by Prof. Amy Zhang in the MIDI Lab. Previous affiliation. I collaborated with Prof. Ding Zhao in the Safe AI Lab, Carnegie Mellon University (CMU); with Prof. Minhae Kwon in the BMIL; and Prof. Sangsoo Kim in the Post-genome Informatics Lab at Soongsil University in Seoul, South Korea.

I’m interested in realistic RL problems, e.g, partial observability, coordination with unseen partners, and human-robot collaboration. Then, I aim to make an RL algorithm applicable in via both self-play and data-driven techniques.

I am always happy to talk about research, so feel free to reach out if you want to chat!

Email CV Scholar GitHub

Selected Publications

A Recipe for Stable Offline Multi-agent Reinforcement Learning

Dongsu Lee, Daehee Lee, Amy Zhang
Preprint

paper| code| write-up

Multi-agent Coordination via Flow Matching

Dongsu Lee, Daehee Lee, Amy Zhang
ICLR 2026

paper| code| write-up

Unifying Agent Interaction and World Information for Multi-agent Coordination

Dongsu Lee, Daehee Lee, Yaru Niu, Honguk Woo, Amy Zhang, Ding Zhao
NeurIPS 2025 ARLET Workshop (Oral)

paper| code| project page

Policy Compatible Skill Incremental Learning via Lazy Learning Interface

Daehee Lee, Dongsu Lee, TaeYoon Kwack, Wonje Choi, Honguk Woo
NeurIPS 2025 (Spotlight)

paper| code

Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning

Dongsu Lee, Minhae Kwon
NeurIPS 2024

paper| project page| talk

All Publications

2026

A Recipe for Stable Offline Multi-agent Reinforcement Learning

Dongsu Lee, Daehee Lee, Amy Zhang
Preprint

paper| code| write-up

Multi-agent Coordination via Flow Matching

Dongsu Lee, Daehee Lee, Amy Zhang
ICLR 2026

paper| code| write-up

2025

Unifying Agent Interaction and World Information for Multi-agent Coordination

Dongsu Lee, Daehee Lee, Yaru Niu, Honguk Woo, Amy Zhang, Ding Zhao
NeurIPS 2025 ARLET Workshop (Oral)

paper| code| project page

Policy Compatible Skill Incremental Learning via Lazy Learning Interface

Daehee Lee, Dongsu Lee, TaeYoon Kwack, Wonje Choi, Honguk Woo
NeurIPS 2025 (Spotlight)

paper| code

Scenario-free Autonomous Driving with Multi-task Offline-to-online Reinforcement Learning

Dongsu Lee, Minhae Kwon
IEEE Transactions on Intelligent Transportation Systems

paper| code

Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning

Dongsu Lee, Minhae Kwon
ICML 2025

paper| project page| talk (5min)

2024

Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning

Dongsu Lee, Minhae Kwon
NeurIPS 2024

paper| project page| talk (5min)

AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset

Dongsu Lee, Chanin Eom, Minhae Kwon
ICRA 2024

paper| project page| talk (5min)| media

Write-ups

Blog A lazy, borrowed recipe

Mar 2026 • Discourse: right direction of the offline MARL community.

Blog Fast and expressive multi-agent coordination

Oct 2025 • Capturing complex joint behavior while keeping inference fast.

Blog Research statement: Towards generalizable MARL

Aug 2025 • Bottlenecks and a roadmap for generalizable MARL.