Hi, I am Linh or Levi. I'm a PhD candidate at Applied Artificial Intelligence Institute (A2I2), Deakin University, advised by Prof. Sunil Gupta and Dr. Hung Tran The . Previously, I earned a first-class honors degree in Information Technology from the University of Engineering and Technology, Vietnam National University, Hanoi.
My research interests center on Machine Learning, with a current focus on Reinforcement Learning (RL) in offline, online, and offline-to-online settings. I am also interested in the applications of Diffusion Models and Foundation Models (LLMs/VLMs) within the context of RL.
I'm always open to discussions and collaborations. Please feel free to reach out via email if you have a potential opportunity in mind, are interested in collaborating, or would just like to chat about research.
Outside of research, I love reading books, listening music, taking photos, running (both road and trails marathon), cycling, and hiking. I'm also learning to play (classic) guitar.
") does not match the recommended repository name for your site ("").
", so that your site can be accessed directly at "http://".
However, if the current repository name is intended, you can ignore this message by removing "{% include widgets/debug_repo_name.html %}" in index.html.
",
which does not match the baseurl ("") configured in _config.yml.
baseurl in _config.yml to "".
Linh Le, Minh Hoang Nguyen, Duc Kieu, Hung Le, Hung The Tran, Sunil Gupta
European Conference on Artificial Intelligence (ECAI) 2025
We study cross-domain offline RL with limited target data, where neural domain-gap estimators often overfit and only part of the source dataset overlaps with the target domain. We propose DmC, combining a k-NN proximity estimator with a nearest-neighbor–guided diffusion model to generate target-aligned source samples, and show strong gains over prior methods on MuJoCo benchmarks.
Linh Le, Minh Hoang Nguyen, Duc Kieu, Hung Le, Hung The Tran, Sunil Gupta
We study cross-domain offline RL with limited target data, where neural domain-gap estimators often overfit and only part of the source dataset overlaps with the target domain. We propose DmC, combining a k-NN proximity estimator with a nearest-neighbor–guided diffusion model to generate target-aligned source samples, and show strong gains over prior methods on MuJoCo benchmarks.
Linh Le, Hung The Tran, Sunil Gupta
International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2024
We tackle cross-domain policy transfer under large dynamics mismatch, where the common "full support" assumption (the simulator covers all target transitions) is unrealistic. We propose a simple method that skews and extends source support toward target support to reduce support deficiencies, and show consistent gains over prior approaches across diverse benchmarks.
Linh Le, Hung The Tran, Sunil Gupta
We tackle cross-domain policy transfer under large dynamics mismatch, where the common "full support" assumption (the simulator covers all target transitions) is unrealistic. We propose a simple method that skews and extends source support toward target support to reduce support deficiencies, and show consistent gains over prior approaches across diverse benchmarks.
Reviews: TMLR (2025, 2026), RLC (2025, 2026), ECML(2025), ACML(2025), EWRL (2025).