Linh Le
PhD Candidate
A2I2, Deakin University Logo
About Me Cat

Hi, I am Linh or Levi. I'm a PhD candidate at Applied Artificial Intelligence Institute (A2I2), Deakin University, advised by Prof. Sunil Gupta and Dr. Hung Tran The . Previously, I earned a first-class honors degree in Information Technology from the University of Engineering and Technology, Vietnam National University, Hanoi.

My research interests center on Machine Learning, with a current focus on Reinforcement Learning (RL) in offline, online, and offline-to-online settings. I am also interested in the applications of Diffusion Models and Foundation Models (LLMs/VLMs) within the context of RL.

I'm always open to discussions and collaborations. Please feel free to reach out via email if you have a potential opportunity in mind, are interested in collaborating, or would just like to chat about research.

Outside of research, I love reading books, listening music, taking photos, running (both road and trails marathon), cycling, and hiking. I'm also learning to play (classic) guitar.

News
2026
I’m delighted to announce that I’ll be joining Professor Scott Sanner's Lab at the University of Toronto as a Postdoctoral Researcher, where I’ll work on world models and reinforcement learning, with applications to traffic signal control.
Feb 12
2025
Attend ECML-PKDD 2025 and EWRL 2025.
Sep 15
Our paper on Offline Cross domain RL got accepted (Oral presentation) at ECAI 2025.
Jul 15
Our paper on Hybrid Robust RL got accepted at ECML 2025.
May 15
Our paper on Offline RL got accepted at IJCAI 2025. Congrats Hoang.
Apr 20
2024
Our paper on Cross-domain Offline RL with limited target samples setting got accepted at GenPlan2025 workshopat AAAI 2025.
Dec 01
2023
Our paper on Cross-domain in Reinforcement Learning with Deficient Support was accepted (Oral presentation) at AAMAS 2024 in Auckland, New Zealand. Grateful to my supervisors for their invaluable support.
Dec 20
2022
I moved to Geelong, VIC, Australia and started my PhD journey.
Aug 10
I received the offer letter from Applied Artificial Intelligence Institute, Deakin University for a PhD position with Scholarship!
Jun 01
Selected Publications (view all )
ECAI
DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning

Linh Le, Minh Hoang Nguyen, Duc Kieu, Hung Le, Hung The Tran, Sunil Gupta

European Conference on Artificial Intelligence (ECAI) 2025

We study cross-domain offline RL with limited target data, where neural domain-gap estimators often overfit and only part of the source dataset overlaps with the target domain. We propose DmC, combining a k-NN proximity estimator with a nearest-neighbor–guided diffusion model to generate target-aligned source samples, and show strong gains over prior methods on MuJoCo benchmarks.

DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning

Linh Le, Minh Hoang Nguyen, Duc Kieu, Hung Le, Hung The Tran, Sunil Gupta

We study cross-domain offline RL with limited target data, where neural domain-gap estimators often overfit and only part of the source dataset overlaps with the target domain. We propose DmC, combining a k-NN proximity estimator with a nearest-neighbor–guided diffusion model to generate target-aligned source samples, and show strong gains over prior methods on MuJoCo benchmarks.

ECAI
AAMAS
Policy Learning for Off-Dynamics RL with Deficient Support

Linh Le, Hung The Tran, Sunil Gupta

International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2024

We tackle cross-domain policy transfer under large dynamics mismatch, where the common "full support" assumption (the simulator covers all target transitions) is unrealistic. We propose a simple method that skews and extends source support toward target support to reduce support deficiencies, and show consistent gains over prior approaches across diverse benchmarks.

Policy Learning for Off-Dynamics RL with Deficient Support

Linh Le, Hung The Tran, Sunil Gupta

We tackle cross-domain policy transfer under large dynamics mismatch, where the common "full support" assumption (the simulator covers all target transitions) is unrealistic. We propose a simple method that skews and extends source support toward target support to reduce support deficiencies, and show consistent gains over prior approaches across diverse benchmarks.

AAMAS
All publications
Experience
  • Sep. 2020 - Jul. 2022
    VNU University of Engineering and Technology
    VNU University of Engineering and Technology
    Teaching assistant for the Computer science courses: Introduction to Informatics, Advanced Programming, Data Structures and Algorithms, Image Processing.
  • Sep. 2020 - Jul. 2022
    UET AILab, Vietnam National University
    UET AILab, Vietnam National University
    Reasearch about Medical Image Translation.
Education
  • Aug. 2022 - present
    A2I2, Deakin University
    A2I2, Deakin University
    Deakin Applied Artificial Intelligence Initiative
    Ph.D. Candidate
  • Sep. 2016 - Jul. 2020
    University of Engineering and Technology, Vietnam National University, Hanoi
    University of Engineering and Technology, Vietnam National University, Hanoi
    B.S. in Computer Science (Honor Program)
🏆 Honors & Awards
  • Deakin University Postgraduate Research Scholarship for PhD candidature.
    2022
  • Certificate of merit awarded by the VNU University of Engineering and Technology for Outstanding student of the entire course.
    2020
  • 1st Digital Race FPT (Autonomous car competition) 2019, round UET.
    2019
Misc
Services

Reviews: TMLR (2025, 2026), RLC (2025, 2026), ECML(2025), ACML(2025), EWRL (2025).