Review of Deep Reinforcement Learning (DRL) Algorithms for MPPT and Partial Shading in Solar PV applications

Ameze Big-Alabo

doi:10.55579/jaec.2026102.521

ISSN (Online): 2588-123X
ISSN (Print):

About TDTU

Ton Duc Thang University (TDTU) is a public university with the main campus located in vibrant Ho Chi Minh City, Vietnam’s economic and educational hub. Founded in 1997, TDTU has developed into one of the largest and fastest growing universities in Vietnam with more than 22,000 students, enrolled in undergraduate and graduate programs ranging from science, engineering to business management, law, and humanities. To foster the country’s human resources and best serve the nation in the knowledge based economy of the 21st century, TDTU is combining vocational training with high-level research. The establishment of JAEC is one of TDTU’s efforts in this direction. More

Publication Information

Publisher

Ton Duc Thang University

Honorary Editor-in-Chief

Tran Trong Dao

Executive Editor

Nguyen Trung Thang

Chairman of the Editorial Board

Vice Chairman of the Editorial Board

Editorial Board

Hari Mohan Srivastava

Juan Carlos Burguillo Rial

Akhil Garg

Nguyen Pham Trung Hieu

Mahdi Shariati

Aleš Zamuda

Ngo Son Tung

User

Guide for Authors

View 'Guide for Authors' online

Submit Your Paper

In order to submit your paper, please login and navigate to the author page.

If you do not have an account, please consider registering one.

Track Your Paper

Track accepted paper
Once your article has been accepted you will receive an email from Author Services. This email contains a link to check the status of your articles.

Click here to track your accepted papers

Journal Content

Browse

Abstracting/Indexing

Review of Deep Reinforcement Learning (DRL) Algorithms for MPPT and Partial Shading in Solar PV applications

Ameze Big-Alabo

Abstract

The present study reviews Deep Reinforcement Learning (DRL) algorithms as applied to Photovoltaic (PV) systems. A literature survey was conducted on various DRL techniques for Maximum Power Point Tracking (MPPT) and Partial Shading Conditions. The survey shows Deep Deterministic Policy Gradient (DDPG) to be the most implemented technique because of its fast convergence speed. Deep Q-Network (DQN) was considered to achieve faster response than DDPG. Twin Delayed Deep Deterministic Policy Gradient (TD3) was considered preferable, while Soft Actor-Critic (SAC), approach better eliminates power oscillations, under partially shaded conditions. The implementation of DRL-based MPPT for critical and effective learning requires defining the state variable, action variable and reward function of the PV module. It is therefore important to observe the voltage, current, irradiance, and temperature data that can allow for easy adaptation to changing environmental conditions. DRL requires higher computational effort compared to conventional methods due to its training phase. However, the trained models can operate with relatively low computational effort, thus making it a promising approach for real-time applications. The literature survey also showed that the exploration–exploitation trade-off is a fundamental challenge in DRL-based MPPT control. Therefore, effective management of this trade-off, as well as bridging the gap between simulation and real-world hardware implementation, will enable DRL to become a practical solution for MPPT in PV systems.

Keywords

DRL Algorithms, Exploration-Exploitation, MPPT techniques, Solar - PV.

Full Text:

PDF

Time cited: 0

Download citation

DOI: http://dx.doi.org/10.55579/jaec.2026102.521

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Owner: Ton Duc Thang University. All rights reserved.
License No: 507/GP-BTTTT, issued: 18th November 2016
Contact address: 19, Nguyen Huu Tho Street, Tan Hung Ward, Ho Chi Minh City
Tel: +84-28 3775 5037 Fax: +84-28 3775 5055

This work is licensed under a Creative Commons Attribution 4.0 International License.

Username
Password