Towards fine tuning wake steering policies in the field: an imitation-based approach
Abstract
Yaw misalignment strategies can increase the power output of wind farms by mitigating wake effects, but finding optimal yaw angles requires overcoming both modeling errors and the complexity of the problem, which grows with the size of the farm. Recent works have therefore proposed decentralized multi-agent reinforcement learning (MARL) as a model-free, data-driven alternative that learns online. These solutions have led to significant increases in total power production in experiments with both static and dynamic wind farm simulators. Yet experiments in dynamic simulations suggest that convergence time remains too long for online learning on real wind farms. As an improvement, baseline policies obtained by offline optimization through steady-state models can be fed as inputs to an online reinforcement learning algorithm. However, this method does not guarantee a smooth transfer of the policies to the real wind farm, a problem aggravated when policies and value functions are estimated with function approximators such as multi-layer neural networks. We propose an imitation approach, in which learning a policy is first treated as a supervised learning problem with references derived from steady-state wind farm models, and then as an online reinforcement learning task for adaptation in the field. This approach leads to significant increases in the amount of energy produced over a lookup table (LUT) baseline in experiments conducted with the mid-fidelity dynamic simulator FAST.Farm under both static and varying wind conditions.
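A minimal sketch (not the authors' code) of the two-phase idea described in the abstract: (1) behavior cloning of yaw set-points taken from a lookup table derived offline from a steady-state wake model, then (2) online fine-tuning of the same network from measured farm power. All names, network sizes, and the simple score-function update below are illustrative assumptions, not the paper's MARL algorithm; `measure_power` is a hypothetical callback standing in for field or simulator feedback.

```python
import torch
import torch.nn as nn


class YawPolicy(nn.Module):
    """Maps local wind measurements (e.g. speed, direction) to a yaw offset."""
    def __init__(self, n_inputs=2, hidden=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_inputs, hidden), nn.Tanh(),
            nn.Linear(hidden, 1),
        )

    def forward(self, x):
        return self.net(x)


def pretrain_from_lut(policy, wind_samples, lut_yaws, epochs=200, lr=1e-3):
    """Phase 1: supervised imitation of the LUT baseline (offline)."""
    opt = torch.optim.Adam(policy.parameters(), lr=lr)
    for _ in range(epochs):
        loss = nn.functional.mse_loss(policy(wind_samples), lut_yaws)
        opt.zero_grad()
        loss.backward()
        opt.step()
    return policy


def finetune_online(policy, measure_power, wind_stream, lr=1e-4, noise_std=1.0):
    """Phase 2: illustrative online adaptation from measured farm power.

    Gaussian exploration noise is added to the imitated yaw command and the
    policy mean is nudged towards actions that produced above-baseline power
    (a crude REINFORCE-style stand-in for the paper's online RL scheme).
    """
    opt = torch.optim.Adam(policy.parameters(), lr=lr)
    baseline = None
    for wind in wind_stream:
        mean_yaw = policy(wind)
        yaw = (mean_yaw + torch.randn_like(mean_yaw) * noise_std).detach()
        power = measure_power(wind, yaw)  # scalar measured farm power
        baseline = power if baseline is None else 0.99 * baseline + 0.01 * power
        advantage = power - baseline
        # Gaussian log-likelihood of the sampled action under the current mean.
        log_prob = -((yaw - mean_yaw) ** 2).sum() / (2 * noise_std ** 2)
        loss = -advantage * log_prob
        opt.zero_grad()
        loss.backward()
        opt.step()
    return policy
```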
Domains
Computational Physics [physics.comp-ph]