Partial Observability and Domain Randomization in RL-Based Strategy for Optical Cavity Locking Optimization
A. Svizzeretto* and M. Bawaj
*: corresponding author
Full text: pdf
Pre-published on: January 06, 2026
Published on:
Abstract
The purpose of these proceedings is to report on developments of RL-based cavity locking strategy carried out after the presentation given at the MODE workshop 2025. Reflecting on discussions held during the workshop, we examine how concepts such as partial observability and domain randomization critically impact the training process and the generalization of control policies in deep reinforcement learning. We keep the focus on the topic of our work, concerning reinforcement learning techniques for lock acquisition optimization in gravitational wave detection layout. Future directions are outlined to improve robustness and real-world applicability, including domain-randomized parameters and memory based architectures for addressing partial observability.

DOI: https://doi.org/10.22323/1.491.0035
How to cite

Metadata are provided both in article format (very similar to INSPIRE) as this helps creating very compact bibliographies which can be beneficial to authors and readers, and in proceeding format which is more detailed and complete.

Open Access
Creative Commons LicenseCopyright owned by the author(s) under the term of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.