A transfer reinforcement learning approach for capacity sharing in Beyond 5G networks

The use of Reinforcement Learning (RL) techniques has been widely addressed in the literature to cope with capacity sharing in 5G Radio Access Network (RAN) slicing. These algorithms consider a training process to learn an optimal capacity sharing decision-making policy, which is later applied to th...

Descripción completa

Detalles Bibliográficos
Autores: Vilà Muñoz, Irene|||0000-0002-7086-9591, Pérez Romero, Jordi|||0000-0001-9131-5013, Sallent Roig, Oriol|||0000-0002-2114-1406
Tipo de recurso: artículo
Fecha de publicación:2024
País:España
Institución:Universitat Politècnica de Catalunya (UPC)
Repositorio:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglés
OAI Identifier:oai:upcommons.upc.edu:2117/423666
Acceso en línea:https://hdl.handle.net/2117/423666
https://dx.doi.org/10.3390/fi16120434
Access Level:acceso abierto
Palabra clave:RAN slicing
Capacity sharing
Reinforcement learning
Transfer learning
Transfer reinforcement learning
Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
Descripción
Sumario:The use of Reinforcement Learning (RL) techniques has been widely addressed in the literature to cope with capacity sharing in 5G Radio Access Network (RAN) slicing. These algorithms consider a training process to learn an optimal capacity sharing decision-making policy, which is later applied to the RAN environment during the inference stage. When relevant changes occur in the RAN, such as the deployment of new cells in the network, RL-based capacity sharing solutions require a re-training process to update the optimal decision-making policy, which may require long training times. To accelerate this process, this paper proposes a novel Transfer Learning (TL) approach for RL-based capacity sharing solutions in multi-cell scenarios that is implementable following the Open-RAN (O-RAN) architecture and exploits the availability of computing resources at the edge for conducting the training/inference processes. The proposed approach allows transferring the weights of the previously learned policy to learn the new policy to be used after the addition of new cells. The performance assessment of the TL solution highlights its capability to reduce the training process duration of the policies when adding new cells. Considering that the roll-out of 5G networks will continue for several years, TL can contribute to enhancing the practicality and feasibility of applying RL-based solutions for capacity sharing.