Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

Calabuig, J. M.|||0000-0001-8398-8664; Sánchez Pérez, Enrique Alfonso|||0000-0001-8854-3154; Falciani, H.

Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

[EN] We consider a quasi-metric topological structure for the construction of a new reinforcement learning model in the framework of financial markets. It is based on a Lipschitz type extension of reward functions defined in metric spaces. Specifically, the McShane and Whitney extensions are conside...

ver descrição completa

Detalhes bibliográficos
Autores:	Calabuig, J. M.\|\|\|0000-0001-8398-8664, Sánchez Pérez, Enrique Alfonso\|\|\|0000-0001-8854-3154, Falciani, H.
Formato:	artículo
Fecha de publicación:	2020
País:	España
Recursos:	Universitat Politècnica de València (UPV)
Repositorio:	RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia
Idioma:	inglés
OAI Identifier:	oai:riunet.upv.es:10251/172597
Acesso em linha:	https://riunet.upv.es/handle/10251/172597
Access Level:	acceso abierto
Palavra-chave:	Pseudo-metric Reinforcement learning Lipschitz extension Mathematical economics Financial market Model MATEMATICA APLICADA

Descrição
Resumo:	[EN] We consider a quasi-metric topological structure for the construction of a new reinforcement learning model in the framework of financial markets. It is based on a Lipschitz type extension of reward functions defined in metric spaces. Specifically, the McShane and Whitney extensions are considered for a reward function which is defined by the total evaluation of the benefits produced by the investment decision at a given time. We define the metric as a linear combination of a Euclidean distance and an angular metric component. All information about the evolution of the system from the beginning of the time interval is used to support the extension of the reward function, but in addition this data set is enriched by adding some artificially produced states. Thus, the main novelty of our method is the way we produce more states-which we call "dreams"-to enrich learning. Using some known states of the dynamical system that represents the evolution of the financial market, we use our technique to simulate new states by interpolating real states and introducing some random variables. These new states are used to feed a learning algorithm designed to improve the investment strategy by following a typical reinforcement learning scheme. (C) 2020 Elsevier B.V. All rights reserved.

Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

Registos relacionados