The cost of training machine learning models over distributed data sources

Federated learning is one of the most appealing alternatives to the standard centralized learning paradigm, allowing a heterogeneous set of devices to train a machine learning model without sharing their raw data. However, it requires a central server to coordinate the learning process, thus introdu...

ver descrição completa

Detalhes bibliográficos
Autores: Guerra, Elia, Wilhelmi Roca, Francesc, Miozzo, Marco, Dini, Paolo
Formato: artículo
Estado:Versión publicada
Fecha de publicación:2023
País:España
Recursos:Varias* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
Repositorio:Recercat. Dipósit de la Recerca de Catalunya
OAI Identifier:oai:dnet:recercat____::902691ad2f5720ab584811d983a0d6b8
Acesso em linha:https://hdl.handle.net/10230/73074
http://dx.doi.org/10.1109/OJCOMS.2023.3274394
Access Level:acceso abierto
Palavra-chave:Blockchain
Decentralized learning
Edge computingk
Energy efficiency
Federated learning
Machine learning
id ES_ffbc99102cbe661ce5e44fcecba02f0d
oai_identifier_str oai:dnet:recercat____::902691ad2f5720ab584811d983a0d6b8
network_acronym_str ES
network_name_str España
repository_id_str
spelling The cost of training machine learning models over distributed data sourcesGuerra, EliaWilhelmi Roca, FrancescMiozzo, MarcoDini, PaoloBlockchainDecentralized learningEdge computingkEnergy efficiencyFederated learningMachine learningFederated learning is one of the most appealing alternatives to the standard centralized learning paradigm, allowing a heterogeneous set of devices to train a machine learning model without sharing their raw data. However, it requires a central server to coordinate the learning process, thus introducing potential scalability and security issues. In the literature, server-less federated learning approaches like gossip federated learning and blockchain-enabled federated learning have been proposed to mitigate these issues. In this work, we propose a complete overview of these three techniques, proposing a comparison according to an integral set of performance indicators, including model accuracy, time complexity, communication overhead, convergence time, and energy consumption. An extensive simulation campaign permits to draw a quantitative analysis considering both feedforward and convolutional neural network models. Results show that gossip federated learning and standard federated solution are able to reach a similar level of accuracy, and their energy consumption is influenced by the machine learning model adopted, the software library, and the hardware used. Differently, blockchain-enabled federated learning represents a viable solution for implementing decentralized learning with a higher level of security, at the cost of an extra energy usage and data sharing. Finally, we identify open issues on the two decentralized federated learning implementations and provide insights on potential extensions and possible research directions on this new research field.This work was supported in part by the Spanish Project PID2020-113832RB-C22 (ORIGIN)/MCIN/AEI/10.13039/50110001103; in part by the European Union Horizon 2020 Research and Innovation Programme under Grant 953775 (GREENEDGE); and in part by CHIST-ERA-20-SICT-004 (SONATA) under Grant PCI2021-122043-2A/AEI/10.13039/501100011033.Institute of Electrical and Electronics Engineers (IEEE)2026202620232026info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfapplication/pdfhttps://hdl.handle.net/10230/73074http://dx.doi.org/10.1109/OJCOMS.2023.3274394https://hdl.handle.net/10230/73074reponame:Recercat. Dipósit de la Recerca de Catalunyainstname:Varias* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)InglésIEEE Open Journal of the Communications Society. 2023;4:1111-26info:eu-repo/grantAgreement/ES/2PE/PID2020-113832RB-C22info:eu-repo/grantAgreement/EC/H2020/953775info:eu-repo/grantAgreement/ES/2PE/PCI2021-122043-2ACCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.https://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessoai:dnet:recercat____::902691ad2f5720ab584811d983a0d6b82026-05-29T05:05:01Z
dc.title.none.fl_str_mv The cost of training machine learning models over distributed data sources
title The cost of training machine learning models over distributed data sources
spellingShingle The cost of training machine learning models over distributed data sources
Guerra, Elia
Blockchain
Decentralized learning
Edge computingk
Energy efficiency
Federated learning
Machine learning
title_short The cost of training machine learning models over distributed data sources
title_full The cost of training machine learning models over distributed data sources
title_fullStr The cost of training machine learning models over distributed data sources
title_full_unstemmed The cost of training machine learning models over distributed data sources
title_sort The cost of training machine learning models over distributed data sources
dc.creator.none.fl_str_mv Guerra, Elia
Wilhelmi Roca, Francesc
Miozzo, Marco
Dini, Paolo
author Guerra, Elia
author_facet Guerra, Elia
Wilhelmi Roca, Francesc
Miozzo, Marco
Dini, Paolo
author_role author
author2 Wilhelmi Roca, Francesc
Miozzo, Marco
Dini, Paolo
author2_role author
author
author
dc.subject.none.fl_str_mv Blockchain
Decentralized learning
Edge computingk
Energy efficiency
Federated learning
Machine learning
topic Blockchain
Decentralized learning
Edge computingk
Energy efficiency
Federated learning
Machine learning
description Federated learning is one of the most appealing alternatives to the standard centralized learning paradigm, allowing a heterogeneous set of devices to train a machine learning model without sharing their raw data. However, it requires a central server to coordinate the learning process, thus introducing potential scalability and security issues. In the literature, server-less federated learning approaches like gossip federated learning and blockchain-enabled federated learning have been proposed to mitigate these issues. In this work, we propose a complete overview of these three techniques, proposing a comparison according to an integral set of performance indicators, including model accuracy, time complexity, communication overhead, convergence time, and energy consumption. An extensive simulation campaign permits to draw a quantitative analysis considering both feedforward and convolutional neural network models. Results show that gossip federated learning and standard federated solution are able to reach a similar level of accuracy, and their energy consumption is influenced by the machine learning model adopted, the software library, and the hardware used. Differently, blockchain-enabled federated learning represents a viable solution for implementing decentralized learning with a higher level of security, at the cost of an extra energy usage and data sharing. Finally, we identify open issues on the two decentralized federated learning implementations and provide insights on potential extensions and possible research directions on this new research field.
publishDate 2023
dc.date.none.fl_str_mv 2023
2026
2026
2026
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv https://hdl.handle.net/10230/73074
http://dx.doi.org/10.1109/OJCOMS.2023.3274394
https://hdl.handle.net/10230/73074
url https://hdl.handle.net/10230/73074
http://dx.doi.org/10.1109/OJCOMS.2023.3274394
dc.language.none.fl_str_mv Inglés
language_invalid_str_mv Inglés
dc.relation.none.fl_str_mv IEEE Open Journal of the Communications Society. 2023;4:1111-26
info:eu-repo/grantAgreement/ES/2PE/PID2020-113832RB-C22
info:eu-repo/grantAgreement/EC/H2020/953775
info:eu-repo/grantAgreement/ES/2PE/PCI2021-122043-2A
dc.rights.none.fl_str_mv https://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
application/pdf
dc.publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
dc.source.none.fl_str_mv reponame:Recercat. Dipósit de la Recerca de Catalunya
instname:Varias* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
instname_str Varias* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
reponame_str Recercat. Dipósit de la Recerca de Catalunya
collection Recercat. Dipósit de la Recerca de Catalunya
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1869425801475129344
score 15.81155