Parallel programming paradigms and frameworks in big data era

With Cloud Computing emerging as a promising new approach for ad-hoc parallel data processing, major companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for customers to access these services and to deploy their programs. We have ent...

ver descrição completa

Detalhes bibliográficos
Autores: Dobre, Ciprian M.|||0000-0003-4638-7725, Xhafa Xhafa, Fatos|||0000-0001-6569-5497
Tipo de documento: artigo
Data de publicação:2014
País:España
Recursos:Universitat Politècnica de Catalunya (UPC)
Repositório:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglês
OAI Identifier:oai:upcommons.upc.edu:2117/125606
Acesso em linha:https://hdl.handle.net/2117/125606
https://dx.doi.org/10.1007/s10766-013-0272-7
Access Level:Acceso aberto
Palavra-chave:Parallel programming (Computer science)
Big data
Cloud computing
MapReduce
Programming models
Challenges
Programació en paral·lel (Informàtica)
Macrodades
Computació en núvol
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
id ES_15b4fadce818a3f48dda094e893f8f26
oai_identifier_str oai:upcommons.upc.edu:2117/125606
network_acronym_str ES
network_name_str España
repository_id_str
spelling Parallel programming paradigms and frameworks in big data eraDobre, Ciprian M.|||0000-0003-4638-7725Xhafa Xhafa, Fatos|||0000-0001-6569-5497Parallel programming (Computer science)Big dataCloud computingMapReduceProgramming modelsChallengesProgramació en paral·lel (Informàtica)MacrodadesComputació en núvolÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·lelesWith Cloud Computing emerging as a promising new approach for ad-hoc parallel data processing, major companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for customers to access these services and to deploy their programs. We have entered the Era of Big Data. The explosion and profusion of available data in a wide range of application domains rise up new challenges and opportunities in a plethora of disciplines-ranging from science and engineering to biology and business. One major challenge is how to take advantage of the unprecedented scale of data-typically of heterogeneous nature-in order to acquire further insights and knowledge for improving the quality of the offered services. To exploit this new resource, we need to scale up and scale out both our infrastructures and standard techniques. Our society is already data-rich, but the question remains whether or not we have the conceptual tools to handle it. In this paper we discuss and analyze opportunities and challenges for efficient parallel data processing. Big Data is the next frontier for innovation, competition, and productivity, and many solutions continue to appear, partly supported by the considerable enthusiasm around the MapReduce paradigm for large-scale data analysis. We review various parallel and distributed programming paradigms, analyzing how they fit into the Big Data era, and present modern emerging paradigms and frameworks. To better support practitioners interesting in this domain, we end with an analysis of on-going research challenges towards the truly fourth generation data-intensive science.Peer Reviewed20142014-10-0120182018-12-11journal articlehttp://purl.org/coar/resource_type/c_6501AMhttp://purl.org/coar/version/c_ab4af688f83e57aainfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/2117/125606https://dx.doi.org/10.1007/s10766-013-0272-7reponame:UPCommons. Portal del coneixement obert de la UPCinstname:Universitat Politècnica de Catalunya (UPC)Inglésengopen accesshttp://purl.org/coar/access_right/c_abf2info:eu-repo/semantics/openAccessoai:upcommons.upc.edu:2117/1256062026-05-27T15:37:01Z
dc.title.none.fl_str_mv Parallel programming paradigms and frameworks in big data era
title Parallel programming paradigms and frameworks in big data era
spellingShingle Parallel programming paradigms and frameworks in big data era
Dobre, Ciprian M.|||0000-0003-4638-7725
Parallel programming (Computer science)
Big data
Cloud computing
MapReduce
Programming models
Challenges
Programació en paral·lel (Informàtica)
Macrodades
Computació en núvol
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
title_short Parallel programming paradigms and frameworks in big data era
title_full Parallel programming paradigms and frameworks in big data era
title_fullStr Parallel programming paradigms and frameworks in big data era
title_full_unstemmed Parallel programming paradigms and frameworks in big data era
title_sort Parallel programming paradigms and frameworks in big data era
dc.creator.none.fl_str_mv Dobre, Ciprian M.|||0000-0003-4638-7725
Xhafa Xhafa, Fatos|||0000-0001-6569-5497
author Dobre, Ciprian M.|||0000-0003-4638-7725
author_facet Dobre, Ciprian M.|||0000-0003-4638-7725
Xhafa Xhafa, Fatos|||0000-0001-6569-5497
author_role author
author2 Xhafa Xhafa, Fatos|||0000-0001-6569-5497
author2_role author
dc.subject.none.fl_str_mv Parallel programming (Computer science)
Big data
Cloud computing
MapReduce
Programming models
Challenges
Programació en paral·lel (Informàtica)
Macrodades
Computació en núvol
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
topic Parallel programming (Computer science)
Big data
Cloud computing
MapReduce
Programming models
Challenges
Programació en paral·lel (Informàtica)
Macrodades
Computació en núvol
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
description With Cloud Computing emerging as a promising new approach for ad-hoc parallel data processing, major companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for customers to access these services and to deploy their programs. We have entered the Era of Big Data. The explosion and profusion of available data in a wide range of application domains rise up new challenges and opportunities in a plethora of disciplines-ranging from science and engineering to biology and business. One major challenge is how to take advantage of the unprecedented scale of data-typically of heterogeneous nature-in order to acquire further insights and knowledge for improving the quality of the offered services. To exploit this new resource, we need to scale up and scale out both our infrastructures and standard techniques. Our society is already data-rich, but the question remains whether or not we have the conceptual tools to handle it. In this paper we discuss and analyze opportunities and challenges for efficient parallel data processing. Big Data is the next frontier for innovation, competition, and productivity, and many solutions continue to appear, partly supported by the considerable enthusiasm around the MapReduce paradigm for large-scale data analysis. We review various parallel and distributed programming paradigms, analyzing how they fit into the Big Data era, and present modern emerging paradigms and frameworks. To better support practitioners interesting in this domain, we end with an analysis of on-going research challenges towards the truly fourth generation data-intensive science.
publishDate 2014
dc.date.none.fl_str_mv 2014
2014-10-01
2018
2018-12-11
dc.type.none.fl_str_mv journal article
http://purl.org/coar/resource_type/c_6501
AM
http://purl.org/coar/version/c_ab4af688f83e57aa
dc.type.openaire.fl_str_mv info:eu-repo/semantics/article
format article
dc.identifier.none.fl_str_mv https://hdl.handle.net/2117/125606
https://dx.doi.org/10.1007/s10766-013-0272-7
url https://hdl.handle.net/2117/125606
https://dx.doi.org/10.1007/s10766-013-0272-7
dc.language.none.fl_str_mv Inglés
eng
language_invalid_str_mv Inglés
language eng
dc.rights.none.fl_str_mv open access
http://purl.org/coar/access_right/c_abf2
dc.rights.openaire.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv open access
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:UPCommons. Portal del coneixement obert de la UPC
instname:Universitat Politècnica de Catalunya (UPC)
instname_str Universitat Politècnica de Catalunya (UPC)
reponame_str UPCommons. Portal del coneixement obert de la UPC
collection UPCommons. Portal del coneixement obert de la UPC
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1869403811064315904
score 15,300724