﻿{"id":1278,"date":"2018-02-13T11:57:10","date_gmt":"2018-02-13T11:57:10","guid":{"rendered":"http:\/\/uni.hi.is\/helmut\/?page_id=1278"},"modified":"2019-08-30T11:50:21","modified_gmt":"2019-08-30T11:50:21","slug":"comparison-of-big-data-and-high-performance-computing-platforms-and-applications","status":"publish","type":"page","link":"https:\/\/uni.hi.is\/helmut\/research\/comparison-of-big-data-and-high-performance-computing-platforms-and-applications\/","title":{"rendered":"Comparison of Big Data and High-Performance Computing platforms and applications (since 2017)"},"content":{"rendered":"<p>Big data analysis requires parallel processing. While the standard technology for huge non-embarrassingly parallel, but rather tightly-coupled computational  problems  is High-Performance Computing (HPC), highly  praised contenders for huge parallel processing problems are big data processing frameworks such as Apache Hadoop or Apache Spark. To  be  able  to  decide whether  HPC  or  big  data  platforms  are  better  suited  for  big data problems, this projects investigates and compares the two paradigms and their platforms.<br \/>\nAs a case study, the run-time performance and scalability of different implementations of the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) clustering algorithm is investigated and compared.<\/p>\n<h3>Publications<\/h3>\n<p>Helmut Neukirchen.<br \/>\n<i>Elephant against Goliath: Performance of Big Data versus High-Performance Computing DBSCAN Clustering Implementations.<\/i>Simulation Science. First International Workshop, SimScience 2017, G\u00f6ttingen, Germany, April 27\u201328, 2017, Revised Selected Papers, Communications in Computer and Information Science (CCIS), volume 889, DOI: <a href=\"https:\/\/doi.org\/10.1007\/978-3-319-96271-9_16\">978-3-319-96271-9_16<\/a>, Springer 2018.<br \/>\n<a href=\"https:\/\/notendur.hi.is\/~helmut\/publications\/simscience_ccis_dbscan.pdf\">Download<\/a><\/p>\n<p>Helmut Neukirchen.<br \/>\n<i>Performance of Big Data versus High-Performance Computing: Some Observations.<\/i><br \/>\nExtended Abstract. Clausthal-G\u00f6ttingen International Workshop on Simulation Science, 27-28 April 2017, G\u00f6ttingen, Germany. <a href=\"http:\/\/www.simscience2017.uni-goettingen.de\/wp-content\/uploads\/2018\/01\/SimScienceWorkshop2017_final.pdf\">Proceedings of Accepted Abstracts<\/a>, Clausthal-G\u00f6ttingen Simulation Science Center, 2017, pp. 93-95.<br \/>\n<a href=\"https:\/\/notendur.hi.is\/~helmut\/publications\/PerformanceofBigDataversusHPC.pdf\">Download<\/a><\/p>\n<p>Helmut Neukirchen.<br \/>\n<i>Survey and Performance Evaluation of DBSCAN Spatial Clustering Implementations for Big Data and High-Performance Computing Paradigms.<\/i>Technical Report VHI-01-2016, Engineering Research Institute, University of Iceland, Reykjavik, Iceland, November 2016.<br \/>\n<a href=\"https:\/\/notendur.hi.is\/~helmut\/publications\/VHI-01-2016.pdf\">Download<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Big data analysis requires parallel processing. While the standard technology for huge non-embarrassingly parallel, but rather tightly-coupled computational problems is High-Performance Computing (HPC), highly praised contenders for huge parallel processing problems are big data processing frameworks such as Apache Hadoop or Apache Spark. To be able to decide whether HPC or big data platforms are [&hellip;]<\/p>\n","protected":false},"author":512,"featured_media":0,"parent":20,"menu_order":6,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-1278","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/pages\/1278","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/users\/512"}],"replies":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/comments?post=1278"}],"version-history":[{"count":3,"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/pages\/1278\/revisions"}],"predecessor-version":[{"id":1557,"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/pages\/1278\/revisions\/1557"}],"up":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/pages\/20"}],"wp:attachment":[{"href":"https:\/\/uni.hi.is\/helmut\/wp-json\/wp\/v2\/media?parent=1278"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}