dc.contributor.author | Eichstädt, J | |
dc.contributor.author | Green, M | |
dc.contributor.author | Turner, M | |
dc.contributor.author | Peiró, J | |
dc.contributor.author | Moxey, D | |
dc.date.accessioned | 2018-03-29T12:51:27Z | |
dc.date.issued | 2018-04-05 | |
dc.description.abstract | Heterogeneous manycore performance-portable programming models and libraries, such as Kokkos, have been
developed to facilitate portability and maintainability of high-performance computing codes and enhance their resilience
to architectural changes. Here we investigate the suitability of the Kokkos programming model for optimizing
the performance of the high-order mesh generator NekMesh, which has been developed to efficiently generate meshes
containing millions of elements for industrial problem involving complex geometries. We describe the variational approach
for a posteriori high-order mesh optimisation employed within NekMesh and its parallel implementation. We
discuss its implementation for modern manycore massively parallel shared-memory CPU and GPU platforms using
Kokkos and demonstrate that we achieve increased performance on multicore CPUs and accelerators compared with a
native Pthreads implementation. Further, we show that we achieve additional speedup and cost reduction by running
on GPUs without any hardware-specific code optimisation. | en_GB |
dc.description.sponsorship | JE gratefully acknowledges the support through EPSRC and the President’s Scholarship of Imperial College London.
MG acknowledges support from the PRISM project under EPSRC grant EP/L000407/1. MT acknowledges
Airbus and EPSRC for funding under an industrial CASE studentship. DM acknowledges support from the EU Horizon
2020 project ExaFLOW (grant 671571). The Quadro P5000 GPU used for this research was kindly donated by
the NVIDIA Corporation. | en_GB |
dc.identifier.citation | Published online 5 April 2018 | en_GB |
dc.identifier.doi | 10.1016/j.cpc.2018.03.025 | |
dc.identifier.uri | http://hdl.handle.net/10871/32270 | |
dc.language.iso | en | en_GB |
dc.publisher | Elsevier | en_GB |
dc.rights | © 2018 The Authors. Published by Elsevier B.V. Open Access funded by Engineering and Physical Sciences Research Council. Under a Creative Commons license: https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | high-order mesh optimisation | en_GB |
dc.subject | architecture-independent programming model | en_GB |
dc.subject | Kokkos | en_GB |
dc.subject | portability | en_GB |
dc.subject | parallel hardware | en_GB |
dc.subject | variational framework | en_GB |
dc.title | Accelerating high-order mesh generation with an architecture-independent programming model | en_GB |
dc.type | Article | en_GB |
dc.identifier.issn | 0010-4655 | |
dc.description | This is the author accepted manuscript. The final version is available from Elsevier via the DOI in this record | en_GB |
dc.identifier.journal | Computer Physics Communications | en_GB |