Optimising the performance of the spectral/hp element method with collective linear algebra operations

Moxey, D; Cantwell, CD; Kirby, RM; Sherwin, SJ

dc.contributor.author	Moxey, D
dc.contributor.author	Cantwell, CD
dc.contributor.author	Kirby, RM
dc.contributor.author	Sherwin, SJ
dc.date.accessioned	2018-03-05T13:04:41Z
dc.date.issued	2016-10-01
dc.description.abstract	As computing hardware evolves, increasing core counts mean that memory bandwidth is becoming the deciding factor in attaining peak performance of numerical methods. High-order finite element methods, such as those implemented in the spectral/hp framework Nektar++, are particularly well-suited to this environment. Unlike low-order methods that typically utilise sparse storage, matrices representing high-order operators have greater density and richer structure. In this paper, we show how these qualities can be exploited to increase runtime performance on nodes that comprise a typical high-performance computing system, by amalgamating the action of key operators on multiple elements into a single, memory-efficient block. We investigate different strategies for achieving optimal performance across a range of polynomial orders and element types. As these strategies all depend on external factors such as BLAS implementation and the geometry of interest, we present a technique for automatically selecting the most efficient strategy at runtime.	en_GB
dc.description.sponsorship	We thank D. Ekelschot and M. Turner for their assistance in generating the mesh and parameters for the simulation of Section 6. We also thank F. Witherden for initial discussions motivating this study. This work was funded in part by support from the libHPC II EPSRC project under grant EP/K038788/1. DM additionally acknowledges support under the Laminar Flow Control Centre funded by Airbus/EADS and EPSRC under grant EP/I037946. SJS acknowledges Royal Academy of Engineering support under their research chair scheme. We thank the Imperial College High Performance Computing Service for computing time used to calculate the results seen in Section 6. We additionally acknowledge access to ARCHER with support from the UK Turbulence Consortium under EPSRC grant EP/L000261/1.	en_GB
dc.identifier.citation	Vol. 310, pp. 628 - 645	en_GB
dc.identifier.doi	10.1016/j.cma.2016.07.001
dc.identifier.uri	http://hdl.handle.net/10871/31820
dc.language.iso	en	en_GB
dc.publisher	Elsevier	en_GB
dc.rights	(C) 2016 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).	en_GB
dc.subject	Spectral	en_GB
dc.subject	element method	en_GB
dc.subject	High-order finite elements	en_GB
dc.subject	Linear algebra optimisation	en_GB
dc.title	Optimising the performance of the spectral/hp element method with collective linear algebra operations	en_GB
dc.type	Article	en_GB
dc.date.available	2018-03-05T13:04:41Z
dc.identifier.issn	0045-7825
dc.description	This is the final version of the article. Available from Elsevier via the DOI in this record.	en_GB
dc.identifier.journal	Computer Methods in Applied Mechanics and Engineering	en_GB

Files in this item

Name:: 1-s2.0-S0045782516306739-main.pdf
Size:: 1.566Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Engineering

Show simple item record

Show Statistical Information