Show simple item record

dc.contributor.authorMoxey, D
dc.contributor.authorCantwell, CD
dc.contributor.authorKirby, RM
dc.contributor.authorSherwin, SJ
dc.date.accessioned2018-03-05T13:04:41Z
dc.date.issued2016-10-01
dc.description.abstractAs computing hardware evolves, increasing core counts mean that memory bandwidth is becoming the deciding factor in attaining peak performance of numerical methods. High-order finite element methods, such as those implemented in the spectral/hp framework Nektar++, are particularly well-suited to this environment. Unlike low-order methods that typically utilise sparse storage, matrices representing high-order operators have greater density and richer structure. In this paper, we show how these qualities can be exploited to increase runtime performance on nodes that comprise a typical high-performance computing system, by amalgamating the action of key operators on multiple elements into a single, memory-efficient block. We investigate different strategies for achieving optimal performance across a range of polynomial orders and element types. As these strategies all depend on external factors such as BLAS implementation and the geometry of interest, we present a technique for automatically selecting the most efficient strategy at runtime.en_GB
dc.description.sponsorshipWe thank D. Ekelschot and M. Turner for their assistance in generating the mesh and parameters for the simulation of Section 6. We also thank F. Witherden for initial discussions motivating this study. This work was funded in part by support from the libHPC II EPSRC project under grant EP/K038788/1. DM additionally acknowledges support under the Laminar Flow Control Centre funded by Airbus/EADS and EPSRC under grant EP/I037946. SJS acknowledges Royal Academy of Engineering support under their research chair scheme. We thank the Imperial College High Performance Computing Service for computing time used to calculate the results seen in Section 6. We additionally acknowledge access to ARCHER with support from the UK Turbulence Consortium under EPSRC grant EP/L000261/1.en_GB
dc.identifier.citationVol. 310, pp. 628 - 645en_GB
dc.identifier.doi10.1016/j.cma.2016.07.001
dc.identifier.urihttp://hdl.handle.net/10871/31820
dc.language.isoenen_GB
dc.publisherElsevieren_GB
dc.rights(C) 2016 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).en_GB
dc.subjectSpectralen_GB
dc.subjectelement methoden_GB
dc.subjectHigh-order finite elementsen_GB
dc.subjectLinear algebra optimisationen_GB
dc.titleOptimising the performance of the spectral/hp element method with collective linear algebra operationsen_GB
dc.typeArticleen_GB
dc.date.available2018-03-05T13:04:41Z
dc.identifier.issn0045-7825
dc.descriptionThis is the final version of the article. Available from Elsevier via the DOI in this record.en_GB
dc.identifier.journalComputer Methods in Applied Mechanics and Engineeringen_GB


Files in this item

This item appears in the following Collection(s)

Show simple item record