14–18 Jul 2015
Kobe International Conference Center
Asia/Tokyo timezone

Grid: A next generation C++ library for data parallel QCD

15 Jul 2015, 18:30
2h 30m
4th floor (Kobe International Conference Center)

4th floor

Kobe International Conference Center

6-9-1 Minatojima-nakamachi, Chuo-ku, Kobe, Hyogo 650-0046, Japan
Board: 25
Poster Algorithms and Machines Poster Session

Speaker

Dr Azusa Yamaguchi (University of Edinburgh)

Description

We discuss progress and performance from the development of the Grid data parallel interface for cartesian fields of tensor data types. The library is suitable for the simulation of QCD including multiple grids. Wherever appropriate the interface is quite similar to that of QDP++, but the library makes use of C++11 features to reduce the volume of code, compared to QDP++ while bringing greater generality and greater performance. While the library simultaneously targets MPI, OpenMP and SIMD parallelism, the SIMD optimisation is notably flexible. Performance is substantially improved through the use of a data layout transformation and benchmark results are presented for simple Lattice QCD operations and the Wilson operator under AVX, AVX2 and AVX512 instruction set targets. This work is performed as part of the Intel Parallel Computing Centre in the Higgs Centre for Theoretical Physics at Edinburgh. Submitted by Peter Boyle on 14 May 2015 at 23:48 Id: 5 Last modification: 14 May 2015 23:48 Contribution type: Talk Content We discuss progress and performance from the development of a data parallel interface for cartesian fields of tensor data types. The library is suitable for the simulation of QCD including multiple grids. Wherever appropriate the interface is quite similar to that of QDP++, but the library makes use of C++11 features to reduce the volume of code, compared to QDP++ while bringing greater generality and greater performance. While the library simultaneously targets MPI, OpenMP and SIMD parallelism, the SIMD optimisation is notably flexible. Performance is substantially improved through the use of a data layout transformation and benchmark results are presented for simple Lattice QCD operations and the Wilson operator under AVX, AVX2 and AVX512 instruction set targets.

Primary author

Dr Azusa Yamaguchi (University of Edinburgh)

Co-author

Prof. Peter Boyle (University of Edinburgh)

Presentation materials