PoS - Proceedings of Science
Volume 390 - 40th International Conference on High Energy physics (ICHEP2020) - Parallel: Computing and Data Handling
Data Analysis with GPU-Accelerated Kernels
I. Dutta*, J. Pata, N. Lu, J.r. Vlimant, H. Newman, M. Spiropulu, C. Reissel and D. Ruini
Full text: pdf
Pre-published on: March 03, 2021
Published on: April 15, 2021
Abstract
At HEP experiments, processing billions of records of structured numerical data can be a bottleneckin the analysis pipeline. This step is typically more complex than current query languages allow,such that numerical codes are used. As highly parallel computing architectures are increasinglyimportant in the computing ecosystem, it may be useful to consider how accelerators such as GPUscan be used for data analysis. Using CMS and ATLAS Open Data, we implement a benchmarkphysics analysis with GPU acceleration directly in Python based on efficient computational kernelsusing Numba/LLVM, resulting in an order of magnitude throughput increase over a pure CPU-based approach. We discuss the implementation and performance benchmarks of the physicskernels on CPU and GPU targets. We demonstrate how these kernels are combined to a modernML-intensive workflow to enable efficient data analysis on high-performance servers and remarkon possible operational considerations.
DOI: https://doi.org/10.22323/1.390.0908
How to cite

Metadata are provided both in "article" format (very similar to INSPIRE) as this helps creating very compact bibliographies which can be beneficial to authors and readers, and in "proceeding" format which is more detailed and complete.

Open Access
Creative Commons LicenseCopyright owned by the author(s) under the term of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.