PyFR plugin in-memory execution model

AlmostSurelyRob · 15 February 2026 16:10

I now have some semi-production runs and I am looking to optimise collection of statistics, integrals and point sampling information. I was warned that this will be costly in terms of performance, but I was very surprised by how much. The utilisation of GPU dropped from 90% to almost 0% in the first iteration of my setup.

I wanted to understand this a bit better. Do plugins copy data to CPU for post-processing for tavg, sampler and fluidforce?

Practically, I can obviously decrease the frequency of collecting snapshots for averaging and increase the dt-out, so this is not I think an issue, though any advice will be greatly appreciated.

fdw · 15 February 2026 17:29

All plugin execution is performed on the CPU and is single threaded. Thus you pay the price of a copy and performance for calculations is typically two orders of magnitude worse (GPU memory bandwidth vs. single core memory bandwidth). See the following guidance in the performance tuning section:

As PyFR is an explicit code which is almost always CFL limited running plugins frequently is wasteful.

Regards, Freddie.

Topic		Replies	Views
How to run PyFR on CPU/multicores only [ Without CUDA/OPENCL] General	4	447	8 July 2015
What would be a good explanation of PyFR having a good utilization of GPU acceleration? General	1	250	25 July 2019
Poor TGV performance with GPU and MPI Cases hpc , cuda	5	272	1 June 2023
Regarding PyFR on Multi GPU Just Starting	1	231	3 November 2015
Running on a single cpu core? Just Starting	3	453	29 July 2016

PyFR plugin in-memory execution model

Related topics