Low CPU utilisation when running multiple PyFR simulations with OpenMP backend on macOS

Cassian · 8 August 2025 14:34

Hi PyFR team,

I’m currently running PyFR using the openmp backend on macOS (Apple Silicon), and I’ve noticed that CPU utilisation remains very low, even when running a single simulation. I’m monitoring with both htop and Activity Monitor, and usage rarely exceeds the equivalent of 1–2 logical cores, regardless of problem size or simulation duration.

I’ve tried the following:

Explicitly setting OMP_NUM_THREADS=4 (and higher)
Verifying that libxsmm.dylib is correctly linked via DYLD_LIBRARY_PATH
Using both small and moderately sized meshes
Running simulations in isolation (only one PyFR instance active)

Despite this, CPU usage remains far below expected levels for a supposedly multi-threaded workload. When running multiple simulations, usage scales somewhat, but each individual run still uses very little CPU, and wall-clock time is long.

Questions:

Is this behaviour expected on macOS with the OpenMP backend?
Are there specific environment variables or backend limitations (e.g. with LIBXSMM, thread affinity, etc.) that may limit CPU usage on macOS or Apple Silicon?
Would moving to Linux (e.g. on an x86_64 machine) improve OpenMP parallelism and performance?
Are there any backend-specific optimisations (e.g. compiling LIBXSMM with special flags) you would recommend?

Any advice would be greatly appreciated. Thanks again for developing such a powerful tool.

Best regards,

Cassian

fdw · 8 August 2025 15:45

Performance should be reasonable with the OpenMP backend for simulations of modest size (such as the two 3D test cases which come with PyFR). Thread overhead is somewhat higher on macOS and Apple M series chips than on Linux (ARM or x86), so there can be a benefit to setting OMP_NUM_THREADS=1, partitioning the mesh (say into 4 parts), and then running with mpirun -np 4 …. I would only expect this to make a difference for small simulations, however.

Regards, Freddie.
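
For readers who want to script the approach described in the reply above, here is a minimal Python sketch, not an official PyFR recipe. It only assembles the launch: OMP_NUM_THREADS pinned to 1 and an mpirun -np prefix with one rank per mesh partition. The helper names build_mpi_launch and launch are hypothetical, and pyfr_args is a placeholder for whatever the reply elides as "…" (your own PyFR command line, run against a mesh already partitioned into the same number of parts).

```python
import os
import subprocess


def build_mpi_launch(n_ranks, pyfr_args, threads_per_rank=1):
    """Return (argv, env) for an MPI launch with one rank per mesh partition.

    pyfr_args is a hypothetical placeholder: the caller supplies the actual
    PyFR command line; nothing about it is assumed here.
    """
    if n_ranks < 1:
        raise ValueError("n_ranks must be at least 1")

    # Copy the current environment and limit each rank to the requested
    # number of OpenMP threads (1 by default, as suggested in the reply).
    env = dict(os.environ)
    env["OMP_NUM_THREADS"] = str(threads_per_rank)

    # Prefix the user's command with mpirun; n_ranks should match the
    # number of parts the mesh was split into.
    argv = ["mpirun", "-np", str(n_ranks), *pyfr_args]
    return argv, env


def launch(n_ranks, pyfr_args):
    """Run the command and raise CalledProcessError on a non-zero exit."""
    argv, env = build_mpi_launch(n_ranks, pyfr_args)
    return subprocess.run(argv, env=env, check=True)
```

Calling launch(4, pyfr_args) would then correspond to the 4-part example in the reply. Timing that run against a single-process run with a higher OMP_NUM_THREADS shows whether the thread overhead matters for a given case.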

Cassian · 10 September 2025 05:18

Hi Freddie,

Thanks for your reply, and sorry for my late response. I managed to compare the two results, and they are in good agreement. So thanks for that!

Regards, Cassian
