This repository was archived by the owner on Mar 20, 2023. It is now read-only.

Support async execution in OpenMP wherever it's supported #725

Draft

iomaganaris wants to merge 11 commits into master

from magkanar/hackathon_openmp_async

Draft

Support async execution in OpenMP wherever it's supported #725

iomaganaris wants to merge 11 commits into master from magkanar/hackathon_openmp_async

Conversation

@iomaganaris

Copy link

Contributor

@iomaganaris iomaganaris commented Dec 21, 2021 •

edited

Loading

Description

Added asynchronous execution of kernels in multiple GPU streams.
Still there are some constructs that the compiler doesn't support:

#pragma omp target update to(<variable>) depend(inout: stream) nowait is not working, even if update from is working. There is an internal compiler error whenever depend(..) nowait is added to the to clause.

coreneuron::nrn_fixed_step_lastpart(coreneuron::NrnThread *):
 386, Taskwait
 Generating update to(nth->_t)
/gpfs/bbp.cscs.ch/ssd/apps/hpc/jenkins/pulls/1392/deploy/externals/2021-12-10/linux-rhel7-x86_64/gcc-9.3.0/nvhpc-21.11-qhk3q2/Linux_x86_64/21.11/compilers/share/llvm/bin/opt: /gpfs/bbp.cscs.ch/ssd/slurmTmpFS/magkanar/140832/nvc++xfYwftRovDgD.ll:144924:43: error: use of undefined value '%.d0009.addr'
 %41 = bitcast [1 x %struct.struct_deps]* %.d0009.addr to i8*, !dbg !120921

#pragma omp taskwait depend(inout: stream) is not working even if it's referenced in an NVIDIA presentation

How to test this?

module load unstable
module load cmake git flex bison python-dev hpe-mpi/2.25.hmpt
module unload hpe-mpi/2.22.hmpt py-mpi4py
module load caliper
module unload cuda/11.0.2
module load gcc
module load boost
module use /gpfs/bbp.cscs.ch/ssd/apps/hpc/jenkins/pulls/1392/deploy/compilers/2021-12-10/modules/tcl/linux-rhel7-x86_64
module use /gpfs/bbp.cscs.ch/ssd/apps/hpc/jenkins/pulls/1392/deploy/externals/2021-12-10/modules/tcl/linux-rhel7-x86_64
module load nvhpc/21.11 cuda/11.5.1
cmake .. \
 -DCMAKE_INSTALL_PREFIX=./install \
 -DCORENRN_ENABLE_TIMEOUT=OFF \
 -DNRN_ENABLE_INTERVIEWS=OFF \
 -DNRN_ENABLE_RX3D=OFF \
 -DNRN_ENABLE_MPI=ON \
 -DCORENRN_ENABLE_OPENMP=ON \
 -DNRN_ENABLE_CORENEURON=ON \
 -DCORENRN_ENABLE_GPU=ON \
 -DCORENRN_ENABLE_NMODL=ON \
 -DCORENRN_NMODL_DIR=<nmodl_dir> \
 -DNRN_ENABLE_PYTHON=ON \
 -DPYTHON_EXECUTABLE=$(which python3) \
 -DNRN_ENABLE_TESTS=OFF \
 -DCORENRN_ENABLE_UNIT_TESTS=OFF \
 -DCMAKE_C_COMPILER=$CC \
 -DCMAKE_CXX_COMPILER=$CXX \
 -DCMAKE_CUDA_COMPILER=nvcc \
 -DCMAKE_BUILD_TYPE=RelWithDebInfo \
 -DCORENRN_ENABLE_CALIPER_PROFILING=ON \
 -DCORENRN_ENABLE_OPENMP_OFFLOAD=ON \
 -DCMAKE_CXX_FLAGS="-Minfo=accel -gopt -tp=skylake-avx512"
cmake --build . --parallel 40 --target install

Test System

OS: RedHat
Compiler: NVHPC 21.11
Version: hackathon_main
Backend: GPU

This was referenced Dec 21, 2021

Changes to support async execution on GPU with OpenACC BlueBrain/mod2c#75

Draft

Changes for async execution of OpenACC and OpenMP BlueBrain/nmodl#788

Closed

@bbpbuildbot

Copy link

Collaborator

bbpbuildbot commented Dec 21, 2021

@olupton olupton mentioned this pull request

Dec 23, 2021

Improve OpenMP offload implementation #729

Open

iomaganaris and others added 11 commits

December 23, 2021 12:11

@iomaganaris @olupton


 Added stream id vector in NrnThread

0721b32

@iomaganaris @olupton


 Fixed openacc async clauses

a36d21c

@iomaganaris @olupton


 Updated nmodl and mod2c submodules

3f3f773

@iomaganaris @olupton


 Fixed issues with missing parenthesis

6060f2c

@iomaganaris @olupton


 More small fixes

60e3d3a

@iomaganaris @olupton


 Fixed openacc async

d6bf37c

@iomaganaris @olupton


 First working commit of openmp async execution

aac0915

@iomaganaris @olupton


 Added depend in update from clauses

7a230ba

@iomaganaris @olupton


 Small indentation fix

@iomaganaris @olupton


 Fixed clang-format

79d0cfc

@olupton


 Update NMODL after rebase.

6b90913

@olupton olupton force-pushed the magkanar/hackathon_openmp_async branch from 4f6675e to 6b90913 Compare

December 23, 2021 11:20

@olupton olupton changed the base branch from hackathon_main to master

December 23, 2021 11:20

@bbpbuildbot

Copy link

Collaborator

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support async execution in OpenMP wherever it's supported #725

Are you sure you want to change the base?

Support async execution in OpenMP wherever it's supported #725

Uh oh!

Conversation

@iomaganaris iomaganaris commented Dec 21, 2021 •

edited

Loading

Uh oh!

Uh oh!

bbpbuildbot commented Dec 21, 2021

Uh oh!

bbpbuildbot commented Dec 23, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Support async execution in OpenMP wherever it's supported #725

Are you sure you want to change the base?

Support async execution in OpenMP wherever it's supported #725

Uh oh!

Conversation

@iomaganaris iomaganaris commented Dec 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bbpbuildbot commented Dec 21, 2021

Uh oh!

bbpbuildbot commented Dec 23, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

@iomaganaris iomaganaris commented Dec 21, 2021 •

edited

Loading