I'd like to allocate executable memory in CUDA ...

There is no such thing as user-allocatable "executable" memory. All the empirical evidence I have seen, and the architecture whitepapers NVIDIA has released over the years, suggest that the GPU has a programmable MMU and that NVIDIA has chosen to logically divide GPU DRAM into regions for different functions (global memory, constant memory, local memory, code pages). The last of these, the code pages, appears to be fully inaccessible from user code by design.
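
To illustrate the contrast, here is a minimal sketch (assuming a POSIX host) of what a CPU JIT does to obtain executable memory, next to what the CUDA runtime actually hands you. Nothing on the CUDA side corresponds to PROT_EXEC; this is an illustration of the argument above, not an NVIDIA-documented statement.

    // Minimal sketch; assumes a POSIX host. Error checking omitted.
    #include <sys/mman.h>
    #include <cuda_runtime.h>

    int main(void)
    {
        // CPU: a JIT can request writable + executable anonymous pages
        // (modulo W^X policies on hardened systems)
        void *cpu_code = mmap(NULL, 4096, PROT_READ | PROT_WRITE | PROT_EXEC,
                              MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);

        // GPU: cudaMalloc returns ordinary global (data) memory. There is
        // no flag to mark it executable, and no API to transfer control
        // to an arbitrary device address.
        void *gpu_data = NULL;
        cudaMalloc(&gpu_data, 4096);

        cudaFree(gpu_data);
        munmap(cpu_code, 4096);
        return 0;
    }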

write SASS/CUBIN code there, and then execute this code.

I don’t see how that could work either. The CUDA execution model requires static allocation of global symbols, registers, local memory, and constant memory in a linking phase which must be performed before code is loaded onto the GPU and executed. This linking phase can be done at compile time or at runtime, but it must be done; that is the purpose of the nvjitlink API which you reject in your question. The GPU runtime must then take the resource requirements of the linked code payload, reserve the necessary register file pages, statically defined shared memory, and so on, and run the code when (or if) those resources are available on the target device. There is, to the best of my knowledge, no way to run code whose resource requirements are not known a priori and for which the runtime has not reserved the necessary GPU resources.
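
For completeness, the supported way to take a fully linked SASS/CUBIN image (produced ahead of time, or assembled at runtime with nvjitlink) and execute it is the driver API module loader. A minimal sketch follows; the file name "kernel.cubin" and the kernel symbol "mykernel" are placeholder assumptions, and error checking is omitted:

    // Minimal sketch of the supported path: load a linked CUBIN image
    // at runtime via the CUDA driver API. "kernel.cubin" and "mykernel"
    // are hypothetical names for illustration.
    #include <cuda.h>
    #include <stdio.h>
    #include <stdlib.h>

    int main(void)
    {
        // Read a linked CUBIN image from disk into host memory
        FILE *f = fopen("kernel.cubin", "rb");
        if (!f) return 1;
        fseek(f, 0, SEEK_END);
        long size = ftell(f);
        fseek(f, 0, SEEK_SET);
        void *image = malloc(size);
        fread(image, 1, size, f);
        fclose(f);

        CUdevice dev; CUcontext ctx; CUmodule mod; CUfunction fn;
        cuInit(0);
        cuDeviceGet(&dev, 0);
        cuCtxCreate(&ctx, 0, dev);

        // The driver performs the resource accounting described above
        // (registers, static shared memory, local memory) at load time
        cuModuleLoadData(&mod, image);
        cuModuleGetFunction(&fn, mod, "mykernel");

        // Launch one block of 32 threads with no arguments
        cuLaunchKernel(fn, 1, 1, 1, 32, 1, 1, 0, NULL, NULL, NULL);
        cuCtxSynchronize();

        cuModuleUnload(mod);
        cuCtxDestroy(ctx);
        free(image);
        return 0;
    }

Note that the module loader, not the user, decides where in GPU memory the code lands; no API in this path ever hands back a raw device pointer into the code pages.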

Finally, I would regard the ability to bypass all of the protections which NVIDIA has implemented in its driver and runtime, and to inject and run arbitrary code on the GPU, as a potential security flaw, and I would expect NVIDIA to eliminate it if such a vector were documented to exist.
