Icare3D Blog: CUDA "volatile trick"

CUDA "volatile trick"

Apr 19, 2010 at 1:54 AM Labels: CUDA

A very useful trick found on the CUDA forum.

Very often, the CUDA compiler inline the operations needed to compute the value of a variable used at several places, instead of keeping the variable in a register. This can be a good strategy in some situations, but there is also many cases where it brings register usage up unnecessarily and duplicates instructions. To prevent this, the "volatile" keyword can be used when the variable is declared, forcing it to be really kept and reused.

This trick also work with constant variables (and shared memory) which would otherwise get loaded into registers over and over when accessed at several places.

It clearly reduces the number of virtual registers allocated at the PTX level, which helps a lot for the real register allocation phase that happens later during the transform to cubin. However, be careful not using it with constantly indexed arrays for instance, they would be put in local memory.

More info there:
http://forums.nvidia.com/index.php?showtopic=89573
http://forums.nvidia.com/index.php?showtopic=99209

1 Comment for "CUDA "volatile trick""

Fuat Geleri says:
January 24, 2013 at 8:25 PM

Nice explanation. I will try it.

Icare3D Blog

Research, Computer Graphics and GPU

Search:

Pages

CUDA "volatile trick"

1 Comment for "CUDA "volatile trick""

Post a Comment

About Me

Blog Archive

Twitter updates

Labels

Blog links

Favorite websites

Recent Comments

Followers