You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
NVPTX currently lowers atomixrmw on <4 x float> as a call to __atomic_compare_exchange_16 which does not exist on the GPU: https://godbolt.org/z/ovf4cqKK5
Newer GPUs do have support for vectorized atomic ops on some data types, but on the older GPUs they must be lowered without relying on runtime.