Skip to content

[NVPTX] atomicrmw on <4 x float> relies on __atomic_compare_exchange_16 #122760

@Artem-B

Description

@Artem-B

NVPTX currently lowers atomixrmw on <4 x float> as a call to __atomic_compare_exchange_16 which does not exist on the GPU:
https://godbolt.org/z/ovf4cqKK5

Newer GPUs do have support for vectorized atomic ops on some data types, but on the older GPUs they must be lowered without relying on runtime.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions