LLVM and SPIRV-LLVM-Translator pulldown (WW32 2025) #19716

iclsrc · 2025-08-05T18:01:05Z

LLVM: llvm/llvm-project@1fbe87b
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@a7eaac8

…8597) Extends reduction support for `do concurrent`, in particular, for associating names. Consider the following input:

```fortran
subroutine dc_associate_reduce
  integer :: i
  real, allocatable, dimension(:) :: x

  associate(x_associate => x)
    do concurrent (i = 1:10) reduce(+: x_associate)
    end do
  end associate
end subroutine
```

The declaration of `x_associate` is emitted as follows:

```mlir
%13:2 = hlfir.declare %10(%12) {uniq_name = "...."} : (!fir.heap<!fir.array<?xf32>>, !fir.shapeshift<1>) -> (!fir.box<!fir.array<?xf32>>, !fir.heap<!fir.array<?xf32>>)
```

where the HLFIR base type is an array descriptor (i.e. the allocatable/heap attribute is dropped as stipulated by the spec; section 11.1.3.3).

The problem here is that `declare_reduction` ops accept only reference types. This restriction is already partially handled for `fir::BaseBoxType`s by allocating a stack slot for the descriptor and storing the box in that stack allocation. We have to modify this a little bit for `associate` since the HLFIR and FIR base types are different (unlike most scenarios).

This PR adds a summary and synthetic children for `std::unique_ptr` from MSVC's STL ([NatVis](https://github.com/microsoft/STL/blob/313964b78a8fd5a52e7965e13781f735bcce13c5/stl/debugger/STL.natvis#L285-L303)). As with libc++, the deleter is only shown if it's non-empty. Tested both the shared_ptr and unique_ptr tests on Windows. Towards #24834.
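To make "the deleter is only shown if it's non-empty" concrete, here is a minimal C++ sketch of the distinction between a stateless and a stateful deleter. It is not the formatter's implementation: how LLDB actually decides emptiness is not described above, and `CountingDeleter`, `has_empty_deleter`, and `run_counting_deleter` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <memory>
#include <type_traits>

// Hypothetical stateful deleter: it carries data, so a debugger summary
// would have something worth showing for it.
struct CountingDeleter {
  int *counter; // incremented each time an object is destroyed
  void operator()(int *p) const {
    ++*counter;
    delete p;
  }
};

// True when the deleter type is a stateless class such as
// std::default_delete (assumption: emptiness is judged by the type alone).
template <typename Ptr>
constexpr bool has_empty_deleter() {
  return std::is_empty_v<typename Ptr::deleter_type>;
}

// Builds and destroys one pointer with the stateful deleter and returns
// how many times the deleter ran.
inline int run_counting_deleter() {
  int calls = 0;
  {
    std::unique_ptr<int, CountingDeleter> p(new int(42),
                                            CountingDeleter{&calls});
    assert(*p == 42);
  }
  return calls;
}
```

With the default deleter there is no state to display, whereas `CountingDeleter` holds a pointer that a summary could usefully expose.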

At this time (immediately prior to llvm21 branching) we haven't instrumented coroutine generation to identify the "key" instructions of things like co_return and similar. This will lead to worse stepping behaviours, as there won't be any key instruction for those lines. This patch removes the key-instructions flag from the DISubprograms for coroutines, which will cause AsmPrinter to use the "old" / existing linetable stepping behaviour, avoiding a regression until we can instrument these constructs. (I'm going to post on discourse about whether this is a good idea or not in a moment)

…tion (#148614) This is done by other backends at the start of this function, for example AArch64Disassembler::getInstruction. Not setting it means you hit asserts in MCDisassembler::tryAddingSymbolicOperand and MCDisassembler::tryAddingPcLoadReferenceComment when there is a symbolizer set. Which happened to me while debugging a SystemZ program using LLDB. As the only good way to hit this path is from C++, I've copied X86's disassembler unit tests and added just enough to hit an assert if the comment stream is not set.

Preserve the argument-clause for `warn-unused-result` when under clang:: scope. We are not touching gnu:: scope for now as it's an error for GCC to have that string. Personally I think it would be ok to relax it here too as we are not introducing breakage to currently passing code, but feedback is to go slowly about it.

… (#148565) If I understand correctly there was a point where we used to need this before it was implied by Zvl*b. Now that it is though and we use -mattr=+v in pretty much every test we can remove it. In unroll-in-loop-vectorizer.ll we can force a VF of 1 instead by using -force-vector-width=1, and in scalable-basics.ll the two RUN lines were the same so I merged them.

This patch contains fixes for various nits mentioned in #147200:

- This patch removes the `bit.` prefix in the op mnemonic. The operation names now directly correspond to the builtin function names, except for `bswap`, which is represented by `cir.byte_swap` for more clarity.
- Since all bit operations are `SameOperandsAndResultType`, this patch updates their assembly format and avoids spelling out the operand type twice.

In #125921, the changes requested by P2372R3 were completed and tested together with corresponding `chrono` types. But that PR didn't mention P2372R3. The `__cpp_lib_format` FTM was even bumped by an earlier PR #98275. This PR confirms that P2372R3 was completed in LLVM 21 (together with P1361R2). Closes #100043

To allow all C++ features in constexpr contexts we need to track constexpr initializers of variables. The mentioned commit moved some code to handle consteval better but we need the code where it used to be since it is not only consteval that we care about.

jsji · 2025-08-06T21:24:27Z

sycl-e2e is failing for CUDA in HandleVirtRegUse.

@intel/llvm-reviewers-cuda Can we get someone to have a look? Thanks!

Simple reproducer is:

bin/clang++  -Werror -fsycl -fsycl-targets=nvptx64-nvidia-cuda ../sycl/test-e2e/Basic/built-ins/math_raw_ptr.cpp -nogpulib

#11 0x00007f902051b929 llvm::LiveVariables::HandleVirtRegUse(llvm::Register, llvm::MachineBasicBlock*, llvm::MachineInstr&) 
#12 0x00007f902051c61c llvm::LiveVariables::runOnInstr(llvm::MachineInstr&, llvm::SmallVectorImpl<llvm::Register>&, unsigned int)

jsji · 2025-08-08T00:09:44Z

Never mind, I had a look myself and found the fix. :)

We may still need to keep CopyToReg even after folding uses into vector loads, since the original register may be used in other blocks. Partially reverts 1fdbe69

jsji · 2025-08-08T02:48:13Z

This is ready for review.

Update llc-pipeline-npm.ll @intel/dpcpp-tools-reviewers
[NFC] Fix Werror=maybe-uninitialized in GCC 13 build @intel/llvm-reviewers-runtime
[NFC] Update namespace in warnings for properties_kernel_negative_dev… @intel/llvm-reviewers-runtime

sarnex

reviewed remaining changes

sarnex · 2025-08-08T14:27:17Z

/merge

bb-sycl · 2025-08-08T14:27:51Z

Fri 08 Aug 2025 02:27:51 PM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2025-08-08T14:38:35Z

Fri 08 Aug 2025 02:38:34 PM UTC --- Merge the branch in this PR to base automatically. Will close the PR later.

ergawy and others added 30 commits July 14, 2025 12:18

[KeyInstr] Add release note & update option (#148244)

34bb38f

Make the option visible, improve the help text, and add a release note.

[lldb][test] TestProcessSaveCoreMinidump: Rename duplicate test-case …

fa14361

…(#148600) Ran my python script from llvm/llvm-project#97043 over the repo again and there was 1 duplicate test-case that has been introduced since I last did this. This patch renames that test.

[AArch64] Corrected Latency Descriptions for NeoverseV2 (#147339)

70bc7d1

Update the Neoverse V2 Scheduler to reflect the correct latencies along with having updated the relevant mca tests.

[LLVM][CodeGen] Ensure optimizeIncrementingWhile only accepts scalabl…

6c2e26a

…e vectors. (#148351) Fixes llvm/llvm-project#148347

[RISCV] Add ISel patterns for Xqciac QC_SHLADD instruction (#148256)

0ae1506

Add a couple of patterns to generate the Xqciac QC_SHLADD shift left and add immediate instruction.

RuntimeLibcalls: Remove unused variable for atomic libcalls (#148599)

c4c56a0

This was obsoleted by ed1ee9a

[mlir][spirv]: Add OpImageFetch (#145873)

87e39c3

Add the missing `spirv::ImageFetchOp` operation to the SPIR-V MLIR dialect ODS with appropriate testing, including negative testing of the verifiers.

Signed-off-by: Jack Frankland <[email protected]>

[Clang] Do not emit -Wmissing-noreturn when [[noreturn]] is present (…

afffa0d

…#148552) Fix a false positive warning which was introduced by #146234.
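For context, a minimal sketch of a function that already carries `[[noreturn]]`, for which a `-Wmissing-noreturn` suggestion would be redundant. The exact construct that triggered the false positive in #146234 is not stated here, so this only illustrates the general situation; `fail` and `checked_index` are hypothetical names.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>

// Already marked [[noreturn]]: every path leaves the function by throwing,
// so suggesting the attribute again would add nothing.
[[noreturn]] void fail(const std::string &what) {
  throw std::runtime_error(what);
}

// Hypothetical caller: returns the validated index or never returns.
int checked_index(int i, int size) {
  assert(size >= 0);
  if (i < 0 || i >= size)
    fail("index out of range");
  return i;
}
```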

[IR] Add a test for f128 libm libcall lowering (NFC) (#148308)

d214f07

`f128` intrinsic functions from libm sometimes lower to `long double` library calls when they instead need to be `f128` versions. Add a generic test demonstrating current behavior.

[Offload] Add tagged type to enumerator docs (#147998)

b520d21

When `EnumRec::isTyped()` is true, include the `EnumValueRec::getTaggedType()` to the documentation.

[mlir][transform] Fix transform dialect tutorial chapter 1 (#147983)

efa30f4

In the transform dialect tutorial chapter 1, there were some errors that prevented the example from running. This PR fixes them.

Co-authored-by: Renato Golin <[email protected]>

[SLP,AArch64] Update build-vector test to actually build vectors.

eb4de57

Update test with all zero constant input values which get folded during IR construction to actually use different input values, which require materializing build vectors.

[Offload] Return error rather than dropping it (#148609)

a71187e

[mlir][bazel] Port 0a34309

7e03c46

[ARM][ fp16-promote.ll - cleanup CHECKS to be consistently inside eac…

d8aa4a6

…h test. NFC.

[gn build] Port b9ccc0c

ea8ff79

[libc++] Introduce the _LIBCPP_VERBOSE_TRAP macro (#148262)

5951c44

Split out the calls to __builtin_verbose_trap into a separate header. This is just a refactoring to make the code a bit more structured.

[GlobaISel] Allow expanding of sdiv -> mul by constant (#146504)

806028a

Allows expand of sdiv->mul by constant combine for the general case. Previously this was only occurring in the exact case. This is part of the resolution to issue #118090
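As an illustration of what an sdiv-to-mul rewrite computes in the general (non-exact) case, here is a small C++ sketch of the classic multiply-high technique for one fixed divisor, 3. It is not the GlobalISel combine itself, which is not shown above; `sdiv3` and `matches_division` are hypothetical names, and the constant is the textbook magic number for this divisor.

```cpp
#include <cassert>
#include <cstdint>

// Signed division by the constant 3 without a divide instruction.
// 0x55555556 is ceil(2^32 / 3), the usual magic number for this divisor.
constexpr int32_t sdiv3(int32_t n) {
  const int64_t product = int64_t{0x55555556} * n;
  // High 32 bits of the product. Right-shifting a negative value is
  // arithmetic on mainstream compilers (and guaranteed since C++20).
  const int32_t hi = static_cast<int32_t>(product >> 32);
  // Add the sign bit so the quotient rounds toward zero for negative n.
  return hi + static_cast<int32_t>(static_cast<uint32_t>(n) >> 31);
}

// Compares the multiply form against the built-in operator over [lo, hi].
inline bool matches_division(int32_t lo, int32_t hi) {
  assert(lo <= hi);
  for (int64_t n = lo; n <= hi; ++n)
    if (sdiv3(static_cast<int32_t>(n)) != static_cast<int32_t>(n) / 3)
      return false;
  return true;
}
```

The sign-bit correction is what the general case needs beyond the exact case: when the dividend is not a multiple of the divisor, the high half of the product rounds toward negative infinity and has to be adjusted to match truncating division.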

[clang][scan-deps] fix new test for readonly work trees

e074044

Our setup runs tests with bazel in such a way that the work tree is readonly, which was causing this test to fail because it couldn't write the .o file. This fixes that, which was new in 15c3793 when this test was introduced.

[libc][math] Refactor ldexpf128 implementation to header-only in src/…

0ad2574

…__support/math folder. (#147895) Part of #147386 in preparation for: https://discourse.llvm.org/t/rfc-make-clang-builtin-math-functions-constexpr-with-llvm-libc-to-support-c-23-constexpr-math-functions/86450

[Offload] Skip event tests on AMDGPU (#148632)

508f9a0

Add `OffloadDeviceTest::getPlatformBackend()` and use it to skip event tests which currently fail on AMDGPU due to:

```
OL_ERRC_UNIMPLEMENTED: synchronize event not implemented
```

iclsrc had a problem deploying to WindowsCILock August 5, 2025 18:01 — with GitHub Actions Failure

iclsrc temporarily deployed to WindowsCILock August 5, 2025 18:53 — with GitHub Actions Inactive

iclsrc temporarily deployed to WindowsCILock August 5, 2025 19:03 — with GitHub Actions Inactive

jsji and others added 5 commits August 6, 2025 12:11

Update llc-pipeline-npm.ll

2f58eb0

The difference is additional passes: FPBuiltinFnSelectionPass

[SPIRV] id and range builtins integration for SYCL (#19639)

f4d81f6

27c9b55 was added upstream, but because it has limited SYCL support, tests can't run properly.

Fix SemaSYCL/uses_aspects.cpp

1acd8dc

[NFC] Update namespace in warnings for properties_kernel_negative_dev…

ca6af1e

…ice.cpp

[NFC] Fix Werror=maybe-uninitialized in GCC 13 build

e87c4f0

jsji had a problem deploying to WindowsCILock August 6, 2025 19:41 — with GitHub Actions Failure

jsji temporarily deployed to WindowsCILock August 6, 2025 19:41 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock August 6, 2025 20:15 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock August 6, 2025 20:45 — with GitHub Actions Inactive

[NVPTX] don't erase CopyToRegs when folding movs into loads (#149393)

2067a4c

We may still need to keep CopyToReg even after folding uses into vector loads, since the original register may be used in other blocks. Partially reverts 1fdbe69

jsji temporarily deployed to WindowsCILock August 8, 2025 00:21 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock August 8, 2025 00:54 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock August 8, 2025 01:03 — with GitHub Actions Inactive

sarnex approved these changes Aug 8, 2025

bb-sycl approved these changes Aug 8, 2025

bb-sycl merged commit 4946b5d into sycl Aug 8, 2025
57 of 58 checks passed
