fix #17098, bad codegen from too many i1 casts #17225

JeffBezanson · 2016-07-01T01:51:03Z

Previously, we always represented Bool as i1 except in storage. Now we represent Bool as i8 most of the time, and truncate to i1 only just before use in branches.

This affected code using tuples of Bools. Previously, we always represented Bool as i1 except in storage. Now we represent Bool as i8 most of the time, and truncate to i1 only just before use in branches.

vtjnash · 2016-07-01T02:27:00Z

lgtm. this definitely seems like it'll be simpler

tkelman · 2016-07-01T05:04:21Z

So does this mean people could hide payloads in the upper bits of julia bools?

timholy · 2016-07-01T09:47:33Z

As part of this, it might be worth reverting #16687 and running nanosoldier.

JeffBezanson · 2016-07-01T15:49:00Z

Yes, I guess you could hide payloads. We could stop that pretty easily I think by zeroing the top bits whenever something is reinterpreted to Bool. However for now we need this performance.

yuyichao · 2016-07-01T16:35:58Z

This seems to have also fixed the SIMD regression caused by bd06d27. Not sure why it broke in the first place though....

This reverts commit 526695c. Jeff's fix to codegen on Bools, #17225, should make this unnecessary.

timholy · 2016-07-01T16:41:14Z

Should we do another nanosoldier run against 0.4?

Summary: Currently, InstCombine is already able to fold expressions of the form `logic(cast(A), cast(B))` to the simpler form `cast(logic(A, B))`, where logic designates one of `and`/`or`/`xor`. This transformation is implemented in `foldCastedBitwiseLogic()` in InstCombineAndOrXor.cpp. However, this optimization will not be performed if both `A` and `B` are `icmp` instructions. The decision to preclude casts of `icmp` instructions originates in r48715 in combination with r261707, and can be best understood by the title of the former one: > Transform (zext (or (icmp), (icmp))) to (or (zext (cimp), (zext icmp))) if at least one of the (zext icmp) can be transformed to eliminate an icmp. Apparently, it introduced a transformation that is a reverse of the transformation that is done in `foldCastedBitwiseLogic()`. Its purpose is to expose pairs of `zext icmp` that would subsequently be optimized by `transformZExtICmp()` in InstCombineCasts.cpp. Therefore, in order to avoid an endless loop of switching back and forth between these two transformations, the one in `foldCastedBitwiseLogic()` has been restricted to exclude `icmp` instructions which is mirrored in the responsible check: `if ((!isa<ICmpInst>(Cast0Src) || !isa<ICmpInst>(Cast1Src)) && ...` This check seems to sort out more cases than necessary because: - the reverse transformation is obviously done for `or` instructions only - and also not every `zext icmp` pair is necessarily the result of this reverse transformation Therefore we now remove this check and replace it by a more finegrained one in `shouldOptimizeCast()` that now rejects only those `logic(zext(icmp), zext(icmp))` that would be able to be optimized by `transformZExtICmp()`, which also avoids the mentioned endless loop. That means we are now able to also simplify expressions of the form `logic(cast(icmp), cast(icmp))` to `cast(logic(icmp, icmp))` (`cast` being an arbitrary `CastInst`). As an example, consider the following IR snippet ``` %1 = icmp sgt i64 %a, %b %2 = zext i1 %1 to i8 %3 = icmp slt i64 %a, %c %4 = zext i1 %3 to i8 %5 = and i8 %2, %4 ``` which would now be transformed to ``` %1 = icmp sgt i64 %a, %b %2 = icmp slt i64 %a, %c %3 = and i1 %1, %2 %4 = zext i1 %3 to i8 ``` This issue became apparent when experimenting with the programming language Julia, which makes use of LLVM. Currently, Julia lowers its `Bool` datatype to LLVM's `i8` (also see JuliaLang/julia#17225). In fact, the above IR example is the lowered form of the Julia snippet `(a > b) & (a < c)`. Like shown above, this may introduce `zext` operations, casting between `i1` and `i8`, which could for example hinder ScalarEvolution and Polly on certain code. Reviewers: grosser, vtjnash, majnemer Subscribers: majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D22511 Contributed-by: Matthias Reisinger git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@275989 91177308-0d34-0410-b5e6-96231b3b80d8

This reverts commit 526695c. Jeff's fix to codegen on Bools, JuliaLang#17225, should make this unnecessary.

fix #17098, bad codegen from too many i1 casts

f142192

This affected code using tuples of Bools. Previously, we always represented Bool as i1 except in storage. Now we represent Bool as i8 most of the time, and truncate to i1 only just before use in branches.

JeffBezanson merged commit 0b9364d into master Jul 1, 2016

yuyichao deleted the jb/fix17098 branch July 1, 2016 16:35

timholy added a commit that referenced this pull request Jul 1, 2016

Revert "Speed up iteration with numbers"

ac4a2ca

This reverts commit 526695c. Jeff's fix to codegen on Bools, #17225, should make this unnecessary.

timholy mentioned this pull request Jul 1, 2016

Revert "Speed up iteration with numbers" #17230

Merged

tkelman mentioned this pull request Jul 2, 2016

Determine the cause of codegen regression for iteration with numbers #16753

Closed

mfasi pushed a commit to mfasi/julia that referenced this pull request Sep 5, 2016

Revert "Speed up iteration with numbers"

644e1fa

This reverts commit 526695c. Jeff's fix to codegen on Bools, JuliaLang#17225, should make this unnecessary.

simonster mentioned this pull request Apr 17, 2017

Random behavior when converting uninitialized Array{Bool} entries #20065

Closed

vchuravy mentioned this pull request Jun 8, 2024

inlining: optimize and_int(x, true) and or_int(x, false) #54731

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix #17098, bad codegen from too many i1 casts #17225

fix #17098, bad codegen from too many i1 casts #17225

Uh oh!

JeffBezanson commented Jul 1, 2016

Uh oh!

vtjnash commented Jul 1, 2016

Uh oh!

tkelman commented Jul 1, 2016

Uh oh!

timholy commented Jul 1, 2016

Uh oh!

JeffBezanson commented Jul 1, 2016

Uh oh!

yuyichao commented Jul 1, 2016

Uh oh!

timholy commented Jul 1, 2016

Uh oh!

Uh oh!

Uh oh!

fix #17098, bad codegen from too many i1 casts #17225

fix #17098, bad codegen from too many i1 casts #17225

Uh oh!

Conversation

JeffBezanson commented Jul 1, 2016

Uh oh!

vtjnash commented Jul 1, 2016

Uh oh!

tkelman commented Jul 1, 2016

Uh oh!

timholy commented Jul 1, 2016

Uh oh!

JeffBezanson commented Jul 1, 2016

Uh oh!

yuyichao commented Jul 1, 2016

Uh oh!

timholy commented Jul 1, 2016

Uh oh!

Uh oh!