Skip to content

Conversation

abrown
Copy link
Member

@abrown abrown commented Jul 17, 2025

This adds full boolean term support for instructions emitted in the new assembler (terms like (_64b | compat) & avx2). Despite doing more checks, this may be quicker too: instead of building up a SmallVec<InstructionSet> to compare against, this generates Rust code like the following that queries what features are available in the target via the AvailableFeatures trait:

#[must_use] // cranelift/assembler-x64/meta/src/generate/inst.rs:227
pub fn is_available(&self, features: &impl AvailableFeatures) -> bool {
    (features._64b() || features.compat()) && features.avx2() // cranelift/assembler-x64/meta/src/generate/inst.rs:232
}

With all of this in place, this PR has a large mechanical translation of all the old, incorrect feature definitions (_64b | compat | avx2) into their new, correct definitions ((_64b | compat) & avx2). I expect this will see a lot more use of this when using more instructions from AVX512, AVX10, APX, etc.

@abrown abrown requested a review from a team as a code owner July 17, 2025 23:01
@abrown abrown requested review from alexcrichton and removed request for a team July 17, 2025 23:01
@abrown
Copy link
Member Author

abrown commented Jul 17, 2025

I do expect this to fail in Winch: something about how we're using -Ccranelift-has-avx2 in param_av2.wat doesn't seem to propagate the right features into the ISA flags used here:

fn ensure_has_avx2(&self) -> Result<()> {
anyhow::ensure!(self.flags.has_avx2(), CodeGenError::UnimplementedForNoAvx2);
Ok(())
}

@github-actions github-actions bot added the cranelift Issues related to the Cranelift code generator label Jul 18, 2025
abrown added 7 commits August 5, 2025 09:15
As discussed [here], we will soon need the ability to express more
complex combinations of CPU features. These are best expressed as
boolean terms: e.g., `(32-bit OR 64-bit) AND ...`, `(32-bit OR 64-bit)
AND ((AVX512VL AND AVX512F) OR AVX10.1)`. This change modifies the
generated code to have a `Inst::is_available` method which contains a
Rust-ified version of the instruction's boolean term. To do this, we now
pass in a `Features` trait, which the instruction can query to see if
its desired feature set is available.

[here]: https://bytecodealliance.zulipchat.com/#narrow/channel/217117-cranelift/topic/boolean.20terms.20for.20x64.20features
This change makes us of the assembler's new generated
`Inst::is_available` methods to check an instruction's feature set in a
more succinct (and likely quicker) way. Unfortunately, this does not
allow us to print the missing ISA requirements on failure--something to
address later.
This is a mechanical transformation converting all instruction
definitions. Now, instructions should have a correct boolean term
describe the features required: e.g., `(_64b | compat) & avx`.
In Cranelift, the `has_*` flags of `isa::x64::settings::Flags` indicate
that the CPU _has_ some capability; the `use_*` flags indicate that
Cranelift _should emit_ instructions using those capabilities. Further,
the `use_*` flags may turned on by the presence of more than one `has_*`
flags; e.g., when `has_avx` and `has_avx2` are available, `use_avx2` is
enabled.

Now that Cranelift's new x64 assembler understands boolean terms, we no
longer need the `use_*` flags for checking if an instruction can be
emitted: instead, we should use the `has_*` flags and rely on the logic
encoded in `Inst::is_available`.
For better error messages (and just for general use of CPU features, see
discussion [here]), this change adds `Inst::features`--a way to
explicitly examine the boolean term for an instruction. This function
returns a `&'static Features` that contains the `AND` and `OR` branches
defining when an instruction is available. This is all generated into
something that looks like:

```rust
pub fn features(&self) -> &'static Features {
    const F1: &'static Features = &Features::Feature(Feature::_64b);
    const F2: &'static Features = &Features::Feature(Feature::compat);
    const F0: &'static Features = &Features::Or(F1, F2);
    F0
}
```

This change makes use of `for_each_feature` more: we build up the
`AvailableFeatures` trait and the `Feature` enum from it. This should be
a bit more direct than searching through the generated code (?).

[here]: bytecodealliance#11272 (comment)
@alexcrichton alexcrichton added this pull request to the merge queue Aug 6, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Aug 6, 2025
@alexcrichton alexcrichton added this pull request to the merge queue Aug 6, 2025
Merged via the queue into bytecodealliance:main with commit 66b4bf3 Aug 6, 2025
56 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cranelift Issues related to the Cranelift code generator
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants