S390x: emit new instructions added in z17 by theotherjimmy · Pull Request #12319 · bytecodealliance/wasmtime

theotherjimmy · 2026-01-12T15:57:09Z

Z17 (arch15) includes some instructions that allow us to encode some more complicated operations in fewer instructions. This PR adds support to cranelift-codegen to emit these newer instructions when appropriate.

Further, Z17 includes a VBLEND instruction that mimics the same instruction on x64. Since this is no longer an x64-exclusive instruction type, I've renamed the appropriate stuff within cranelift codegen to reflect that this is not ISA-specific anymore.

github-actions · 2026-01-12T17:47:34Z

Subscribe to Label Action

cc @cfallin, @fitzgen

Details

This issue or pull request has been labeled: "cranelift", "cranelift:area:aarch64", "cranelift:area:x64", "cranelift:meta", "isle"

Thus the following users have been cc'd because of the following labels:

cfallin: isle
fitzgen: isle

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

alexcrichton · 2026-01-12T20:58:57Z

I'm going to shift review of this over to @uweigand

alexcrichton · 2026-01-12T20:59:25Z

or, well, I can't officially do that, but @uweigand I'm happy to rubber-stamp once you've approved

uweigand

Looks mostly good to me, but see inline comments.

In addition to what is implemented here, we now could implement vector integer division for 32-bit and 64-bit integer vectors - but there is currently no ISLE to even express this.

cranelift/codegen/src/isa/s390x/inst/emit_tests.rs

cranelift/codegen/src/isa/s390x/lower.isle

uweigand · 2026-01-13T15:10:19Z

cranelift/codegen/src/isa/s390x/lower.isle

+(rule 16 (lower (has_type (and (vxrs_ext3_enabled) (vr128_ty ty)) (band (band y z) (bnot x))))
+      (vec_eval ty 0b00000010 x y z))
+(rule 17 (lower (has_type (and (vxrs_ext3_enabled) (vr128_ty ty)) (band (band x (bnot y)) z)))
+      (vec_eval ty 0b00000010 y x z))


Not sure what if any canonicalization is done at the ISLE level here, but these four don't cover all possible combinations. E.g. (band x (band (bnot y) z)) is not covered.

I guess a more fundamental question is which combinations we should be covering. For example, why cover and-not with three inputs but not or-not?

I stopped because I realized this would add hundreds of rules to get correct, and ran out of steam pretty quickly. I think it's an open question: what should we encode? what 3-input binary operations are actually used?

I made a python script to handle this, and committed the results. My thinking is this:

This only has to happen once

It would be better to have the results interspersed in the file near their 2-input counterparts for better

uweigand · 2026-01-13T15:13:33Z

cranelift/filetests/filetests/isa/s390x/vec-bitwise-arch15.clif

+; block0: ; offset 0x0
+;   .byte 0xe7, 0x8a
+;   .byte 0x80, 0x02
+;   .byte 0x9f, 0x88


We should also add z17 insns to the disassembler, but that is of course a different patch.

crates/cranelift/src/func_environ.rs

uweigand

This looks all good to me now, except for the srem implementation - see inline comment.

cranelift/codegen/src/isa/s390x/lower.isle

This emits & tests a bunch of instructions: * from Miscellaneous-Instruction-Extensions Facility 4: * CLZ, 64bit * CTZ, 64bit * from Vector-Enhancements Facility 3: * 32x4, 64x2 & 128x1 variants of the following: * Divide * Remainder * 64x2 & 128x1 multiply variants * 128x1 vaiants of: * Compare * CLZ * CTZ * Max * Min * Average * Negation * Evaluate Co-authored-by: Jimmy Brisson <jbrisson@linux.ibm.com>

Now that s390x implements blendv as well, we should refer to the instruction without the x86 prefix.

uweigand

Thanks, this version LGTM now!

cfallin

Rubber-stamping based on Ulrich's review -- thanks!

theotherjimmy requested a review from a team as a code owner January 12, 2026 15:57

theotherjimmy requested review from alexcrichton and removed request for a team January 12, 2026 15:57

theotherjimmy force-pushed the s390x-z17 branch 3 times, most recently from ae1c56a to fac9929 Compare January 12, 2026 16:26

uweigand reviewed Jan 13, 2026

View reviewed changes

theotherjimmy force-pushed the s390x-z17 branch from fac9929 to 73c5595 Compare January 13, 2026 18:26

theotherjimmy force-pushed the s390x-z17 branch 2 times, most recently from a32e45e to 898cc0e Compare January 23, 2026 16:10

uweigand reviewed Jan 28, 2026

View reviewed changes

cranelift/codegen/src/isa/s390x/lower.isle Outdated Show resolved Hide resolved

uweigand and others added 3 commits January 29, 2026 11:15

s390x: Emit vector blend on z17

7379b7e

Rename x86_blendv to blendv

8ae8e8f

Now that s390x implements blendv as well, we should refer to the instruction without the x86 prefix.

theotherjimmy force-pushed the s390x-z17 branch from 898cc0e to 8ae8e8f Compare January 29, 2026 17:15

uweigand approved these changes Feb 3, 2026

View reviewed changes

cfallin approved these changes Feb 3, 2026

View reviewed changes

cfallin added this pull request to the merge queue Feb 3, 2026

Merged via the queue into bytecodealliance:main with commit 7ac4b81 Feb 3, 2026
76 checks passed

Conversation

theotherjimmy commented Jan 12, 2026

Uh oh!

github-actions bot commented Jan 12, 2026

Subscribe to Label Action

Uh oh!

alexcrichton commented Jan 12, 2026

Uh oh!

alexcrichton commented Jan 12, 2026

Uh oh!

uweigand left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

uweigand Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

theotherjimmy Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

theotherjimmy Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

uweigand Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

uweigand left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

uweigand left a comment

Choose a reason for hiding this comment

Uh oh!

cfallin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants