Gerrit Bot (Gerrit)

unread,

Dec 14, 2025, 8:19:53 AM (2 days ago) Dec 14

to goph...@pubsubhelper.golang.org, golang-co...@googlegroups.com

Gerrit Bot has uploaded the change for review

Commit message

runtime: optimize zeroing of registers in secret_amd64.s

Use VPXORQ instead of VMOVAPD because the former, when in the form of a zeroing idiom, is handled directly by the renamer.

Tweak also the KXORQs to operate each on a single register, making it trivial to understand what the intent is, and so that all can potentially execute in parallel.

This PR will be imported into Gerrit with the title and first
comment (this text) used to generate the subject and body of
the Gerrit change.

**Please ensure you adhere to every item in this list.**

More info can be found at https://github.com/golang/go/wiki/CommitMessage

+ The PR title is formatted as follows: `net/http: frob the quux before blarfing`
  + The package name goes before the colon
  + The part after the colon uses the verb tense + phrase that completes the blank in,
    "This change modifies Go to ___________"
  + Lowercase verb after the colon
  + No trailing period
  + Keep the title as short as possible. ideally under 76 characters or shorter
+ No Markdown
+ The first PR comment (this one) is wrapped at 76 characters, unless it's
  really needed (ASCII art, table, or long link)
+ If there is a corresponding issue, add either `Fixes #1234` or `Updates #1234`
  (the latter if this is not a complete fix) to this comment
+ If referring to a repo other than `golang/go` you can use the
  `owner/repo#issue_number` syntax: `Fixes golang/tools#1234`
+ We do not use Signed-off-by lines in Go. Please don't add them.
  Our Gerrit server & GitHub bots enforce CLA compliance instead.
+ Delete these instructions once you have read and applied them

Change-Id: I4b89d28e8c8b2a2d36eeda3b45b5bc27c5014782

GitHub-Last-Rev: 2e02768ed6daa0bd2957e6b4f5371bf6dc056e8c

GitHub-Pull-Request: golang/go#76830

Change diff

diff --git a/src/runtime/secret_amd64.s b/src/runtime/secret_amd64.s
index 06103d1..519fe27 100644
--- a/src/runtime/secret_amd64.s
+++ b/src/runtime/secret_amd64.s
@@ -71,33 +71,40 @@
 	JNE	noavx512
 
 	// Zero X16-X31
-	// Note that VZEROALL above already cleared Z0-Z15.
-	VMOVAPD	Z0, Z16
-	VMOVAPD	Z0, Z17
-	VMOVAPD	Z0, Z18
-	VMOVAPD	Z0, Z19
-	VMOVAPD	Z0, Z20
-	VMOVAPD	Z0, Z21
-	VMOVAPD	Z0, Z22
-	VMOVAPD	Z0, Z23
-	VMOVAPD	Z0, Z24
-	VMOVAPD	Z0, Z25
-	VMOVAPD	Z0, Z26
-	VMOVAPD	Z0, Z27
-	VMOVAPD	Z0, Z28
-	VMOVAPD	Z0, Z29
-	VMOVAPD	Z0, Z30
-	VMOVAPD	Z0, Z31
+	// VPXORQ r, r, r is a zeroing idiom according to section
+	// 3.5.1.7 "Clearing Registers and Dependency Breaking Idioms" in
+	// "Intel® 64 and IA-32 Architectures Optimization Reference Manual: Volume 1"
+	// (April 2024)
+	VPXORQ  Z16, Z16, Z16
+	VPXORQ  Z17, Z17, Z17
+	VPXORQ  Z18, Z18, Z18
+	VPXORQ  Z19, Z19, Z19
+	VPXORQ  Z20, Z20, Z20
+	VPXORQ  Z21, Z21, Z21
+	VPXORQ  Z22, Z22, Z22
+	VPXORQ  Z23, Z23, Z23
+	VPXORQ  Z24, Z24, Z24
+	VPXORQ  Z25, Z25, Z25
+	VPXORQ  Z26, Z26, Z26
+	VPXORQ  Z27, Z27, Z27
+	VPXORQ  Z28, Z28, Z28
+	VPXORQ  Z29, Z29, Z29
+	VPXORQ  Z30, Z30, Z30
+	VPXORQ  Z31, Z31, Z31
 
 	// Zero k0-k7
+	// While these are not categorized as zeroing idioms, having them
+	// operate on a single register per instruction makes it easy to
+	// understand what each instruction does.
+	// Note: for wider compatibility these could equally also be KXORW.
 	KXORQ	K0, K0, K0
-	KXORQ	K0, K0, K1
-	KXORQ	K0, K0, K2
-	KXORQ	K0, K0, K3
-	KXORQ	K0, K0, K4
-	KXORQ	K0, K0, K5
-	KXORQ	K0, K0, K6
-	KXORQ	K0, K0, K7
+	KXORQ	K1, K1, K1
+	KXORQ	K2, K2, K2
+	KXORQ	K3, K3, K3
+	KXORQ	K4, K4, K4
+	KXORQ	K5, K5, K5
+	KXORQ	K6, K6, K6
+	KXORQ	K7, K7, K7
 
 noavx512:
 	// misc registers

Change information

Files:

M src/runtime/secret_amd64.s

Change size: M

Delta: 1 file changed, 31 insertions(+), 24 deletions(-)

Open in Gerrit

Related details

Attention set is empty

Submit Requirements:

Code-Review
No-Unresolved-Comments
Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

satisfied_requirement

open

diffy

Gopher Robot (Gerrit)

unread,

Dec 14, 2025, 8:19:54 AM (2 days ago) Dec 14

to Gerrit Bot, goph...@pubsubhelper.golang.org, golang-co...@googlegroups.com

Gopher Robot added 1 comment

Patchset-level comments

File-level comment, Patchset 1 (Latest):

Gopher Robot . unresolved

I spotted some possible problems with your PR:

  1. Do you still have the GitHub PR instructions in your commit message text? The PR instructions should be deleted once you have applied them.
  2. Do you have the right bug reference format? For this repo, the format is usually 'Fixes #12345' or 'Updates #12345' at the end of the commit message.

Please address any problems by updating the GitHub PR.

When complete, mark this comment as 'Done' and click the [blue 'Reply' button](https://go.dev/wiki/GerritBot#i-left-a-reply-to-a-comment-in-gerrit-but-no-one-but-me-can-see-it) above. These findings are based on heuristics; if a finding does not apply, briefly reply here saying so.

To update the commit title or commit message body shown here in Gerrit, you must edit the GitHub PR title and PR description (the first comment) in the GitHub web interface using the 'Edit' button or 'Edit' menu entry there. Note: pushing a new commit to the PR will not automatically update the commit message used by Gerrit.

For more details, see:

[how to update commit messages](https://go.dev/wiki/GerritBot/#how-does-gerritbot-determine-the-final-commit-message) for PRs imported into Gerrit.
the Go project's [conventions for commit messages](https://go.dev/doc/contribute#commit_messages) that you should follow.

(In general for Gerrit code reviews, the change author is expected to [log in to Gerrit](https://go-review.googlesource.com/login/) with a Gmail or other Google account and then close out each piece of feedback by marking it as 'Done' if implemented as suggested or otherwise reply to each review comment. See the [Review](https://go.dev/doc/contribute#review) section of the Contributing Guide for details.)

Open in Gerrit

Related details

Attention set is empty

Submit Requirements:

Code-Review

No-Unresolved-Comments

Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

open

diffy

Jorropo (Gerrit)

unread,

Dec 14, 2025, 8:33:46 AM (2 days ago) Dec 14

to Gerrit Bot, goph...@pubsubhelper.golang.org, Gopher Robot, golang-co...@googlegroups.com

Jorropo added 1 comment

Patchset-level comments

File-level comment, Patchset 1 (Latest):

Jorropo . resolved

It isn't to obvious why this needs to be changed.
The register renaming unit also handle register register moves.

Is there something that would show why this needs to be done ?

Open in Gerrit

Related details

Attention set is empty

Submit Requirements:

Code-Review
No-Unresolved-Comments
Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

open

diffy

Gerrit Bot (Gerrit)

unread,

Dec 14, 2025, 7:48:10 PM (2 days ago) Dec 14

to goph...@pubsubhelper.golang.org, golang-co...@googlegroups.com

Attention needed from Dmitry Vyukov, Ian Lance Taylor and Martin Möhrmann

Gerrit Bot uploaded new patchset

Gerrit Bot uploaded patch set #2 to this change.

Open in Gerrit

Related details

Attention is currently required from:

Dmitry Vyukov
Ian Lance Taylor
Martin Möhrmann

Submit Requirements:

Code-Review
No-Unresolved-Comments
Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

open

diffy

Carlo Alberto Ferraris (Gerrit)

unread,

Dec 14, 2025, 7:49:46 PM (2 days ago) Dec 14

to Gerrit Bot, goph...@pubsubhelper.golang.org, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Dmitry Vyukov, Ian Lance Taylor and Martin Möhrmann

Carlo Alberto Ferraris added 1 comment

Patchset-level comments

File-level comment, Patchset 1:

Gopher Robot . resolved

I spotted some possible problems with your PR:
  1. Do you still have the GitHub PR instructions in your commit message text? The PR instructions should be deleted once you have applied them.
  2. Do you have the right bug reference format? For this repo, the format is usually 'Fixes #12345' or 'Updates #12345' at the end of the commit message.
Please address any problems by updating the GitHub PR.
When complete, mark this comment as 'Done' and click the [blue 'Reply' button](https://go.dev/wiki/GerritBot#i-left-a-reply-to-a-comment-in-gerrit-but-no-one-but-me-can-see-it) above. These findings are based on heuristics; if a finding does not apply, briefly reply here saying so.
To update the commit title or commit message body shown here in Gerrit, you must edit the GitHub PR title and PR description (the first comment) in the GitHub web interface using the 'Edit' button or 'Edit' menu entry there. Note: pushing a new commit to the PR will not automatically update the commit message used by Gerrit.
For more details, see:
[how to update commit messages](https://go.dev/wiki/GerritBot/#how-does-gerritbot-determine-the-final-commit-message) for PRs imported into Gerrit.
the Go project's [conventions for commit messages](https://go.dev/doc/contribute#commit_messages) that you should follow.
(In general for Gerrit code reviews, the change author is expected to [log in to Gerrit](https://go-review.googlesource.com/login/) with a Gmail or other Google account and then close out each piece of feedback by marking it as 'Done' if implemented as suggested or otherwise reply to each review comment. See the [Review](https://go.dev/doc/contribute#review) section of the Contributing Guide for details.)

Carlo Alberto Ferraris

Done

Open in Gerrit

Related details

Attention is currently required from:

Dmitry Vyukov
Ian Lance Taylor
Martin Möhrmann

Submit Requirements:

Code-Review

No-Unresolved-Comments

Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

satisfied_requirement

open

diffy

Carlo Alberto Ferraris (Gerrit)

unread,

Dec 14, 2025, 8:06:19 PM (2 days ago) Dec 14

to Gerrit Bot, goph...@pubsubhelper.golang.org, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Dmitry Vyukov, Ian Lance Taylor, Jorropo and Martin Möhrmann

Carlo Alberto Ferraris added 1 comment

Patchset-level comments

File-level comment, Patchset 1:

Jorropo . resolved

It isn't to obvious why this needs to be changed.
The register renaming unit also handle register register moves.
Is there something that would show why this needs to be done ?

Carlo Alberto Ferraris

Fair question. Two reasons:

the XOR version does not depend on the semantics of VZEROALL, hence (as it has no preconditions) it's easier to interpret for a reader of the code.
from the referenced document: "Assembly/Compiler Coding Rule 32. (M impact, ML generality) Use dependency-breaking-idiom instructions to set a register to 0, or to break a false dependence chain resulting from re-use of registers. In contexts where the condition codes must be preserved, move 0 into the register instead. This requires more code space than using XOR and SUB, but avoids setting the condition codes.". AFAICT VMOVAPD is not classified as a dependency-breaking-idiom in the referenced doc, and we do not need to preserve the condition codes.

FWIW the second does not apply to KXORQ, but we're not changing that as there is apparently no better alternative.

Open in Gerrit

Related details

Attention is currently required from:

Dmitry Vyukov
Ian Lance Taylor

Jorropo
Martin Möhrmann

Submit Requirements:

Code-Review
No-Unresolved-Comments
Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

satisfied_requirement

open

diffy

Daniel Morsing (Gerrit)

unread,

Dec 15, 2025, 4:57:47 AM (yesterday) Dec 15

to Gerrit Bot, goph...@pubsubhelper.golang.org, Carlo Alberto Ferraris, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Dmitry Vyukov, Ian Lance Taylor, Jorropo and Martin Möhrmann

Daniel Morsing voted and added 1 comment

Votes added by Daniel Morsing

Code-Review

+1

1 comment

Patchset-level comments

File-level comment, Patchset 2 (Latest):

Daniel Morsing . unresolved

Generally supportive of this change, but would be interested in seeing some benchmarks. I'm guessing that this all falls into the background compared to the stack clearing and that the process as a whole stalls out-of-order execution because there are no encoded registers for the CPU to execute ahead with.

Open in Gerrit

Related details

Attention is currently required from:

Dmitry Vyukov
Ian Lance Taylor
Jorropo
Martin Möhrmann

Submit Requirements:

Code-Review

No-Unresolved-Comments

Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

open

diffy

Jorropo (Gerrit)

unread,

Dec 15, 2025, 6:58:29 AM (yesterday) Dec 15

to Gerrit Bot, goph...@pubsubhelper.golang.org, Daniel Morsing, Carlo Alberto Ferraris, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Carlo Alberto Ferraris, Dmitry Vyukov, Ian Lance Taylor and Martin Möhrmann

Jorropo added 1 comment

Patchset-level comments

File-level comment, Patchset 1:

Jorropo . resolved

It isn't to obvious why this needs to be changed.
The register renaming unit also handle register register moves.
Is there something that would show why this needs to be done ?

Carlo Alberto Ferraris

Fair question. Two reasons:
the XOR version does not depend on the semantics of VZEROALL, hence (as it has no preconditions) it's easier to interpret for a reader of the code.
from the referenced document: "Assembly/Compiler Coding Rule 32. (M impact, ML generality) Use dependency-breaking-idiom instructions to set a register to 0, or to break a false dependence chain resulting from re-use of registers. In contexts where the condition codes must be preserved, move 0 into the register instead. This requires more code space than using XOR and SUB, but avoids setting the condition codes.". AFAICT VMOVAPD is not classified as a dependency-breaking-idiom in the referenced doc, and we do not need to preserve the condition codes.
FWIW the second does not apply to KXORQ, but we're not changing that as there is apparently no better alternative.

Jorropo

I hadn't considered the first point.

For the second point, moves are not dependency breaking since that would mean whatever uses the result of a move could execute before the move's dependencies non speculatively.

My point is that a move is just as fast as a dependency breaking instruction if the dependency is always resolved by the time the consumer of the move needs them.
(the register renaming unit does not need to wait on dependencies and it can handle dependent renames in the same cycle)

Which for a zero value either loaded a long time ago (as part of the ABI), or materialized from a dependency-breaking 0 idiom will always be true.

So I guess your code change is a performance noop except maybe pollute the memory caches a bit since you wrote `VPXORQ Z$, Z$, Z$` rather than `VPXORQ Z0, Z0, Z31` (registers `>= 16` needs a prefix since they weren't originally supported in VEX altho I didn't checked if that cost is paid for each parameter or once if any parameter uses a register `>= 16`)
But I doubt anyone can measure this here.

Open in Gerrit

Related details

Attention is currently required from:

Carlo Alberto Ferraris

Dmitry Vyukov
Ian Lance Taylor

Martin Möhrmann

Submit Requirements:

Code-Review
No-Unresolved-Comments
Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

open

diffy

Keith Randall (Gerrit)

unread,

Dec 15, 2025, 11:53:32 AM (yesterday) Dec 15

to Gerrit Bot, goph...@pubsubhelper.golang.org, Keith Randall, Daniel Morsing, Carlo Alberto Ferraris, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Carlo Alberto Ferraris, Dmitry Vyukov, Ian Lance Taylor and Martin Möhrmann

Keith Randall added 1 comment

Patchset-level comments

File-level comment, Patchset 2 (Latest):

Keith Randall . resolved

A couple points here:

Latency of these instructions is irrelevant. No one should be using the results. If they all were really dependent on each other (which I don't think they current are - the renamer would understand this idiom I think), that would almost not matter. In fact, dependent might be better, as we would do a trickle of zeroing in the background leaving other functional units open for other instructions. (Of course if the renamer does the work, maybe irrelevant anyway.)

Performance in general here really doesn't matter. `secret.Do` is expected to be expensive. I would opt for whatever encoding uses the fewest number of bytes.

unsatisfied_requirement

open

diffy

Martin Möhrmann (Gerrit)

unread,

Dec 15, 2025, 12:57:44 PM (yesterday) Dec 15

to Gerrit Bot, goph...@pubsubhelper.golang.org, Keith Randall, Daniel Morsing, Carlo Alberto Ferraris, Dmitry Vyukov, Ian Lance Taylor, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Carlo Alberto Ferraris, Dmitry Vyukov and Ian Lance Taylor

Martin Möhrmann added 1 comment

Patchset-level comments

File-level comment, Patchset 2 (Latest):

Martin Möhrmann . resolved

If we benchmark this lets use some newer amds and intel to judge impact.

On some systems that handle more moves then alu ops and dont have zeroing optimization this may regress and not always be a win IF performance is the ultimate motivator here. Which there are arguments already made that maybe should not.

Some benchmarks across generations are also here: https://chipsandcheese.com/i/138977313/renameallocate

Open in Gerrit

Related details

Attention is currently required from:

Carlo Alberto Ferraris
Dmitry Vyukov
Ian Lance Taylor

Submit Requirements:

Code-Review
No-Unresolved-Comments
Review-Enforcement
TryBots-Pass

Inspect html for hidden footers to help with email filtering. To unsubscribe visit settings.

Gerrit

unsatisfied_requirement

open

diffy

Carlo Alberto Ferraris (Gerrit)

unread,

Dec 15, 2025, 8:21:39 PM (18 hours ago) Dec 15

to Gerrit Bot, goph...@pubsubhelper.golang.org, Keith Randall, Daniel Morsing, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Dmitry Vyukov and Ian Lance Taylor

Carlo Alberto Ferraris added 1 comment

Patchset-level comments

File-level comment, Patchset 2 (Latest):

Carlo Alberto Ferraris . resolved

I will replace "optimize" that may give the wrong impression with "canonicalize" that hopefully expresses intent better. The main reasons for this change are as follows:

for a human reader understanding of what each instruction does becomes independent of all other instructions (this is not the case currently, for the VMOVs depend on the semantics of the ZEROALL)
both intel and amd optimization guides describe the VXOR as one of the "zeroing idioms" to be preferred to moving zeros

I am not claiming this will be faster, just that it will be easier to read and more idiomatic. Will make it clear in the CL description.

Open in Gerrit

Related details

Attention is currently required from:

unsatisfied_requirement

open

diffy

Carlo Alberto Ferraris (Gerrit)

unread,

Dec 15, 2025, 8:28:32 PM (18 hours ago) Dec 15

to Gerrit Bot, goph...@pubsubhelper.golang.org, Keith Randall, Daniel Morsing, Dmitry Vyukov, Ian Lance Taylor, Martin Möhrmann, Jorropo, Gopher Robot, golang-co...@googlegroups.com

Attention needed from Dmitry Vyukov and Ian Lance Taylor

Carlo Alberto Ferraris added 1 comment

Patchset-level comments

File-level comment, Patchset 2 (Latest):

Carlo Alberto Ferraris . unresolved

I will replace "optimize" that may give the wrong impression with "canonicalize" that hopefully expresses intent better. The main reasons for this change are as follows:
for a human reader understanding of what each instruction does becomes independent of all other instructions (this is not the case currently, for the VMOVs depend on the semantics of the ZEROALL)
both intel and amd optimization guides describe the VXOR as one of the "zeroing idioms" to be preferred to moving zeros
I am not claiming this will be faster, just that it will be easier to read and more idiomatic. Will make it clear in the CL description.

Carlo Alberto Ferraris

FWIW, somebody more informed than me may want to also consider whether it makes any sense to have VZEROALL *immediately followed* by other instructions that operate on ZMM registers. My vague recollection is that VZEROALL/VZEROUPPER should be mostly executed to avoid penalties when transitioning from VEX to non-VEX code. Immediately executing more VEX code after the VZEROALL, and then not executing a VZEROUPPER/VZEROALL when done, seems counter to that.

unsatisfied_requirement

open

diffy

Reply all

Reply to author

Forward

[go] runtime: optimize zeroing of registers in secret_amd64.s

Gerrit Bot (Gerrit)

Gerrit Bot has uploaded the change for review

Commit message

Change diff

Change information

Related details

Gopher Robot (Gerrit)

Gopher Robot added 1 comment

Related details

Jorropo (Gerrit)

Jorropo added 1 comment

Related details

Gerrit Bot (Gerrit)

Gerrit Bot uploaded new patchset

Related details

Carlo Alberto Ferraris (Gerrit)

Carlo Alberto Ferraris added 1 comment

Related details

Carlo Alberto Ferraris (Gerrit)

Carlo Alberto Ferraris added 1 comment

Related details

Daniel Morsing (Gerrit)

Daniel Morsing voted and added 1 comment

Votes added by Daniel Morsing

1 comment

Related details

Jorropo (Gerrit)

Jorropo added 1 comment

Related details

Keith Randall (Gerrit)

Keith Randall added 1 comment

Martin Möhrmann (Gerrit)

Martin Möhrmann added 1 comment

Related details

Carlo Alberto Ferraris (Gerrit)

Carlo Alberto Ferraris added 1 comment

Related details

Carlo Alberto Ferraris (Gerrit)

Carlo Alberto Ferraris added 1 comment