Post-Quantum BIP-86 Recovery via zk-STARK Proof of BIP-32 Seed Knowledge

1,012 views
Skip to first unread message

Olaoluwa Osuntokun

unread,
Apr 8, 2026, 12:18:43 AM (10 days ago) Apr 8
to Bitcoin Development Mailing List
Hi y'all,

I found some spare time this last weekend to dust off a little side project
I started last August: extend TinyGo [1] to be able to produce RISC-V ELF
binaries capable of being run as a guest on the risc0 platform to generate
zk-STARK proofs of arbitrary programs. Initially, I didn't really have a
clear end target application, it was mainly a technical challenge to force
me to learn a bit more about the RISC-V platform, and also the host/guest
architecture of risc0. Fast forward ~9 months later, and an initial killer
use case popped into my mind: a zk-STARK proof that a Taproot output public
key was generated using BIP-32, via a given BIP-86 derivation path.

More formally:
```math
\mathcal{R} = \left\lbrace\;
(\overbrace{K,\, C}^{\textsf{public}} ;\; \underbrace{s,\, \mathbf{p}}_{\textsf{witness}})
\;\middle|\;
\begin{aligned}
  K &= \textsf{BIP86Taproot}\bigl(\textsf{BIP32Derive}(s,\, \mathbf{p})\bigr) \\
  C &= \textsf{SHA256}\bigl(\texttt{"bip32-pq-zkp:path:v1"} \;\|\; \mathbf{p}\bigr)
\end{aligned}
\;\right\rbrace
```

where $K$ is the Taproot output key, $C$ is the path commitment, $s$ is the
BIP-32 seed, and $\mathbf{p}$ is the derivation path.


I was able to get everything working e2e over the weekend, after making
some tweaks to my initial architectural game plan!

The TL;DR is that:

  * Given that the Taproot commitment scheme is post-quantum secure [3], in
    the future we can deploy a soft fork to _disable_ the keyspend path,
    and force all Taproot spends to instead flow through the script path
    (not my idea, commonly discussed amongst developers, not sure who
    proposed it first). At that point, Taproot starts to resemble BIP-360.

  * That works for script path spends, but then leaves all the BIP-86
    wallets in a bad position, as they generated outputs that provably
    don't commit to a script path at all.

  * A 2023 paper (Protecting Quantum Procrastinators with Signature
    Lifting: A Case Study in Cryptocurrencies [4]) proposed a solution to this,
    namely _seed lifting_ (use BIP-32 as the one-way function to the
    Picnic PQ Signature scheme) to provide a post-quantum proof of secret
    information a quantum attacker wouldn't be able to easily obtain.

  * The downside of that is that it reveals the secret BIP 32 seed,
    exposing other non migrated UTXOs of a user.

  * With this project I've cobbled together a series of projects to be able
    to generate a zk-STARK proof that a Taproot output public key was
    generated via BIP-32 invocation of a BIP-86 derivation path.

  * In the future a variant of this scheme can be used to enable wallets
    that generated the private keys via BIP-86, to have a post quantum safe
    exit path in case they don't bother moving their coins in time to the
    yet-to-be-decided post quantum signature scheme.

To achieve this end, I needed to create/fork a series of repos:

  * tinygo-zkvm: https://github.com/Roasbeef/tinygo-zkvm
    * A fork of TinyGo that supports the flavor of RISC-V (rv32im) that
      risc0 requires to generate/execute a guest program to later be proved
      by the host.

  * risc0: https://github.com/Roasbeef/risc0
    * Mostly a bug fix to their c-guest example, along with some
      additional documentation on how to get things running. The repo is
      unmodified other than that. Recent updates to the repo made the
      entire process much easier (Go guest+host), more on that later.

  * go-zkvm: https://github.com/Roasbeef/go-zkvm
    * Go utilities to take a RISC-V ELf binary produced by tinygo-zkvm, and
      package it in the expected R0BF format, which combines the user
      generated RISC-V ELF (the thing that is executed to generate the
      proof) along with the v1compat ELF kernel, which is risc0's execution
      environment.

    * This also includes a Go host package, which loads the guest program,
      executes it, and generates a trace to later be proved. This is
      achieved via a C FFI compat layer between Go and the original Rust
      host/proving/verification code.

  * bip-32-pq-zkp: https://github.com/Roasbeef/bip32-pq-zkp
    * The project that packages everything together, this contains the:
      * Guest Go program that defines the secret witness and
        claim/constraints of the proof.

      * The C FFI wrapper around the OG Rust host, which is used to load
        the guest program, execute it, generate a trace, then finally
        generate a proof.

Details of the final proof as generated on my Mac Book (Apple Silicon M4
Max, 128 GB of RAM):
  * Takes ~55 seconds or so to generate+proof, including execution. This
    uses Metal for GPU acceleration on the platform.
  * Uses ~12 GB of ram.
  * Final proof size is ~1.7 MB.
  * Verification takes ~1.8 seconds, and uses ~32 MB of memory.

On several layers, this demo is far from optimized (more on that later),
this is meant to serve as a PoC to demonstrate that with the latest
software+hardware, a proof of this complexity is well within reach.

For those curious re the e2e details I've generated this tutorial that
explains the entire system top to bottom:
https://github.com/Roasbeef/go-zkvm/blob/main/docs/tutorial.md.

If you got to this point in this mail, and don't care about the lower level
details, thanks for reading up until now, and feel free to return back to
the _The Net of a Million Lies_, or as better known in our Universe:
Monitoring the Situation and/or slopfotainment! 🫡

## Motivation + Background

As commonly known, in the case of an adversary that possesses a quantum
computer capable of breaking classical asymmetric cryptography, any coins
stored in UTXOs with a known public key are vulnerable. This is the case
for any P2PK outputs from waaaay back, and also any other outputs that have
revealed their public key. Pubkey reveal might happen due to address re-use
(spending from the same script twice), or Taproot outputs, which publish
the public key plainly in the pkScript.

As detailed in [3], for Taproot outputs, a widely circulated plan is
roughly to: disable the _keyspend_ path (requires a simple signature),
enforcing a new rule that all Taproot spends must then flow through the
script path. Spending via the script path requires an opening of the
Taproot commitment (C = I + H(I || H(M))), which was shown to be binding even
under classic assumptions, as H(M) (tapscript merkle root) is still a
collision-resistant function.

That means any UTXO that _does_ commit to a script path has a future escape
hatch _if_ such a softfork would need to be deployed in the future.
However, what about all the other wallets that use BIP 86, and don't commit
to a script path at all? Under a strict version of this existing
proposal, those wallets would basically be locked forever.

The goal of this work is to demonstrate a practical solution (discussed
against devs, but never implemented AFAICT): generate a zk proof that an
output was generated using BIP-86. For the zk-Proof, we select zk-STARKs,
as they're plausibly post quantum since they rely only on symmetric
cryptography: layers of merkle trees over an execution trace, along with
some novel sampling/error-correction algorithms.

At this point, you may be asking: "if the quantum adversary can derive the
private key to a random taproot public key, then how exactly does this
help?". The answer lies in the structure of BIP-32! BIP-32 takes an initial
128-512-bit seed (with BIP-39, either 12 or 24 words), then runs it through
HMAC-SHA512 keyed by "Bitcoin seed" to produce the master extended private
key. An adversary who wants to forge this proof needs to find a _colliding_
seed: a different seed s' such that HMAC-SHA512("Bitcoin seed", s') produces
the same master key. The BHT algorithm (Brassard-Hoyer-Tapp [6]) is the
best known quantum collision finder, and it runs in time proportional to the
cube root of the output space: 2^(n/3). For HMAC-SHA512's 512-bit output,
that's ~2^171 quantum operations, well above even NIST's highest
post-quantum security category. Therefore, if you generated a wallet using
BIP-32, you possess _another_ secret that a quantum adversary can't
efficiently reconstruct!

This demo focuses on the Taproot case, but the rough approach also applies
to any other output generated via BIP-32. BIP 32 was originally published in
2012, over 14 years ago. So safe to say that _most_ wallets were generated
under this scheme. However, Bitcoin Core only officially adopted BIP-32 in
2016/2018, moving away from their existing key pool structure. I can't say
how much BTC is held today in outputs generated with Bitcoin Core's original
key pool, but if you have coins generated via that mechanism, you may want
to consider migrating them to a BIP-32 wallet.

## TinyGo + RISC-V + risc0

Now for some of the lower level details. risc0 is a STARK based proving
system that takes a RISC-V ELF binary generated by a guest program (any
program generating using their flavor of rv32im can be proved), executes
that in a host environment, generates a trace, then produces a STARK proof
from that.

Today you can take some subset of Rust, compile it to an ELF using their
toolchain, then execute it, generate a trace, to finally prove+verify it
using their system.

This demo took a bit of a round about journey to achieve this, as after
all, the journey is most of the fun, ain't it!

For the past 10 years or so, my Bitcoin stack of choice (lnd/btcsuite) uses
a series of Go libraries, so I wanted to be able to re-use them, first for
this demo, then also in the future for other projects.

TinyGo is a special Go compiler based on LLVM, that targets mostly embedded
environments. You can use it to generate go programs that can run on
micro controllers, or on web assembly (producing a smaller binary than if
you used the normal stdlib path).

TinyGo supports RISC-V, but _not_ the 32-bit variant of RISC-V that risc0
relies on. So the first step here was to create a new target definition for
TinyGo: riscv32-unknown-none, which uses base integer + multiply/divide
instructions with no compressed instructions, which uses 4 KB stacks for
each task. From there, I created a new linker script
(`targets/riscv32im-risc0-zkvm-elf.ld`) which created a memory layer
identical to what risc0 expects. The final component was a new runtime
(`src/runtime/runtime_zkvm.go`), which implemented a few platform specific
syscalls for risc0 (putchar(), exit(), ticks(), and growHeap()).

When I tried to get this working last year, I had to also implement a number
of kernel syscalls (called ecalls in the platform [7]) to handle: read+write
to stdin/stdout, halting, and the journaling mechanism (the transcript of
execution committed to), which basically implement the kernel that the guest
executes in. Fast forward to 2026, and after pulling the latest version of
the repo, I realized that they now make a libzkvm_platform.a, which packages
up the kernel nicely to be linked against. So I threw out my custom kernel
code, and slotted that in instead.

The final component is a C FFI layer that enables me to use _both_ a Go
guest (the program to be proved) and a Go host (the thing that executes the
program and generates the final proof).

## BIP-32+Taproot zk-STARK Proof

With basic proofs working (like the classic: I know the factorization of a
number `n`), I was unblocked to generate the actual proof. The claim/proof
is represented with the following JSON artifact:
```
{
  "schema_version": 1,
  "image_id": "8a6a2c27dd54d8fa0f99a332b57cb105f88472d977c84bfac077cbe70907a690",
  "claim_version": 1,
  "claim_flags": 1,
  "require_bip86": true,
  "taproot_output_key": "00324bf6fa47a8d70cb5519957dd54a02b385c0ead8e4f92f9f07f992b288ee6",
  "path_commitment": "4c7de33d397de2c231e7c2a7f53e5b581ee3c20073ea79ee4afaab56de11f74b",
  "journal_hex": "010000000100000000324bf6fa47a8d70cb5519957dd54a02b385c0ead8e4f92f9f07f992b288ee64c7de33d397de2c231e7c2a7f53e5b581ee3c20073ea79ee4afaab56de11f74b",
  "journal_size_bytes": 72,
  "proof_seal_bytes": 1797880,
  "receipt_encoding": "borsh"
}
````

The `image_id` is basically a hash of the ELF, so you know what the prover
executed. There are then a few flags that control the claim version and
whether BIP-86 derivation is a part of the proof. BIP-86 was only adopted
post-Taproot, so if you have an existing BIP-44 path, you can instead opt to
claim that instead. The Taproot key we're generating the proof against is
also part of the _public data_, as it sits plainly on the chain for all to
see. We then also include a `path_commitment`, which is a commitment to the
exact BIP 86 path that the prover used. Finally, we also commit to the
journal hex, which is basically a commitment to the public claim.

Assuming you've built the project, then you can generate the proof (even
passing in an arbitrary BIP-32 seed and derivation path with)
```
make prove GO_GOROOT=/path/to/go1.24.4
```

Then verify it with:
```
make verify GO_GOROOT=/path/to/go1.24.4
```

The default prove target writes:
  * ./artifacts/bip32-test-vector.receipt
  * ./artifacts/bip32-test-vector.claim.json

The receipt is the STARK proof artifact. claim.json is the stable,
human-readable description of the public statement being proved.

## Application to a Future Keyspend Disabling Soft fork

As mentioned above, assuming the community is forced to deploy a keyspend
disabling soft fork in the future, we can also deploy some variant of
this proof to enable both BIP-86 wallets, and also any BIP-32 wallet, to
sweep their funds into a new PQ output.

In 2026, we've shown that this is achievable using 2 year old consumer
hardware. I don't doubt that the upcoming advancements (eg: photonics, new
flavor of high bandwidth memory, etc) in hardware (driven by the fierce AI
race) will make such a proof even more feasible.

One thing to note is that this proof has a few layers of indirection,
mainly the RISC-V layer that adds overhead which increase the total amount
of steps, and therefore the size of the proof. A production grade
deployment would likely instead hand roll a custom STARK proof for this
exact statement, to achieve a faster and smaller proof).

# Future Work

In terms of future work, there're a number of interesting following up
projects that can be pursued from here.

One basic one is that the current proof doesn't actually commit to a
spending txid and/or sighash. That can be trivially incorporated into the
proof. Going a step further, the execution of the guest program can even
_generate_ a valid schnorr signature to permit spending.

Looking to the memory+computational requirements necessary to generate the
proof, I've left two low hanging fruits:

 1. First, we can speed up the Elliptic Curve operations the proof requires
    (scalar base mult, then addition, or more performantly Double Scalar
    Multiplication via the Strauss-Shamir trick). For this we can use the
    syscalls/precompile in the risc0 env for big integer arithmetic:
    sys_bigint and sys_bigint2. With this, the guest calls into the kernel
    to use an optimized/accelerated circuit for the modular arithmetic,
    reducing cycles, steps, and thus proof size.

 2. Second right now, the entire claim is a single proof. Instead, we can
    first break that up using their recursive proof/composition syscalls:
    sys_verify_integrity+sys_verify_integrity2. We can then assembled a
    series of these proofs into a _single_ statement, which can save block
    space by aggregating N proofs into a single proof.

-- Laolu

[1]: https://tinygo.org/

[2]: https://risczero.com/

[3]: https://eprint.iacr.org/2025/1307

[4]: https://eprint.iacr.org/2023/362

[5]: https://microsoft.github.io/Picnic/

[6]: https://en.wikipedia.org/wiki/BHT_algorithm

[7]: https://github.com/Roasbeef/go-zkvm/blob/main/docs/ecall-reference.md

conduition

unread,
Apr 8, 2026, 12:10:00 PM (9 days ago) Apr 8
to Olaoluwa Osuntokun, Bitcoin Development Mailing List
Hi Laolu,

Great work getting this working in the real world. I've heard many people on delving and the mailing list conjecture based on this idea, but you're the first person i've seen who's willing to put their money where their mouth is, and actually build a prototype. Bravo!

It seems to me the circuit (guest program) could be simplified. Notice how the guest code computes the entire HD wallet key path, including hardened and non-hardened derivation steps, and also computes the taproot output key with key-tweaking. I'd argue these steps are extraneous to the core hard relation you want the STARK to prove, and could be safely removed to reduce proof size and improve performance.

In reality, you needn't go so far as to prove (1) "I know a BIP39 seed which derives this taproot output key". You need only prove this much more general statement (2): "I know a BIP32 xpriv which derives this xpub via one or more hardened steps". The latter statement (2) still cannot be forged by a quantum adversary even if they know your account-level xpub, but it entails far less computation to prove and verify. The rest of the original statement (1) can be done externally outside the circuit.

Example. If i have a wallet with a taproot address at m/86'/0'/0'/1/2​, I could prove I know the xpriv at m/86'/0'​ which derives the xpub at m/86'/0'/0'​. Then I provide the remaining key path elements /1/2​ in the witness. Note, i do not mean we derive the xpriv at m/86'/0'​ inside the guest program. I mean the prover derives m/86'/0'​ first (in the host), and then writes that xpriv into the guest program's inputs. The guest program derives and outputs the xpub at m/86'/0'/0'​. The verifier may check the STARK output (xpub) is correctly computed, then use the given key-path to manually derive the taproot address from the xpub themselves, outside the circuit, and validate that address against the UTXO i'm spending. The verifier thus has confirmed the prover knew an xpriv which (through a hardened derivation step) derives the correct taproot output key.

This change significantly reduces the size of the circuit. From a glance, I see the original guest program performs 6 HMAC-SHA512 calls (1 for the master key, 5 for the BIP32 derivation steps), two SHA256 compression calls (for the taptweak hash), and two point multiplications. With this simplified variant, we are invoking only a single HMAC-SHA512 call and a single point multiplication. I can't say for sure, but I expect this will improve your proof size and runtime significantly.

This change also makes the circuit more generally applicable to other rescue contexts. For instance, it could be applied to BIP340 xonly keys inside a taproot script tree, or in a P2(W)SH address to an ECDSA public key, or to P2(W)PKH addresses.

Concerned about publishing xpubs? Remember that we are assuming regular EC spending is locked in this context, so it is safe-ish to share account xpubs with quantum attackers. At best the xpub can be used for surveillance but not forgery. If one would prefer not to share the account-level xpub on-chain for privacy reasons, the proof could be extended to also derive the unhardened child xpub at /1/2​ inside the guest program (but we still do not need to do the taproot key tweaking in the guest program).

We should also talk scaling efficiency. Given the cost of STARKs, this style of proof should be able to authorize spends for more than one UTXO. Say you have a wallet with 10 different UTXOs held by distinct addresses in the same BIP44 account. One single STARK proof could authorize spending all 10 of them, by simply committing all 10 input signature hashes into the journal, and labeling the inputs with the corresponding 10 BIP32 key paths somehow. The verifier would need to check the proof only once and not 10 times. The 10 UTXO spends could be validated using the common xpub from the STARK proof's journal.

For a slightly related work proving a similar relation for hashed addresses, using different STARK technology stacks, see this delving post.

However, all this said, my personal preference for long-term procrastinator rescue is still for commit/reveal strategies which prove essentially the same statement about BIP32 in a two-step procedure. They get the job done with much lighter cryptographic machinery and much smaller witnesses: a few hundred bytes over two transactions, compared to a few million bytes in one transaction with STARKs. Boris Nagaev and I discussed this on the list a while back. That said, commit/reveal requires more careful design and seems to demand the use of external quantum-safe coins to make the commitment in the first place, so perhaps the cost would be worth it to some people? IDK. What do you think of commit/reveal compared to STARKs for this purpose?

regards,
conduition

--
You received this message because you are subscribed to the Google Groups "Bitcoin Development Mailing List" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bitcoindev+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/bitcoindev/CAO3Pvs_PciUi%2BzBrCps3acO14sgeHVUANx9w6TVwUf_AYcd_qQ%40mail.gmail.com.

publickey - conduition@proton.me - 0x474891AD.asc
signature.asc

conduition

unread,
Apr 8, 2026, 4:30:46 PM (9 days ago) Apr 8
to conduition, Olaoluwa Osuntokun, Bitcoin Development Mailing List
Oh, I've been a fool, a foolish fool.

We don't even need to do point multiplication in the circuit at all.

I'm amending my prior suggestion slightly: The circuit (guest program) could take in an xpriv (e.g. at m/86'/0'​) and output a child xpriv (e.g. at m/86'/0'/0'​) to the journal (instead of outputting a child xpub). 

This is safe because remember, EC spending has been disabled in this context, and to a quantum attacker, an xpub is computationally equivalent to its xpriv. So why bother hiding it? The child xpriv doesn't give an observer anything they can't already do with the equivalent xpub. 

The guest program then is basically the BIP32 CKDpriv algorithm, restricted to a single hardened derivation step. The verifier gets the child xpriv, but can't use it to forge new proofs. Honest verifiers use the xpriv to derive the child address(es) as suggested in my last message, to authenticate spending.

Designing the guest program like this will massively reduce your circuit complexity, because EC point multiplication is wayyyyy harder for the RISC0 compiler to arithmetize than a simple hash function. In my prior work with RISC0, I made a guest program which ran a SHA256 hash and an EC point multiplication. I found that pruning EC point arithmetic from my guest program improved prover runtime by a factor of over 100x.

If I am not fever-dreaming and this is indeed possible, then the new circuit's complexity will be dominated not by point multiplication, but by the HMAC-SHA512 call. Our new task is then to figure out how much we can internally optimize the HMAC-SHA512 call for STARK proving. Here's a few ideas.

If you bust open HMAC-SHA512, it looks like this:

HMAC_SHA512 = SHA512((K⊕0x5c) || SHA512((K⊕0x36) || msg))​ 

...where in the context of BIP32 hardened CKD, the HMAC key K​ is the chaincode (padded with zeros to 128 bytes) and msg = (0x00 || sk || i) is the parent secret key and child index. 

Since len(K) = 128​ is the SHA512​ block size, we need a total of 4 SHA512 compression calls: 
  1. to compress (K⊕0x36)​
  2. to compress the msg​ (and SHA512 padding/length)
  3. to compress (K⊕0x5c), and 
  4. a final compression call to tie it all together. 

The output of that last compression call is partitioned into the child chaincode, and a key delta which is added to the parent secret key (modulo the curve order), producing the child EC secret key. This last step is arithmetically simple; the SHA512 calls are where most of the arithmetic complexity lies.

The question then becomes, which of these compression calls can be done outside the circuit, and which are truly essential for security? 

Note how the parent secret key is the most important piece for soundness. The circuit needs to prove the parent secret key existed in the hash function preimage, and is correctly related to the child secret key via modular addition. So compression call (2) seems unavoidable. The others are less rigid.

I'd argue that if we really dig into the hard relation we're trying to prove here, we can reduce it to this statement:

Given a child xpriv with secret key k​, chaincode c​ and index i​, I know a preimage x​ and secret key sk​ such that:

I <- SHA512(<something> || SHA512(<something> || 0x00 || sk || i)​)
c == I[:32]​
k == int(I[32:]) + sk % n​

Seeing as the <something>​ slots are arbitrary, and we know in BIP32 they are always exactly one-block long, it seems easy to throw out the compression calls (1) and (3). The host can precompute the relevant SHA512 midstates outside the circuit, and pass the midstates into the guest program as secret inputs. The tradeoff is that this permits malicious provers the flexibility of choosing their starting midstates (though hash input length can be fixed at 192 bytes). I'm not entirely sure if this meaningfully weakens the verifier's soundness. Ethan Heilman might have opinions on this, he knows a lot more about attacking hash functions than I do. Intuitively, I doubt sampling random SHA512 midstates is that much better than sampling a random HMAC key (chaincode) K​ and computing the resulting midstates.

This reduces our circuit to, i think, the minimum acceptable security floor for provers: two SHA512 compression calls, which commit to a parent secret key.


regards,
conduition
publickey - conduition@proton.me - 0x474891AD.asc
signature.asc

Olaoluwa Osuntokun

unread,
Apr 9, 2026, 4:34:27 PM (8 days ago) Apr 9
to Bitcoin Development Mailing List
Thanks to Luke Childs, I was able to reduce the size of the proof from 1.8 MB,
to 200 KB. The trade off is that it now takes round 180 seconds to prove vs 50
seconds. The space optimization vector here is utilizing a layer of recursion,
so first the main proof is generated, then a proof that a verified would accept
_that_ proof is generated. IIUC, this can be used to compress a proof a high
complexity into an effectively constant sized statement of proof acceptance.

Relevant commits for the curious:
  * https://github.com/Roasbeef/go-zkvm/compare/f691458110b6b551a20ab36d15de98dfb6813d8c...77420055510acfc2f0bee2dcf2313365dee927e8
  * https://github.com/Roasbeef/bip32-pq-zkp/compare/aae0a1a763eed6bfab8660bcfb8f938f91b10c10...4dcb731e4fc97f84e52c85d2768dc41612c71027


I added some sketch notes to the repo re constructing an aggregated proof via
composition of a merkle tree of individual statements (would enable a single
proof in a block for all the statements):
https://github.com/Roasbeef/bip32-pq-zkp/blob/main/docs/claim.md#future-recursive-composition-sketch.

-- Laolu

Olaoluwa Osuntokun

unread,
Apr 9, 2026, 5:02:37 PM (8 days ago) Apr 9
to conduition, Bitcoin Development Mailing List
Hi Conduition,


> You need only prove this much more general statement (2): "I know a BIP32
> xpriv which derives this xpub via one or more hardened steps".

> I'm amending my prior suggestion slightly: The circuit (guest program)
> could take in an xpriv (e.g. at m/86'/0') and output a child xpriv
> (e.g. at m/86'/0'/0') to the journal (instead of outputting a child
> xpub).

That's an excellent insight!

As mentioned in my recent reply, with risc0's "succinct" receipt type, I was
able to get the proof size down to 220 KB, at the cost of 3.5x longer total
proving time.

Your proposal definitely reduces the complexity of the core statement to be
proved, which would speed up the proving time for the normal
default/composite receipt type.

I'll try to hack this up, and then run a head to head comparison to see this
simpler statement actually results in a smaller proof then the final
succinct receipt of either of the proof variants. Based on my current
intuition w.r.t the lower level details, I think the final succinct proof
size would be on the same order of magnitude re size.

However, this can still be a win as then this would provide potential future
users with a less resource intensive proof, which can then be
aggregated/rolled up into a final succinct proof in a batched manner.

This line of optimization is also more interesting if one were to look at
hand rolling a custom AIR to avoid the overhead that the RISC-V emulation
adds to the rirsc0 proof chain, given that it entirely skips doing any EC
operations at all for the final statement.

----

Re the commit/reveal approach, to be honest I'm not fully caught up on that
proposal. That original thread got pretty long, so I dropped of after a
point 😅. I'll revisit that specific branch of the thread so I can digest it
and develop a proper opinion, then get back to you re comparisons!

-- Laolu

Olaoluwa Osuntokun

unread,
Apr 9, 2026, 5:56:07 PM (8 days ago) Apr 9
to conduition, Bitcoin Development Mailing List
Hi Condution, 

So I implemented both variants of your idea. My intuition was right in that it
doesn't do much to reduce the size of the final succinct size, but the final
xpriv variant resulted in a significant reduction in both proving time, and
also memory usage. I also re-ran the original succint proof for the original
Taproot claim and got a better value for the final proof time (def need a
better benchmark env+set up!).

Here's a breakdown of the resource requirements for the various proofs:
  * Full Taproot
    image ID:
      8a6a2c27dd54d8fa0f99a332b57cb105f88472d977c84bfac077cbe70907a690
    composite:
      seal 1797880
      prove 49.32s
      verify 0.10s
      peak RSS 11907399680
    succinct:
      seal 222668
      prove 64.30s
      verify 0.03s
      peak RSS 11927207936

  * Hardened xpub
    image ID:
    ad4ebc0ef6ce51e0f581cc8d14742a5b97738e9decd3fe2b0f1746de5bad9617
    composite:
      seal 513680
      prove 14.63s
      verify 0.04s
      peak RSS 11783503872
    succinct:
      seal 222668
      prove 17.29s
      verify 0.02s
      peak RSS 11782307840

  * Hardened xpriv
    image ID:
      8401a36e4f54cb2beaf9ac7677603806cf9d775e90ef5a70168045a3c0df0849
    composite:
      seal 234568
      prove 1.98s
      verify 0.02s
      peak RSS 3144171520
    succinct:
      seal 222668
      prove 2.84s
      verify 0.02s
      peak RSS 3145990144

So we can see that the succinct proof sizes are all about the same. However the
xpriv variant can be proved directly in just 2 seconds on my machine! It also
requires just 3 GB of memory for the proof as well.

I've created some additional supporting documentation to detail exactly what
the new proofs do and their results:

  * https://github.com/Roasbeef/bip32-pq-zkp/blob/main/docs/reduced-variants.md

  * https://github.com/Roasbeef/bip32-pq-zkp/blob/1c89fdb398180a2b3eff7761b7f4b233d455c6c9/README.md#reduced-proof-variants

  * https://github.com/Roasbeef/bip32-pq-zkp/blob/438c548ca9b49d83ef4019974a5171f5e06fa840/docs/claim.md#reduced-variant-claims


Once again, thanks for the great ideas! I wonder if we can improve on this
round of proof golf further before reaching down a lower level with some sort
of AIR compiler 🤔.

-- Laolu

conduition

unread,
Apr 10, 2026, 1:47:09 PM (7 days ago) Apr 10
to Olaoluwa Osuntokun, Bitcoin Development Mailing List
Ah! Amazing work! 2 seconds to prove is really crazy. Proving a single SHA256 and one modular addition on my CPU back in the day took like 20 seconds. Your GPU is coming in clutch for this. I best RISC0 has also improved quite a bit since then.

I think the next optimization step would be pre-seeding the two SHA512 midstates from the host, so you only need to prove two SHA512 compression calls instead of four. Intuitively I expect this would at best halve your prover time from 2sec, to probably a little over 1sec, and your verifier time will probably drop as well since that also seems to scale with circuit complexity. 

I think I have two half-decent arguments now as to why this won't affect security: 

First, even if a fraudulent prover is handed the correct midstates to use, the prover would still have to do the hard work of finding the parent secret key needed as a witness. This is at least the same difficulty as finding the parent sk​​ if we just hashed it without a chaincode at all, using two bare SHA512 calls - the only thing that changes is the midstate, and the SHA512 input length suffix. Starting from a different midstate doesn't magically give the attacker a head-start in a 256-bit search space looking for sk​. A frauduent prover would know the child secret key k = sk + int(I[32:]) % n​​, but they don't know int(I[32:]) or sk​ so they cannot solve for either.

Nominally, the fraudulent prover wouldn't even know the correct midstates, so their task is strictly harder.

Secondly, here's another argument as to why finding the midstates in the first place should also be hard. 

Any adversary who could solve this problem by finding the right midstates could be used as an oracle to prove the existence of partial 2-cycles in SHA512. 

  • Given a SHA512 hash I​​, set sk = int(I[32:])​​​
  • Compute k = sk + sk % n​
  • Use the black-box fraudulent prover on the child key k​​ to find correct midstates such that 

I == SHA512(<something> || SHA512(<something> || 0x00 || sk || i))​​​
k == int(I[32:]) + sk % n​
  == sk + sk % n​​

Remember that sk = int(I[32:])​​. Thus for these conditions to hold, the proof forger must be able to find not just the correct midstates, but also midstates which give a 2-stage partial hash cycle so that:

I == SHA512(<something> || SHA512(<something> || 0x00 || I[32:] || i))​

This seems unlikely or at least very difficult.

regards,
conduition
publickey - conduition@proton.me - 0x474891AD.asc
signature.asc

Alex

unread,
Apr 13, 2026, 3:21:46 PM (4 days ago) Apr 13
to Bitcoin Development Mailing List
I guess my first response got moderated because I was too negative, so here is the retry:

* 1.7 MB is not realistically useful for anything. It's so big that, even the biggest PQC signatures are orders of magnitude smaller and if you do some basic napkin math you can see that it's realistically never going to be useful (it would takes decades for people to migrate and it would cost you more than your entire portfolio to move the funds). Therefore it is realistically unusable.
* The sheer complexity of ZK-STARK proofs and their multi-second verification cost is practically never going to land in Bitcoin - for reference - just look at OP_CAT, a 12 line addition that has been stuck in review for years. Realistically, a multi-million line VM tied to a specific CPU architecture is absolutely never going to land in Bitcoin just from its sheer complexity and attack surface it introduces. Therefore it is nothing more than esoteric make belief, if we are being realistic.

For the layman, it is also a lazy way to offload the PQC problem to the future "ah but we just put a ZK proof and we're done" as if ZK proofs are readily available commodities - they are not. It's not that simple, for the complexity reasons above. We need slim security assumptions, such as the SHRINCS hash based signatures and we need them now, so that we can migrate billions of ETF funds already.

ZK-STARKs are the new "silver bullet" but realistically, Bitcoin is so ossified (as evident by the multi-year delay for OP_CAT) that discussions regarding it is, well, esoteric distractions.

Also it makes no sense to deny SQIsign based on "it's too complex, we don't know if the security assumptions hold" and then with the other hand argue that "let's shove this multi million line CPU VM into Bitcoin and put 1.7 MB proofs that take multiple seconds to verify (that alone is a MAJOR DOS attack surface)

You get my point? It's so easy to say "ZK this, ZK that" but the sheer massive complexity it brings is a gaping flesh wound in terms of attack surface.

sadiq Ismail

unread,
Apr 13, 2026, 3:21:47 PM (4 days ago) Apr 13
to Bitcoin Development Mailing List
Hi Laolu, list,

Nice work.

The scheme extends to BIP-352 (Silent Payments). The BIP-352 receiver reconstructs the output P using their private scan key, public spend key, and public information from the spending transaction A.
See BIP-352 Scanning. BIP-352 recommends but does not mandate BIP-32 for deriving the scan and spend keys, but specifies the following derivation paths when BIP-32 is used:

    b_scan  = BIP32Derive(s, m/352'/coin_type'/account'/1'/0)
    b_spend = BIP32Derive(s, m/352'/coin_type'/account'/0'/0)

For all silent payment addresses generated using BIP-32, your technique applies. The prover produces a zk-STARK proof that the program BIP32Derive(s, p_scan) and BIP32Derive(s, p_spend) were run correctly,
and that the resulting keys reconstruct the on-chain output P using along with A. 

As you highlighted txid is not committed in the proof currently, the argument is replayable. The current POC does not bind to where the coins go. Anyone who observes the chain could copy it and attach it to a different transaction, spending the same UTXO to a different address. Worse for silent payment, because all user UTXO have the same secret BIP32Derive(s, p_scan) and BIP32Derive(s, p_spend) except for A.
The zk-STARK proof? or this mechanism should definitely be bound to the spending transaction and the input being spent. 

Curious why not generalise beyond BIP-32? P2PKH and P2WPKH without BIP-32 still commit to an unrevealed secret — HASH160(k·G) — as long as the pubkey has never previously appeared on-chain. A zk-STARK proof should apply here too? The prover argues for the correctness of HASH160(k·G) = h, where k is the private key scalar and k·G is the elliptic-curve point, without ever revealing k or the pubkey. This would allow recovery of a broader set of funds. If it were decided that classical signatures for these output types are invalidated and only a valid zk-STARK proof is required to spend, anyone who holds the original secret can unlock their funds. 

P.S. I am not for or against disabling valid spend paths post-quantum, just discussing the technical possibilities.

Best,
Abubakar Sadiq

conduition

unread,
Apr 14, 2026, 11:54:51 AM (3 days ago) Apr 14
to sadiq Ismail, Bitcoin Development Mailing List
Hi Abubakar,

Awesome point! A single STARK could be used to prove you knew an xpriv which derives the xpriv at m/352'/coin'/account'​. Then any verifier could easily derive the private scan/spend keys m/352'/coin'/account'/0'/0​ or m/352'/coin'/account'/1'/0​, then replicate the ECDH process (with some BIP352-specific tweaks that the spender can provide), and add the result together. This reproduces the private key of the SP taproot output being spent, proving the spender is authentic.

With proper labeling of input metadata, this procedure could cover every UTXO in a silent payment wallet, because the scan/spend keys are common to every derived output-specific address.

Curious why not generalise beyond BIP-32? P2PKH and P2WPKH without BIP-32 still commit to an unrevealed secret — HASH160(k·G) — as long as the pubkey has never previously appeared on-chain. A zk-STARK proof should apply here too? The prover argues for the correctness of HASH160(k·G) = h, where k is the private key scalar and k·G is the elliptic-curve point, without ever revealing k or the pubkey. This would allow recovery of a broader set of funds.

I agree this type of thing would also be useful, but it is harder to argue security for this than for BIP32 xpriv ownership.

Typically, xpubs and xprivs are not shared on-chain, and wallets usually treat them with greater care since xpubs compromise privacy, and xprivs compromise security.

Hashed address pubkeys on the other hand are commonly shared on-chain and almost never treated as sensitive. So it seems likely that if we DO permit spending using a ZK proof of knowledge of k​ such that HASH160(k*G) = h​, then some non-negligible fraction of those coins will still be vulnerable to a CRQC because k*G​ may be known to the attacker, which to a CRQC is equivalent to knowing k​. 

To mitigate, a verifier would need to reject such proofs when someone tries to spend coins locked to such an "exposed pubkey" address. Since bitcoin nodes do not maintain an address index, they don't intrinsically know which addresses have exposed pubkeys or not. We would have to compose a list of such addresses, and since that list changes over time, it would need to be updated by nodes in real-time when indexing transactions. This list will almost certainly be incomplete, because we don't have any record of pubkeys exposed off-chain (e.g. by hardware wallets, by xpubs shared in multi-party protocols, by lost TXs in orphaned blocks).

This idea also collides very unfortunately with the BIP32 xpriv proof of knowledge. With the faster proof style i suggested to modify Laolu's original one, a spender immediately gives up knowledge of their account-level xpriv to the CRQC when publishing a TX. Even if the spender was using a hashed-address with a hidden pubkey, the CRQC now knows the secret key k​ for that address, and could use it to forge a proof that h = HASH160(k*G)​ to attempt to double-spend/RBF the authentic transaction.

So we can't just add hashed-address proofs alongside BIP32 xpriv proofs. They'd have to be mutually exclusive: A verifier would accept a BIP32 xpriv proof if and only if the address is on the "exposed pubkey" list. The verifier would accept a hashed-address proof if and only if the address is NOT on the "exposed pubkey" list. It's definitely feasible, it's just hard to do both safely.

This is also difficult to motivate, as we lack hard statistics about the relevant portion of the Bitcoin supply that we could use hashed-address proofs (but not BIP32 proofs) to rescue: Those UTXOs which fall into the venn-diagram overlap of "Not using BIP32" and "Not exposed-pubkey". BIP32 was introduced in 2012, whereas P2PKH was introduced in 2009, so clearly there must be some overlap, but how much? I suppose one could estimate by indexing all the P2PKH UTXOs received before BIP32 was published, and counting what portion of them still have (probably) hidden public keys. This would be useful research for anyone with the time and a working node.

regards,
conduition
publickey - conduition@proton.me - 0x474891AD.asc
signature.asc

Boris Nagaev

unread,
Apr 15, 2026, 5:35:13 PM (2 days ago) Apr 15
to Bitcoin Development Mailing List
Hi Laolu, Abubakar, Conduition, list,

Does this idea extend to MuSig2 or FROST outputs, assuming the relevant parties cooperate during proving (or use some MPC) and collectively know the underlying seeds / secret material?

For MuSig2, I can imagine a proof that each participant key came from a BIP32 seed/path.
For FROST, I am less sure what the analogous proof statement would be.

At least to me, MuSig2 seems like it may be within reach. I would be very interested if someone has a concrete sketch in mind.

Best,
Boris

conduition

unread,
1:56 PM (1 hour ago) 1:56 PM
to Boris Nagaev, Bitcoin Development Mailing List
Hey Boris, good to hear from you! Great questions.

I think MuSig2 might actually be possible to rescue even without BIP32, since the MuSig2 key aggregation scheme is a one-way function which a CRQC cannot reverse. So you don't even need a BIP32 proof to rescue it, technically, though as we'll see BIP32 is still desirable if we can use it.

A MuSig2 key is kind of like a hashed address. If EC spending is disabled, then you just need to prove knowledge of the secret keys which reproduce the address. 

Example aggregation of a MuSig2 key X​ using two peer pubkeys A and B:

A = a * G
B = b * G
X = A * int(H(A, A || B)) + B * int(H(B, A || B))​

If you can prove knowledge of a​ and b​ in a way that also commits that proof to a spending transaction, then you can spend X​ but a CRQC cannot.

For example:

1. With a ZK-STARK, the MuSig peers could cooperate to prove they know a correct (a, b)​ pair which produce pubkeys (A, B)​ such that:

x = a * int(H(A, A || B)) + b * int(H(B, A || B))​

...where the group secret key x​ is written to the public output of the proof. A verifier could then recompute X = x * G​ outside the circuit to validate a spend. Creating this proof demands we arithmetize two curve point multiplications and two hash function invocations, i.e. one mult and one hash per MuSig2 participant. Another downside is this would require participants to trust each other and share private keys (or use MPC to build the proof).

2. Alternatively, we can sacrifice performance to reduce trust. Prove you know a correct set of pubkeys (A, B)​ such that they recompute X​ using MuSig key aggregation, and then prove you know a BIP340 signature from X​ on a given transaction. This is similar to what Kurbatov did in this recent post exploring STARKs for rescuing hashed addresses without the need to export secret keys from secure signing devices. This approach would let you prove a similar statement without the need to share privkeys between peers, at the cost of one extra point multiplication and an extra hash invocation - These are needed to verify a Schnorr signature in the circuit.

Both proofs are hard for an external QC to forge without knowing A​ and B​, and neither can be forged by non-quantum peers in the MuSig group. Though, if there is a quantum adversary in the MuSig group, the group is screwed in either case. It'll still be very expensive to generate either proof, and circuit complexity will scale linearly with the number of MuSig2 signers. I think commit/reveal would be a better choice in this situation.

But... if the MuSig2 peer pubkeys A​ and B​ were derived using a hardened BIP32 step or some other hash-based algorithm though, then we could reuse Laolu's techniques to create a BIP32 proof which is secure even against Quantum attackers inside the MuSig group. 

Peers could provide one BIP32 xpriv proof showing hardened derivation for each contributory secret key a​ and b​, and then let the verifier recompute A, B and X​ in software. The resulting circuit would be much faster and more efficient, because you only need to arithmetize one HMAC call per peer; no point multiplication needed. A quantum attacker in the MuSig group cannot forge any of their peers' proofs either, since they do not know the preimages their peers used to generate their keys. Though you'd need one separate proof for every signer in the MuSig group, so it'd probably be wise to aggregate the proofs together, especially for larger groups.

------------

As for FROST, I think that will be a harder problem. It depends on how the threshold master key was constructed: Trusted dealer, or distributed keygen (DKG).

With a trusted dealer, then it depends how the dealer generated the master key. If the dealer used BIP32 and still have the seed, then peers could ask the dealer to follow Laolu's techniques. If the dealer discarded the seed or if they didn't use some quantum-hard function on a (remembered) secret input to generate the key, then I think hope is lost.

If the FROST key was generated using a DKG protocol, then unfortunately I don't think there is any quantum-hard asymmetry we can rely on there at all. FROST DKG doesn't use hash functions, it is all algebraic. A valid DKG execution for any secp256k1 point can be easily simulated by a quantum attacker, since they can compute the FROST group secret key and then break it up into valid DKG shares. It would maybe be possible to rescue if ALL the DKG participants generate their shares using a deterministic hash of secret input, because that'd be impossible for a QC to simulate and forge. But IDK of any FROST implementations which do it that way.


regards,
conduition
publickey - conduition@proton.me - 0x474891AD.asc
signature.asc
Reply all
Reply to author
Forward
0 new messages