Difficulty of classical supercomputers breaking "128"-bit asymmetric cryptography


John Mattsson

Nov 20, 2023, 3:51:54 AM
to pqc-...@list.nist.gov

Hi,

 

The Japanese CRYPTREC published some great graphs [1-2] on the difficulty of integer factorization and ECDLP using supercomputers from the TOP500 list. The graphs have 30 years of data on supercomputer performance with trend lines, factorization and ECDLP records, and the estimated complexity required to break RSA-3072 and P-256 in one year. Extrapolating (and assuming Moore's law does not slow down over the next 70 years), the estimate is that it will take until around 2090 before a P-256 or RSA-3072 key can be recovered using one year of processing time on the world's fastest classical supercomputer.
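As a Python sketch of that extrapolation (my constants for Frontier's throughput and the per-group-operation cost are guesses, so the exact year should not be taken seriously; the point is that all reasonable choices land many decades away, in the same late-century ballpark as [1-2]):

    import math
    # Illustrative constants, not values from [1-2].
    FLOPS        = 1.1e18                 # Frontier Rmax, ~1.1 exaFLOPS
    YEAR         = 3.15e7                 # seconds per year
    FLOPS_PER_OP = 2**11                  # assumed cost of one P-256 group op

    rho_ops  = math.sqrt(math.pi * 2**256 / 4)   # Pollard's rho on P-256
    gap_bits = math.log2(rho_ops * FLOPS_PER_OP / (FLOPS * YEAR))
    for doubling_years in (1.2, 1.5, 2.0):
        print(f"doubling every {doubling_years} y: "
              f"feasible ~{2023 + gap_bits * doubling_years:.0f}")
    # -> roughly 2088 / 2104 / 2131 depending on the doubling period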

 

[Figure: CRYPTREC graph from [1] of supercomputer performance versus factorization/ECDLP records]

 

I think the right practical question to ask is whether finding a private key is worth the cost, not whether a nation state with a Manhattan- or Apollo-sized project could do it. The currently fastest supercomputer, Frontier, is estimated to have cost US$600 million and to have a yearly energy cost of US$20 million. Assuming a lifetime of 5 years, the cost of one year of operation is US$140 million. Very few keys, if any, are worth that. Multi-key attacks that find one of many keys are typically much less interesting for an attacker. Multi-key attacks that can find all keys, like [3], are however very interesting for attackers, but they are at least as hard as finding a single key.
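The per-year figure, spelled out:

    capex, energy, years = 600e6, 20e6, 5     # US$, US$/year, assumed lifetime
    per_year = capex / years + energy
    print(f"one year of Frontier: ${per_year / 1e6:.0f} million")   # $140 million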

 

My understanding is that lattice sieving attacks on ML-KEM-512 have time complexity comparable to Pollard's rho on P-256 but much larger memory complexity (and memory is a more expensive and limiting resource). Given the exponential memory requirements of current lattice attack algorithms, ML-KEM-512 seems more costly to break than P-256 using classical supercomputers. Grover's algorithm with its quadratic speedup will not provide any practical quantum advantage, as at least cubic or quartic speedups are required for that [4-5].

 

Assuming an extreme case where the above calculations overestimate the real-world price-performance ratio by five orders of magnitude compared to proprietary ASICs (as Bernstein et al. estimate for AES-128 [6]) only cuts around two decades from the above numbers.
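The "two decades" arithmetic, as a sketch (the doubling periods are assumptions):

    import math
    gap_bits = math.log2(1e5)                 # ~16.6 bits of price-performance
    for doubling_years in (1.2, 1.5):
        print(f"{gap_bits * doubling_years:.0f} years earlier")   # 20 / 25 years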

 

Given this, I see little reason at all to be worried about the practical security of ML-KEM-512. ML-KEM-512 seems very appropriate for most use cases where performance is important. I am sure we will see smaller optimizations in lattice attack algorithms, but overall they seem to be relatively stable [7].

 

[1] https://www.cryptrec.go.jp/symposium/2023_cryptrec-eval.pdf

 

[2] https://events.btq.li/Japan_CRYPTREC_Activities_on_PQC_Shiho_Moriai.pdf

 

[3] https://weakdh.org/imperfect-forward-secrecy-ccs15.pdf

 

[4] https://cacm.acm.org/magazines/2023/5/272276-disentangling-hype-from-practicality-on-realistically-achieving-quantum-advantage/fulltext

 

[5] https://arxiv.org/pdf/2011.04149.pdf

 

[6] https://cat.cr.yp.to/cryptattacktester-20231020.pdf#subsection.F.11

 

[7] https://eprint.iacr.org/2021/785

 

Cheers,

John Preuß Mattsson

 

 

D. J. Bernstein wrote (in another thread):

https://groups.google.com/a/list.nist.gov/g/pqc-forum/c/W2VOzy0wz_E/m/0P1em5h0BAAJ

 

>For translating bit operations into dollars, a reasonable starting point
>is Bitcoin, which did roughly 2^111 bit operations last year. That's the
>equivalent of a few billion dollars of mass-market Bitcoin-mining
>machines running for a year. A large-scale attacker will be able to
>reach similar economies of scale and has a much larger yearly budget.
>
>Recovering an AES-128 key, out of a billion target keys encrypting a
>known plaintext, takes roughly that number of bit operations, so it's
>feasible for a large-scale attacker.
>
>(The attacker obviously won't use commercial CPUs or FPGAs for such a
>large computation. See the five-order-of-magnitude gap analyzed in
>https://cat.cr.yp.to/cryptattacktester-20231020.pdf#subsection.F.11.)
>
>Recovering a Curve25519 key, out of any number of target keys, takes
>about 2^145 bit operations: i.e., billions of times more difficult. This
>_does not_ mean that an attacker with billions of dollars to spend won't
>be able to break Curve25519 for billions of years: computer technology
>is continuing to become more and more cost-effective, and I wouldn't bet
>against this giving another 2^30 within the foreseeable future. But
>quantum computers look like they'll easily win this race.
>
>(One can always find optimists claiming that chip improvement has come
>to an end and that quantum computing will fail. However, risk management
>requires looking at the bad scenarios. This is why we're working on
>post-quantum cryptography _before_ seeing demonstrations of anything
>useful being accomplished by quantum computers.)
>
>For Kyber-512, the latest Kyber documentation had a security analysis
>from 2020 ending up with a very wide uncertainty range for the costs of
>high-probability single-target "primal" attacks: between 2^135 and 2^165
>bit operations, not counting costs of memory access. Optimists then say
>that
>
>* there won't be improved single-target attack algorithms,
>* multi-target attacks won't do better than single-target attacks,
>* attacks against Kyber won't do better than these MLWE/MLWR attacks,
>* the costs of memory access will add a lot to the 135, and
>* 135, as the low end, has negligible chance of occurring anyway.
>
>However:
>
>* There are already undisputed improvements (plus some disputed
>improvements), the easiest being clumping, which by itself moves
>the entire [135,165] range down by nearly 10 bits. The issue here
>isn't simply that the range for known attacks needs to be updated;
>the big issue is that lattice attacks are complicated and haven't
>stabilized. This needs to be factored into risk analyses.
>
>* There's an ACM CCS 2021 paper briefly outlining a proof that
>one-out-of-many attacks for MLWE are as hard as single-target
>attacks, but the proof has a flaw pointed out and exploited in
>https://cr.yp.to/papers.html#lprrr. The main analysis in the latter
>paper is asymptotic; quantifying the concrete impact on Kyber-512
>will take more research. This is also something where better
>attacks should be expected---within limits, since a 2^40-target
>attack can't drop security levels by more than 40 bits, but those
>limits aren't comforting for Kyber-512.
>
>* Even if QROM IND-CCA2 security is assumed to cover all IND-CCA2
>attacks (which isn't necessarily the case, as discussions of a
>recent NSA proposal illustrate), current proofs don't rule out the
>possibility of Kyber's QROM IND-CCA2 security being much easier to
>break than the underlying MLWE/MLWR problems. The Kyber security
>analysis relies on hope that this proof gap doesn't matter, rather
>than on cryptanalysis.
>
>* The only quantified claim regarding the costs of memory access in
>Kyber-512 attacks comes from NIST incorrectly multiplying the cost
>of each memory access by the number of bit operations in an attack.
>A correct analysis instead multiplies the cost of each memory
>access by the number of memory accesses (a number that doesn't
>appear in NIST's analysis), and searches for attacks optimizing the
>total cost. This is something else where more research is required.
>
>* I'm not aware of attempts to quantify probabilities for the
>components of the uncertainty range in the Kyber documentation, so
>there's no way to derive a quantitative conclusion regarding the
>probability of the low end occurring---never mind the question of
>whether a 25% or 10% or 5% chance of disaster is acceptable.
>
>Some of the alarm bells going off here are already recognized in NIST's
>general statements regarding procedures. For example, NIST said that
>breaking a single AES-128 key (this is closer to the Curve25519 example
>than to the one-out-of-a-billion-AES-128-keys example) is a "floor" for
>security, said that a replacement for gate counts "must at minimum"
>ensure (among other things) that "all known attacks have been optimized
>with respect to the proposed metric", and said that "submitters of
>algorithms where the complexity of the best known attack has recently
>decreased significantly, or is otherwise poorly understood, should be
>especially conservative" in "category" assignments.

D. J. Bernstein

Nov 20, 2023, 1:17:56 PM
to pqc-...@list.nist.gov
'John Mattsson' via pqc-forum writes:
> I think the right practical question to ask is if it is worth finding
> a private key, not if a nation state with a Manhattan/Apollo sized
> project could. The currently fastest supercomputer Frontier is
> estimated to have cost US$600 million and have yearly energy cost of
> US$20 million. Assuming a lifetime of 5 years, the cost for one year
> of operation is US$140 million. Very few, if any, keys are worth that.
> Multi-key attacks finding one of many keys are typically much less
> interesting for an attacker. Multi-key attacks that can find all keys
> like [3] are however very interesting for attackers but are at least
> as hard as finding a single key.

So it's okay if an attacker can break all keys for just $700 million?
Nothing in the above paragraph contradicts this possibility.

See https://blog.cr.yp.to/20151120-batchattacks.html for analysis of
how the "attacker economist" approach goes wrong. It's instructive to
look back at how RSA Labs was arguing in 2000 that RSA-768 was safe, and
then re-read current arguments for the safety of Kyber-512.

> Grover's algorithm with its quadratic (n2) speedup will not provide
> any practical quantum advantage

It's certainly possible to take particular predictions regarding

* quantum cycle times (compared to attacker patience),
* quantum error correction, and
* overall quantum-computer manufacturing costs

to conclude that depth is limited to roughly 2^40 iterations of a
typical inner loop while each qubit operation (times reversibility
overhead) costs roughly 2^40 bit operations, implying that Grover's
algorithm isn't useful. But this is a fragile conclusion and could
easily be wrong.
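To see how knife-edge this is, here is that cost model as back-of-envelope Python; the 2^40 depth limit and 2^40 bit-operations-per-qubit-operation are the assumptions just stated, the AES circuit size is my own rough guess, and the depth-D-covers-D^2-keys rule is the standard Grover parallelization trade-off:

    import math
    # All constants are assumptions from the surrounding discussion.
    KEYSPACE     = 2**128   # AES-128
    MAXDEPTH     = 2**40    # serial Grover iterations per machine
    QUBITOP_COST = 2**40    # bit operations per qubit operation
    AES_CIRCUIT  = 2**15    # rough gate count per AES evaluation

    # Grover parallelizes badly: a depth-D machine covers only ~D^2 keys.
    machines    = KEYSPACE / MAXDEPTH**2               # 2^48 machines
    grover_ops  = machines * MAXDEPTH * AES_CIRCUIT    # total qubit operations
    grover_cost = grover_ops * QUBITOP_COST
    classical   = KEYSPACE * AES_CIRCUIT               # brute-force bit ops

    print(f"Grover:    ~2^{math.log2(grover_cost):.0f} bit-op equivalents")
    print(f"classical: ~2^{math.log2(classical):.0f} bit operations")
    # Both come out ~2^143: small changes in the assumptions flip the conclusion.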

NIST has stated conclusions on both sides of this. For example:

* In 2017, NIST wrote "even if we assume the sort of quantum
technology often suggested to be possible in 15 years (e.g. ~1GW
power requirement and a few hours to factor a 2048 bit number),
current technology can still do brute force search cheaper than
Grover's algorithm".

* In 2022, NIST commented on consequences of "the likely scenario
where the limiting attack on AES128 is Grover's algorithm".

There hasn't been any transparency regarding NIST's rationale for this
"likely" statement. I don't see how one can reasonably decide which of
the possibilities is more likely given the available literature.

> My understanding is that lattice sieving attacks on ML-KEM-512 have
> time complexity comparable to Pollard's Rho on P-256

No. The uncertainty interval is much wider than that, allowing much
smaller and much larger possibilities. The latest Kyber documentation
said between 2^135 and 2^165 bit operations; this hasn't been updated
for newer attacks. (P-256 is roughly 2^147, noticeably above NIST's
AES-128 baseline, but of course it's vulnerable to quantum attacks.
Bitcoin did 2^111 bit operations last year.)
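A rough consistency check of those two numbers (the per-hash and per-group-operation bit-operation costs are my assumptions):

    import math
    hashrate = 400e18                     # ~Bitcoin network hashes/second, 2023
    year     = 3.15e7                     # seconds
    print(f"Bitcoin/year: ~2^{math.log2(hashrate * year * 2**18):.0f} bit ops")
    # -> ~2^111, assuming ~2^18 bit operations per double SHA-256

    rho_ops = math.sqrt(math.pi * 2**256 / 4)   # Pollard's rho on P-256
    print(f"P-256: ~2^{math.log2(rho_ops * 2**19):.0f} bit ops")
    # -> ~2^147, assuming ~2^19 bit operations per point addition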

> but much larger
> memory complexity (which is a more expensive and limiting resource).

There are lower-memory options. https://eprint.iacr.org/2020/1260
estimated that a quantum version of its enumeration algorithm would be
faster than quantum sieving for the sizes we're talking about here. The
first concrete analysis of the cost of the main quantum subroutine in
this attack is https://eprint.iacr.org/2023/1623.

For most of the competition, NIST had discouraged research into these
options:

* NIST specified a minimum security level that, for cryptanalysts,
looks at least as hard to break with quantum attacks (basically,
have to beat 2^80 qubit operations, modulo depth issues) as with
non-quantum attacks (merely have to beat 2^143 bit operations).

* NIST emphasized a "classical gates" cost metric that ignores
memory-access costs, and set as-far-as-we-know-impossible-to-meet
requirements for any proposed replacement metric.

* NIST criticized the occasional submissions that quantitatively
estimated memory-access costs.

Late last year, NIST said that Kyber-512 is "unlikely" to be easier to
break than AES-128 if one switches from "classical gates" to accounting
for memory-access costs. A cryptanalyst writing a new paper saying NIST
overestimated attack costs for Kyber-512 will be faced with reviewers
asking obvious questions:

* Is the metric used for this paper really the metric NIST used?

* Are the attack costs in this paper really better than what NIST was
claiming?

* Are lower attack costs surprising, given that NIST wasn't denying
lower attack costs but merely saying that beating AES-128 was
"unlikely"?

Even if a speedup is big enough by itself to disprove any reasonable
interpretation of NIST's security claims, being forced to write up
analyses of many different interpretations is annoying and complicated.
Most cryptanalysts won't even bother attacking ill-defined problems
(other targets are much more fun!), and people who do attack those
problems end up spending a lot of time bulletproofing their work in a
way that would have been unnecessary if the problems had been properly
defined in the first place. The end result is slower progress, making it
even more risky to rely on the claimed hardness of these problems.

For comparison, the official call for submissions said "All submitters
are advised to be somewhat conservative in assigning parameters to a
given category, but submitters of algorithms where the complexity of the
best known attack has recently decreased significantly, or is otherwise
poorly understood, should be especially conservative."

> I am sure we will see smaller optimizations in lattice attack
> algorithms but overall, they seem to be relatively stable [7].

Smaller than what? Relative to what? If there's a claim that attacks
aren't improving enough to threaten the Kyber-512 security margin, then
where's the quantification of the security margin and the improvements?

[7] is a 2021 paper claiming asymptotic lower bounds for a fragment of
one attack strategy. The paper says nothing about other layers of the
attack, says nothing about other attacks (e.g., tuple lattice sieving),
and says nothing about concrete sizes such as Kyber-512.

---D. J. Bernstein

John Mattsson

Nov 21, 2023, 12:08:01 PM
to D. J. Bernstein, pqc-...@list.nist.gov

D. J. Bernstein wrote:

>So it's okay if an attacker can break all keys for just $700 million?

 

I don't think that is okay if the algorithm is used globally, like FFDH-1024 was. The largest security agencies could consider $700 million worth it, but for ML-KEM-512 we are very far from that. Today it would be completely infeasible to break a single ML-KEM-512 key even if you spent much more than $700 million, and as far as I'm aware there are no lattice attacks that can recover all the keys. I would love to see more precomputation research like [1] for lattices, codes, multivariate systems, hash-based schemes, and isogenies. When these kinds of attacks are possible you clearly need a larger security margin, and maybe a good conservative approach is to assume that finding all the keys is as easy as finding a single key unless there are strong reasons to believe otherwise.

FFDH-1024 was clearly used for too long. But the main problem was not the NIST/BSI/ANSSI requirements; it was that people did not follow them. NIST only allowed 80-bit crypto until 2010 minus the lifetime the data needed protection. NIST's requirement to only allow 112-bit crypto until 2030 minus the lifetime the data needs protection is much more conservative. But given that many people seem to ignore the lifetime part of NIST's requirements, maybe simpler requirements, like BSI's "do not use 112-bit crypto after 2022", are better. That should give 40 years of secrecy. But simple requirements create problems as well: I think RSA-2048 is fine to use for web server authentication beyond 2022.

 

>It's instructive to

>look back at how RSA Labs was arguing in 2000 that RSA-768 was safe, and

>then re-read current arguments for the safety of Kyber-512.

 

In addition to promoting too-short key lengths, RSA has done other bad things, like spreading doubt about ECC for business purposes [2] and implementing Dual_EC_DRBG for profit [3]. RSA-768 was already possible to break on the fastest commercial supercomputers in 2000 [4]. Even at 2^135 operations, assuming Moore's law continues, and ignoring memory requirements, ML-KEM-512 will not be breakable on commercial supercomputers until around 2080. I don't see the connection. I think it might be more instructive to look back at the SHA-3 competition, where NIST ended up standardizing the fixed-length SHA-3 functions with meaninglessly high preimage security, significantly hurting their performance and leading to them not being deployed.

 

>See https://blog.cr.yp.to/20151120-batchattacks.html for analysis of

>how the "attacker economist" approach goes wrong.

 

I think most approaches go wrong from time to time; the main problem is not using any approach at all. You can also argue that the "theoretical cryptographer" approach has gone wrong in the past, with slower performance, lower security, increased cost, and increased CO2 emissions as a result. Upgrading existing systems can be very costly, and using slower algorithms than needed at large scale leads to significant energy use and CO2 emissions. The unfounded conspiracy theories about backdoors in the NIST P-curves led many people to doubt ECC and stay on FFDH and RSA. I also know of developers that disabled P-256 after reading https://safecurves.cr.yp.to/. As they did not have Curve25519 implemented, the result was that they only supported older FFDH-based key exchange with much lower security than P-256.

 

Following the same conservative approach as for 112-bit crypto, you can use 128-bit crypto until at least 2060. The practically weakest part of 128-bit crypto is, as you write, multi-key attacks against AES-128, but I don't think multi-key attacks finding one of many keys are very interesting for practical attackers, and best practice today is to use randomness in the nonce to increase the complexity of multi-key attacks. The nonces in AES-128-GCM in TLS 1.3, for example, use 96 bits of randomness, which is great. I think that should be the recommendation, rather than to stop using AES-128. But arguing that AES-128 should be phased out because of classical attacks, like you do in [5], makes a lot more sense to me than arguing that it should be phased out because of Grover's algorithm.

 

Talking about cost, there is a large difference between existing systems and new systems; NIST typically does not make any distinction between the two. In the mobile industry, for example, 6G will use 256-bit keys, but 4G and 5G will likely continue to live for decades with 128-bit keys, which I think is perfectly fine.

 

>> but much larger
>> memory complexity (which is a more expensive and limiting resource).
>
>There are lower-memory options. https://eprint.iacr.org/2020/1260
>estimated that a quantum version of its enumeration algorithm would be
>faster than quantum sieving for the sizes we're talking about here. The
>first concrete analysis of the cost of the main quantum subroutine in
>this attack is https://eprint.iacr.org/2023/1623.

Thanks for the links. I have not read them yet, but I assume they require much larger quantum computers than the ones breaking RSA and ECC. It would be very nice to have classical supercomputer graphs like [4] that take memory into account. For many attacks memory is not a limiting resource, but for classical attacks on lattice systems my understanding is that memory is likely to be a limiting resource.

Cheers,

John Preuß Mattsson

 

 

[1] https://weakdh.org/imperfect-forward-secrecy-ccs15.pdf

 

[2] https://eprint.iacr.org/2008/390.pdf

 

[3] https://eprint.iacr.org/2015/767.pdf

 

[4] https://www.cryptrec.go.jp/symposium/2023_cryptrec-eval.pdf

 

[5] https://blog.cr.yp.to/20151120-batchattacks.html


 

D. J. Bernstein

Nov 26, 2023, 10:02:44 AM
to pqc-...@list.nist.gov
'John Mattsson' via pqc-forum writes:
> You can also argue that the “theoretical cryptographer” approach has
> gone wrong in the past with slower performance, lower security,
> increased cost, and increased CO2 emissions as a result. Upgrading
> existing systems can be very costly and using slower algorithms than
> needed on large scale lead to significant energy use and CO2
> emissions.

Are there numbers backing up this claim of significant CO2 emissions?

Encrypting data with an AES-256 circuit at (say) 7nm costs roughly 2^-42
joules per byte. See https://cat.cr.yp.to/cryptattacktester-20231020.pdf
Appendix F.11 for AES-128 estimates; AES-256 is 1.4x slower. Is there a
data-communication mechanism that's anywhere near this efficient?

https://doi.org/10.1111/jiec.12630 estimated about 2^-12 joules per byte
for Internet traffic ("0.06 kWh/GB"), times 2^-1 every two years, so
it's reasonable to estimate 2^-15 joules per byte today.

Short-distance communication is cheaper, but how do 2^-42 joules per
byte end up turning into an environmental issue? Even if there are 2^30
devices each continuously encrypting 2^30 bytes per second (is there any
evidence of this?), all of them together use just 2^18 watts for these
AES-256 circuits. World energy consumption is about 2^45 watts.
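Spelled out in Python (all constants are the ones used in this and the preceding paragraphs):

    import math
    # Power draw of the deliberately extreme encryption scenario above.
    enc_j_per_byte = 2**-42          # AES-256 circuit at 7nm, joules/byte
    net_j_per_byte = 2**-15          # Internet traffic today, joules/byte
    devices, rate  = 2**30, 2**30    # devices x bytes/second each

    enc_watts = devices * rate * enc_j_per_byte
    print(f"encryption: 2^{math.log2(enc_watts):.0f} W; "          # 2^18 W
          f"world energy consumption: ~2^45 W")
    print(f"moving a byte costs ~2^{math.log2(net_j_per_byte / enc_j_per_byte):.0f}x "
          f"more energy than encrypting it")                       # ~2^27x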

Sure, low-cost IoT devices aren't using 7nm, but what's the total volume
of data being encrypted on those devices, and what's the total energy
consumption from this encryption?

An NB-IoT device is designed to last for 10 years on a 5Wh battery,
meaning average power usage of 2^-14 watts, so even a _trillion_ such
devices would have total power accounting for only 2^26 watts (times the
overhead factor for battery production), never mind the question of what
fraction of that is being used for encryption.

If the intent was to say "larger" rather than "slower": Total Internet
traffic is roughly 2^47 bytes per second, so roughly 2^32 watts for the
data transmission. Is there some source of number-of-session tallies
implying that a few kilobytes of extra traffic per session is going to
be an environmental issue?

The average web page is over 2MB. Video is over 2/3 of Internet traffic.
A once-a-minute temperature sensor is sending half a million readings to
its control device every year---and _one_ public-key operation suffices
to protect all of those readings, even with forward secrecy, since a
hash ratchet provides forward secrecy. (Regarding the idea that we don't
really need to protect temperature readings, see the literature on
temperature-based side-channel attacks such as ThermalAttackNet.)
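For concreteness, a minimal hash-ratchet sketch in Python (illustrative only, not any particular protocol): each reading gets a fresh message key, and the chain key that produced it is immediately discarded, so compromising the device later reveals nothing about earlier readings.

    import hashlib

    def ratchet(chain_key: bytes) -> tuple[bytes, bytes]:
        # Derive the next chain key and a one-time message key; the
        # caller then forgets the old chain key (forward secrecy).
        next_key    = hashlib.sha256(b"chain" + chain_key).digest()
        message_key = hashlib.sha256(b"message" + chain_key).digest()
        return next_key, message_key

    chain_key = b"\x00" * 32   # in practice: output of the ONE key exchange
    for reading in range(3):
        chain_key, mk = ratchet(chain_key)
        print(f"reading {reading}: message key {mk.hex()[:16]}...")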

Even if a device needs to set up sessions to a million different servers
every year (is there any evidence of this being a common case?), and
even if there are a billion such devices, what exactly is supposed to be
the problem with adding 2 kilobytes of Internet traffic per session?
That's 2^36 bytes per second: roughly a 1/2^12 fraction of Internet
traffic, roughly a 1/2^25 fraction of world energy consumption.
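As explicit arithmetic (the device and session counts are the hypothetical worst case above):

    import math
    devices           = 1e9      # devices
    sessions_per_year = 1e6      # sessions per device per year
    extra_bytes       = 2e3      # extra handshake traffic per session
    year              = 3.15e7   # seconds
    overhead = devices * sessions_per_year * extra_bytes / year
    print(f"extra traffic: 2^{math.log2(overhead):.0f} B/s "      # ~2^36
          f"out of ~2^47 B/s total Internet traffic")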

If these _are_ issues for the environment, don't we all have a shared
responsibility to move bulk encryption from AES-256 to ChaCha20 (which
uses half as many bit operations despite the higher security margin), to
broadcast each server's public keys so that they're efficiently reused
for ciphertexts from many clients, and to choose Classic McEliece for
post-quantum encryption since it has by far the smallest ciphertexts?

I do agree that upgrades are costly---not on a damage-the-environment
scale; the issue here is _coordinating_ the upgrades among all the
relevant engineers in a way that doesn't break anything. Surely the
smallest number of cryptographic upgrades comes from systematically
selecting boring crypto with high security margins, rather than
following the same practices that led to wide deployment of, and
subsequent upgrades from, low-security systems such as DES, MD5, SHA-1,
RSA-512, RSA-768, RSA-1024, DSA-512, etc.

> best practice today is to use randomness in the nonce to increase the
> complexity of multi-key attacks

That's exactly what NIST recommended in pqc-forum email dated 27 Jul
2017 20:56:50 +0000. The reason this isn't best practice is that it's
addressing just one small piece of the complicated, mostly unstudied
multi-target attack surface. Best practice is instead to presume that
the attacker will gain a factor T for T targets, and move up to higher
security levels to compensate, as recommended in, e.g., the Classic
McEliece submission.
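The "factor T" presumption as arithmetic (a generic heuristic for sizing margins, not a theorem about any particular scheme):

    import math
    # Effective security of a k-bit scheme against a one-out-of-T attack,
    # presuming the attacker gains the full factor T.
    def effective_bits(k: float, targets: float) -> float:
        return k - math.log2(targets)

    print(effective_bits(128, 2**40))   # 88.0: cf. the FrodoKEM example below
    print(effective_bits(256, 2**40))   # 216.0: moving up restores the margin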

Example of the danger: Appendix C of https://cr.yp.to/papers.html#lprrr
pointed out a very easy 2^88-guess attack breaking one out of 2^40
ciphertexts sent to a FrodoKEM-640 public key. (The attack is a special
case of the simplest attack in https://cr.yp.to/papers.html#footloose,
but that paper didn't point out the application to FrodoKEM.) NIST's
recommendation did nothing to stop this attack, whereas systematically
moving up to a higher security level would have stopped it.

Further examples of the danger:

* https://eprint.iacr.org/2021/1351 incorrectly claims to be able to
show, assuming hardness of many-sample MLWE, that one out of many
ciphertexts for "lattice-based schemes" are as hard to break as a
single targeted ciphertext. https://cr.yp.to/papers.html#lprrr,
Appendix A.4, points out an apparently unfixable gap in the proof
sketch from https://eprint.iacr.org/2021/1351.

* The main topic of https://cr.yp.to/papers.html#lprrr is analyzing
the impact of multiple ciphertexts per key upon standard analyses
of primal attacks, assuming existing heuristics. Asymptotically,
there's a security loss. This is something else that isn't stopped
by random nonces.

* For cross-key attacks, look at https://eprint.iacr.org/2019/215
saying "Because of the large pre-processing time, our algorithm
does not provide a concrete attack on those schemes". Such claims
have to be revisited when there are many targets: all targets use a
single ring, and the attack's precomputation depends only on the
ring. (The schemes in question are simpler attack targets than
Kyber, but this isn't comforting given how many previously claimed
dividing lines have been broken by this line of work; see Section
1.2 of https://ntruprime.cr.yp.to/latticerisks-20211031.pdf. The
latest work is continuing to focus on the simpler targets,
improving precomputation time and main-computation time.)

The situation is different for ECC: discrete logs, unlike lattices, have
a _tight_ and easily verified worst-case-to-average-case self-reduction
guaranteeing that an attack breaking one of T targets is as hard as an
attack breaking a prespecified target. Also, ECC attacks are---except
for the advent of quantum computers!---much more stable and much more
precisely analyzed than lattice attacks.
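A toy version of that self-reduction in Python, with a multiplicative group mod a small prime standing in for the curve (parameters illustrative): blinding a prespecified target by a random known exponent produces a uniform instance, so an attack that breaks one of T uniform targets can be fed T blinded copies of the one key that matters, and whichever copy it breaks unblinds to the target's secret.

    import secrets

    p, g = 2**61 - 1, 3                      # toy group (p is prime)

    def rerandomize(h: int) -> tuple[int, int]:
        r = secrets.randbelow(p - 1)
        return h * pow(g, r, p) % p, r       # uniform instance, known offset

    x = secrets.randbelow(p - 1)             # secret of the prespecified target
    h = pow(g, x, p)
    h_blind, r = rerandomize(h)
    # ... hand h_blind to the one-out-of-T attack; it returns log_g(h_blind):
    x_plus_r = (x + r) % (p - 1)             # stand-in for the attack's output
    assert (x_plus_r - r) % (p - 1) == x     # unblinding recovers x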

---D. J. Bernstein (speaking for myself)