Of course it does.
The standard ABI is designed to maximize the speed of main-line (non interrupt) code. To the extent it succeeds in that, any deviation will result in slower code.
If I remember correctly, the Embench project found that RV32E using the simple truncated ABI (6 A, 2 S, 3 T vs 8 A, 12 S, 7 T) is about 1.3x slower than the standard ABI.
I suspect the EABI's proposed 4 A, 5 S, 2 T probably suffers about the same, but maybe a bit less as only having 2 S registers probably causes more spills than reducing A+T from 9 to 6. [1][2]
The purpose of RV32E is to save die space by reducing the register set by 512 bits or 64 bytes. If the 1.3x code size increase is correct and if registers are 2x more expensive than SRAM then you have parity at 32 registers and 384 bytes of code in ITIM vs 16 registers and 512 bytes of code in ITIM. That makes RV32E probably very seldom worth it.
EABI is a bit different. The only purpose I can see for the proposed EABI is for users for whom interrupt latency is everything, and they have either just about no code at all running in the "main program" or else it is low priority background tasks. They are more than happy to give up 1.3x on main program execution speed (and code size) for the trade-off of getting 2 to 2.5 times quicker interrupt response.
If your interrupt routines don't call any other functions then you're better off using the standard ABI and putting __attribute__((interrupt)) on the interrupt routines so they save only and exactly the registers they need.
Presumably EABI users' interrupt handlers are at least somewhat complex, including calling other functions. That means they also will suffer the slowdown from having fewer A+T registers. At some point you'd be better off by using the standard ABI and just saving all those 15 registers on interrupt entry (or on calling a non- __attribute__((interrupt)) function after determining "slow path" handling is needed)
No doubt there is a sweet spot between not calling any other functions and calling too many other functions where the proposed EABI (or something like it) is the best solution. It might be a very small sweet spot.
Regardless of the actual overall performance truth, there is a marketing advantage to being able to tell prospective customers (or their management) that RISC-V has an EABI with similar interrupt latency to the ABI on competitor CPUs. Even if in the end the project engineers don't actually choose to use it.
From this point of view the right number of A registers is clearly four, the same as the competition ESPECIALLY as many people have designed software around four function arguments (no more, no fewer) being supported efficiently. The question then becomes whether two T registers is too many, given that the competitor only has to save one (r12). Having two T registers quite likely (I think) results in overall faster code -- but then so does just not using the EABI in the first place, in most cases.
I think for marketing reasons we should have the RISC-V EABI mimic the competitor ABI as closely as possible, and be available and supported by the tools, even if almost no-one should end up actually using it.
[1] RV32I using the proposed EABI of course has a large number of S registers
[2] Of course this should be tested. I was ready and advocating to do this work in late 2018 but it was deemed to have insufficient priority compared to adding initial V support to Spike (and other later tasks).