While waiting for a response, I just want to post updated bench results: the module I made reached 1.2M HTTP requests per second on my 16-core laptop at 100% CPU clock, serving a static HTTP page with keep-alive + desync mode. I came up with desync mode later and it improves performance by 2-3x; together with this laptop's faster CPU, that explains the difference from my earlier results.
Even with Cypache, the actual open source HTTP server I'm writing, which parses HTTP headers, gets the request URL, resolves the pathname, checks that the resolved path contains the www dir in full, etc., it should still reach roughly 1M requests per second, since with the CPU underclocked to 50% I get up to 500k requests per second.
Mentioning this mainly for 2 reasons:
1. To make the point that extern functions for calling from C++ into Cython are absolutely invaluable imo.
2. I know that Stefan wrote a post on his blog about a similar (but in my opinion somewhat weaker) result by Nexedi in 2018:
http://blog.behnel.de/posts/fast-python-web-server-with-cython.html
I thought my results were worth mentioning in that regard, since they exceed those results by a fair amount and the HTTP server itself is open source (fastepoll, the backend model, is currently proprietary and copyrighted to me, but it can be replaced with uvloop very easily).
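For anyone who wants to try the uvloop route, here is a minimal sketch in plain Python of a keep-alive static-page responder on top of asyncio, using uvloop only if it is installed. This is not Cypache or fastepoll code and it does not implement desync mode; the names StaticPage, serve and run, and the port 8080, are made up for illustration.

```python
import asyncio

try:
    import uvloop  # optional third-party event loop; assumed installed separately
except ImportError:
    uvloop = None

BODY = b"<html><body>hello</body></html>"
RESPONSE = (
    b"HTTP/1.1 200 OK\r\n"
    b"Content-Type: text/html\r\n"
    b"Content-Length: " + str(len(BODY)).encode() + b"\r\n"
    b"Connection: keep-alive\r\n\r\n" + BODY
)


class StaticPage(asyncio.Protocol):
    """Answers every request on a connection with the same prebuilt bytes."""

    def connection_made(self, transport):
        self.transport = transport
        self.buffer = b""

    def data_received(self, data):
        self.buffer += data
        # One response per complete header block; the socket stays open,
        # so a client can reuse the connection (keep-alive).
        while b"\r\n\r\n" in self.buffer:
            _, _, self.buffer = self.buffer.partition(b"\r\n\r\n")
            self.transport.write(RESPONSE)


async def serve(host="127.0.0.1", port=8080):
    loop = asyncio.get_running_loop()
    server = await loop.create_server(StaticPage, host, port)
    async with server:
        await server.serve_forever()


def run():
    # Use uvloop's loop when available, otherwise the stock asyncio loop.
    loop = uvloop.new_event_loop() if uvloop is not None else asyncio.new_event_loop()
    try:
        loop.run_until_complete(serve())
    finally:
        loop.close()
```

Calling run() starts the server and blocks. The sketch ignores request bodies and the request line entirely, which is only acceptable for a static-page benchmark.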
Note that although Cypache does real processing, such as reading index.html or any other HTML file, supporting index traversal, and calling realpath to resolve paths, it relies on heavy caching (with an update timer of up to 1 sec) for basically everything to achieve these high-end results.
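To make the caching idea concrete, here is a rough Python sketch of the general technique: resolve the URL with realpath, check that the result stays inside the www dir, and cache the outcome for up to 1 second. It is not how Cypache is actually written; CachedSite, CACHE_TTL and the injectable clock are hypothetical names, and the path check assumes a POSIX system.

```python
import os
import time

CACHE_TTL = 1.0  # seconds; mirrors the "up to 1 sec" update timer mentioned above


class CachedSite:
    def __init__(self, www_dir, clock=time.monotonic):
        self.root = os.path.realpath(www_dir)
        self.clock = clock  # injectable so the expiry logic can be tested
        self._cache = {}    # url path -> (expiry time, body bytes or None)

    def _resolve(self, url_path):
        """Map a URL path to a real file inside the www dir, or None."""
        candidate = os.path.realpath(os.path.join(self.root, url_path.lstrip("/")))
        if os.path.isdir(candidate):
            # Directory request: fall back to its index.html.
            candidate = os.path.realpath(os.path.join(candidate, "index.html"))
        # Reject anything that escapes the www dir (via ".." or symlinks).
        if os.path.commonpath([self.root, candidate]) != self.root:
            return None
        return candidate if os.path.isfile(candidate) else None

    def get(self, url_path):
        now = self.clock()
        hit = self._cache.get(url_path)
        if hit is not None and hit[0] > now:
            return hit[1]  # still fresh: no filesystem access at all
        path = self._resolve(url_path)
        body = None
        if path is not None:
            with open(path, "rb") as f:
                body = f.read()
        # Misses are cached too, so repeated bad URLs stay cheap.
        self._cache[url_path] = (now + CACHE_TTL, body)
        return body
```

With this shape, a hot URL costs one dict lookup per request, and a changed file shows up at most CACHE_TTL seconds late. The cache here is unbounded, so a real server would also need some limit on the number of entries.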