Hrmmmm... I don't believe that's how it works. Perhaps I misread what you wrote but If you had 1000 concurrent users, a NodeJS application server would be able to serve those users easily on one thread (Event Loop, asynchronously non-blocking); however, in a traditional Apache Tomcat setup, you'd have to spawn 1 thread per connection (1000 users = 1000 threads)...so you'd end up consuming way more memory and have to bring in load balancers than a NodeJS application server would use.
It seems like a lot of open source high performance web servers are following in the non-blocking event driven I/O. Nginx, lighttpd, and Mongrel2 just to name a few.
I'm not too sure about whether it caches the response like you've said though, I could be wrong about the V8 JS engine but I think the only real micro-optimizations on the backend is V8 tries to figure out patterns in your execution sequences, much like PHP APC.
The main win with using NodeJS as a platform is the fact that you can port existing heavy-lifting code from the front-end to the back-end, such as, calculating map clusters like on Kiwi's Google Map, sorting huge JSON data sets, or distributing computing power globally based on idle connections (
http://maprejuice.com/). Additionally, its inherent non-blocking I/O gives your project a better, more understandable way of developing applications (we're used to the event driven architecture), and its sheer performance.
There's a few really neat applications from the NodeJS Knockout competition (
http://nodeknockout.com/) open for voting, like, an MMO Tetris, MMO Scrabble, Multiplayer typing games, and more.