That's one of the use cases we have to improve.
Its a 2 phase approach however to improve for that use case. Several things have to happen and would have to change in OpenRefine and a new mode called "batch mode" would allow to choose from the current way OpenRefine works to an alternate way or mode with some expected latency.
The changes would be in our project manager as well as storage layer as well as the operations matrix methods to utilize the new storage layer/compute layers. We are leaning toward having an option for a storage layer built around Apache Spark / Hadoop and would trial this out with the community. I've also already reached out to the Apache Flink team to see if they want to give a go at perhaps helping as well, and a member of their team said he'd take a look. (no guarantees)
but more links to gists around that research and others can be added to our TODO's on that page by anyone.
Felix, I hope this gives you the high level ideas we have so far.
What we really need are hackers to hack on some of these ideas and see what's feasible.
Jacky and I are more than willing to help guide folks. Our codebase is not the easiest to navigate for the uninitiated.