Nope.
OpenRefine was designed to work with data locally. Then you can export the transformed data.
which is really cool for working with and moving data between systems and transforming it. This is called ETL. Extract, Transform, Load.
Nifi can ingress from anywhere if need be via stream or batch onto disk if not enough memory, egress out to wherever, but Nifi is not a distributed computing platform but moves data from/to them while transforming if need be.
The Hadoop Summit 2016 and also Oscon 2015 videos are excellent
Data provenance can be disabled, see their wiki and community docs and of course ask questions on their mailing list.
Good luck Daniel and if you can tell me more about your problem I'd be happy to help further.