Hi, we are heavy pig users and right now doing PoC on Scalding.
I need to create/find simple utility which can help me to translate:
1. human deadable input to Parquet/Avro
2. Parquet/Avro result to human readable format
The idea is:
1. create human-readable testing input for scalding job
2. declare desired target format (avro/parquet)
3. feed input to job
4. convert result to human readable format and verify.
We did throw away pig-unit since it's useless and created our own pig-testing utility which automatically converts and human-readable input to target format and feed it to script during test. We also have automatic tool which parses avro/parquet result to JSON and allows write evident output verifications.
Is there sometihing similar is scalding?
What are other approaches to do integration test for a scalding job using readers and writers?