I just learned of Elephant Bird, and it may do what I want.
I have some data that I'm writing in one Hadoop MapReduce job and reading in a subsequent job.
Currently, I'm using JSON, which I treat as Text in Hadoop. Parsing the JSON seems to be a
bottleneck, so I'm looking for alternatives.
Can I define my data structures with Protocol Buffers, and then use Elephant Bird to write and read
the data? If so, I believe that would be much faster than writing, reading, and parsing JSON.
Also, is the Elephant Bird library reasonably stable at this point? I want to use it in production code.
Thanks for any help!
-- Paul