Hi, I am getting FAILED_TO_UNCOMPRESS(5) error when trying to decompress one of thrift field that we set earlier in the pipeline. We run hadoop job to read these data and it runs into the exception shown in the following callstack:
org.xerial.snappy.SnappyNative.throw_error(SnappyNative.java:98) org.xerial.snappy.SnappyNative.rawUncompress(Native Method) org.xerial.snappy.Snappy.rawUncompress(Snappy.java:474) org.xerial.snappy.Snappy.uncompress(Snappy.java:513) org.xerial.snappy.SnappyInputStream.readFully(SnappyInputStream.java:147) org.xerial.snappy.SnappyInputStream.readHeader(SnappyInputStream.java:99) org.xerial.snappy.SnappyInputStream.<init>(SnappyInputStream.java:59)The code that throws an exception is this line:
SnappyInputStream inputStream = new SnappyInputStream(byteArrayInputStream);The parent thrift has been serialized/deserialized using binary thrift protocol. I can read all the other fields fine. I have tried to debug this issue but because the code that throws an exception is in native C++ code, it seems I am not able to step into those from IntelliJ.
What are the options to debug this, please let me know! I am trying to force to use pure jave but I am having some other trouble at the moment. If any of you have seen issues like in the hadoop pipeline, please let me know.
Thanks!
org.xerial.snappy.SnappyError: [PARSING_ERROR] position: 5
at org.xerial.snappy.pure.SnappyRawDecompressor.uncompressAll(SnappyRawDecompressor.java:155)