I uploaded a dataset into both the last stable build and the latest nightly build. In both cases I get an error while parsing it into the hex file.
The message is...
Got Exception RuntimeException, with msg java.util.concurrent.ExecutionException: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
I got this error on both a 4-node test cluster and a single node. It occurs with a data set of more than 5 million rows as well as with a smaller sample of 100k rows.
The only interesting thing about the file is that most of the columns are factors rather than integer or float values.
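In case it helps anyone trying to reproduce this, here is a minimal sketch of how the factor columns could be summarized before parsing. It assumes the file is tab-separated with no header row (matching separator=9 and header=0 in the Parse request below). The function name count_distinct_per_column is my own and not part of H2O, and I have not confirmed that the number of distinct values has anything to do with the NullPointerException.

```python
import csv
import io

def count_distinct_per_column(lines, delimiter="\t"):
    """Return the number of distinct values seen in each column.

    `lines` is any iterable of text lines, e.g. an open file object.
    Hypothetical helper for inspecting the data; not an H2O API.
    """
    distinct = []  # one set of observed values per column
    for row in csv.reader(lines, delimiter=delimiter):
        # Grow the list if this row has more columns than seen so far.
        while len(distinct) < len(row):
            distinct.append(set())
        for i, value in enumerate(row):
            distinct[i].add(value)
    return [len(values) for values in distinct]

# Tiny made-up sample standing in for the real file.
sample = io.StringIO("red\t1\tNY\nblue\t2\tNY\nred\t3\tCA\n")
counts = count_distinct_per_column(sample)
print(counts)
```

For the real data, the same function could be called on an open file handle, for example `count_distinct_per_column(open("pricer_examples3.txt", newline=""))`.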
The nightly build I am using is...
2.5.0.1294
I also see this problem on older builds, including the April 3 build.
The following stack dump appears in the console...
06:48:27.097 # Session INFO HTTPD: GET /Parse.html single_quotes=0 header=0 separator=9 parser_type=CSV destination_key=pricer_examples3.hex source_key=pricer_examples3.txt
java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
at water.parser.ParseDataset.parse(ParseDataset.java:337)
at water.parser.ParseDataset$ParserFJTask.compute2(ParseDataset.java:367)
at water.H2O$H2OCountedCompleter.compute(H2O.java:712)
at jsr166y.CountedCompleter.exec(CountedCompleter.java:429)
at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
Caused by: java.util.concurrent.ExecutionException: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
at jsr166y.ForkJoinTask.get(ForkJoinTask.java:961)
at water.parser.ParseDataset.parse(ParseDataset.java:336)
... 7 more
Caused by: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
at water.parser.Enum.getTokenId(Enum.java:83)
at water.parser.DParseTask.addStrCol(DParseTask.java:923)
at water.parser.CsvParser.parallelParse(CsvParser.java:128)
at water.parser.DParseTask.map(DParseTask.java:589)
at water.MRTask.lcompute(MRTask.java:66)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
at water.MRTask.lcompute(MRTask.java:62)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
at water.MRTask.lcompute(MRTask.java:62)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
at water.MRTask.lcompute(MRTask.java:62)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
... 6 more