Error parsing file


dlo...@gmail.com

Apr 7, 2014, 6:58:59 PM
to h2os...@googlegroups.com
Hi.

I uploaded a dataset into both the last stable build and the latest nightly build. In both cases I get an error while it is being parsed into the hex file.

The message is...

Got Exception RuntimeException, with msg java.util.concurrent.ExecutionException: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt

I got this error on a 4-node test cluster as well as on a single node. I got it with a data set of more than 5 million rows as well as with a smaller sample of 100k rows.

The only interesting thing about the file is that most of the columns are factors rather than integer or float values.
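To put numbers on that observation, the distinct values in each column can be counted before uploading. The following is a minimal sketch and not part of H2O: the function name `column_cardinality` and the sample rows are hypothetical, and it assumes tab-delimited input because the parse request in the log uses separator=9 (the tab character). Whether column cardinality has anything to do with the exception is not established here; the counts only help describe the file in a bug report.

```python
import csv
import io
from collections import defaultdict


def column_cardinality(lines, delimiter="\t"):
    """Return {column_index: number_of_distinct_values} for delimited text.

    `lines` is any iterable of text lines, e.g. an open file object.
    """
    distinct = defaultdict(set)
    for row in csv.reader(lines, delimiter=delimiter):
        for idx, value in enumerate(row):
            distinct[idx].add(value)
    return {idx: len(values) for idx, values in sorted(distinct.items())}


# Made-up sample standing in for the real file (assumption, not actual data).
sample = io.StringIO("red\t1\tA\nblue\t2\tA\nred\t3\tB\n")
print(column_cardinality(sample))
```

For a real file, pass an open file object instead of the `io.StringIO` sample, for example `open(path, newline="")`.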

The nightly build I am using is...

2.5.0.1294

I also see this problem on older builds, including the April 3 build.

The following stack dump appears in the console...


06:48:27.097 # Session INFO HTTPD: GET /Parse.html single_quotes=0 header=0 separator=9 parser_type=CSV destination_key=pricer_examples3.hex source_key=pricer_examples3.txt
java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
at water.parser.ParseDataset.parse(ParseDataset.java:337)
at water.parser.ParseDataset$ParserFJTask.compute2(ParseDataset.java:367)
at water.H2O$H2OCountedCompleter.compute(H2O.java:712)
at jsr166y.CountedCompleter.exec(CountedCompleter.java:429)
at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
Caused by: java.util.concurrent.ExecutionException: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
at jsr166y.ForkJoinTask.get(ForkJoinTask.java:961)
at water.parser.ParseDataset.parse(ParseDataset.java:336)
... 7 more
Caused by: java.lang.NullPointerException: null while mapping key $00000000c00100000000$pricer_examples3.txt
at water.parser.Enum.getTokenId(Enum.java:83)
at water.parser.DParseTask.addStrCol(DParseTask.java:923)
at water.parser.CsvParser.parallelParse(CsvParser.java:128)
at water.parser.DParseTask.map(DParseTask.java:589)
at water.MRTask.lcompute(MRTask.java:66)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
at water.MRTask.lcompute(MRTask.java:62)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
at water.MRTask.lcompute(MRTask.java:62)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
at water.MRTask.lcompute(MRTask.java:62)
at water.DRemoteTask.compute2(DRemoteTask.java:91)
... 6 more


Sri

Apr 7, 2014, 11:33:55 PM
to dlo...@gmail.com, h2os...@googlegroups.com
We'll take a quick look.

Can you check if it works on our previous stable release - Jacobi?

http://s3.amazonaws.com/h2o-release/h2o/rel-jacobi/2/index.html

Thanks,
Sri

Tom Kraljevic

Apr 8, 2014, 12:21:11 AM
to Sri, dlo...@gmail.com, h2os...@googlegroups.com

I filed this bug to track:
https://0xdata.atlassian.net/browse/PUB-564

Thanks
Tom

Doug Loyer

Apr 8, 2014, 9:06:39 AM
to Sri, h2os...@googlegroups.com
I was able to reproduce the problem in version 2.2.0.2.

I have a small, 35M data set that reproduces the problem. I can send it to you, but I would rather not send it to the whole group.

-Doug