I have some data (output of a rhipe job) I'm trying to instantiate as a ddo. When I run updateAttributes the job fails:
data<-updateAttributes(data)
---------------------------------
There were R errors, showing 30:
1(1):
R ERROR BEGIN (map):
=============
Error: PB ERROR[LOGLEVEL_E
Autokill is true and terminating job_1455286789626_0015
Error in .jcall("RJavaTools", "Ljava/lang/Object;", "invokeMethod", cl, :
java.io.FileNotFoundException: Cannot access /tmp/tmp_output-0498f342445580fbf0bd05ce8f34b2db: No such file or directory.
In addition: Warning message:
In Rhipe:::rhwatch.runner(job = job, mon.sec = mon.sec, readback = readback, :
Job failure, deleting output: /tmp/tmp_output-0498f342445580fbf0bd05ce8f34b2db:
----------------------------------------------------------------------------------------------------------------
I've already changed the permissions of /tmp to 777. I've also tried changing the temp directory using rhoptions("HADOOP.TMP.FOLDER"="/user/tessera/tmp") to a folder I've created using rhmkdir.
----------------------------------------------------------------------------------------------------------------
permission owner group size modtime file
1 drwxrwxrwt hadoop supergroup 0 2016-02-12 14:19 /tmp/hadoop-yarn
I don't know if this helps, but here is my hadoop temp dir setting from core-site.xml
<property><name>hadoop.tmp.dir</name><value>/mnt/var/lib/hadoop/tmp</value></property>
The suggested fix
rhoptions(file.types.remove.regex =(/_meta|/_rh_meta|/_outputs|/_SUCCESS|/_LOG|/_log|rhipe_debug|rhipe_merged_index_db)")
doesn't help. Any ideas? Here is my session Info. I'm using EMR cluster started with ami 3.11.
----------------------------------------------------------------------------------------------------------------
sessionInfo()
R version 3.2.2 (2015-08-14)
Platform: x86_64-redhat-linux-gnu (64-bit)
Running under: Amazon Linux AMI 2015.09
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 LC_PAPER=en_US.UTF-8 LC_NAME=en_US.UTF-8
[9] LC_ADDRESS=en_US.UTF-8 LC_TELEPHONE=en_US.UTF-8 LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] datadr_0.7.5.9 Rhipe_0.75.2 rJava_0.9-9 codetools_0.2-14
loaded via a namespace (and not attached):
[1] Rcpp_0.12.3 lattice_0.20-33 digest_0.6.9 dplyr_0.4.3 assertthat_0.1 chron_2.3-47 grid_3.2.2
[8] R6_2.1.2 DBI_0.3.1 magrittr_1.5 data.table_1.9.6 hexbin_1.27.1 tools_3.2.2 parallel_3.2.2
Error: PB ERROR[LOGLEVEL_ERROR](google/protobuf/io/coded_stream.cc:171) A protocol message was rejected because it was too big (more than 268435456 bytes). To increase the limit (or to disable these warnings), see CodedInputStream::SetTotalBytesLimit() in google/protobuf/io/coded_stream.h.
Sorry, need to investigate more....
--
You received this message because you are subscribed to the Google Groups "Tessera-Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tessera-user...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tessera-users/179288b6-b810-45ca-83c1-5996454bf100%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.