--
You received this message because you are subscribed to the Google Groups "Alluxio Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to alluxio-user...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Bin, hello, thaks for helping me. Their state are unpersisted.
Denis
14 Июн 2016 г. 20:31 пользователь "Bin Fan" <fanb...@gmail.com> написал:
Thanks,
Denis
Bin, I checked configuration in properties file. We already set there alluxio.user.file.writetype.default=CACHE_THROUGH. (and this confirmed by http://our-node-with-alluxio:19999/configuration)
-bash-4.1$ ps uax | grep alluxio
yarn 5745 0.0 0.0 103276 900 pts/1 S+ 09:52 0:00 grep alluxio
yarn 42420 0.0 0.0 106108 1212 ? Ss Jun15 0:00 /bin/bash -c ./alluxio-yarn-setup.sh application-master -num_workers 9 -master_address uat-node005 -resource_path hdfs://nameservice1/tmp 1>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/stdout 2>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/stderr
yarn 42425 0.0 0.0 106108 1264 ? S Jun15 0:00 /bin/bash ./alluxio-yarn-setup.sh application-master -num_workers 9 -master_address uat-node005 -resource_path hdfs://nameservice1/tmp
yarn 42431 0.0 0.0 106112 1340 ? S Jun15 0:00 /bin/bash ./integration/bin/alluxio-application-master.sh -num_workers 9 -master_address uat-node005 -resource_path hdfs://nameservice1/tmp
yarn 42452 1.3 0.1 905024 180604 ? Sl Jun15 13:37 /usr/java/default//bin/java -cp /data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/conf/::/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/assembly/target/alluxio-assemblies-1.1.0-jar-with-dependencies.jar:/etc/hadoop/conf.cloudera.yarn:/var/run/cloudera-scm-agent/process/4087-yarn-NODEMANAGER:/opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop/*:/opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop/lib/*:/opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop-hdfs/*:/opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop-hdfs/lib/*:/opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop-yarn/*:/opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop-yarn/lib/*:/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/* -Dalluxio.home=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001 -Dalluxio.logs.dir=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/logs -Dalluxio.worker.tieredstore.level0.dirs.path=/tmp/ramdisk -Dalluxio.master.hostname=uat-node005 -Dalluxio.underfs.address=hdfs://nameservice1/ -Dalluxio.worker.memory.size=10GB -Dlog4j.configuration=file:/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000001/conf/log4j.properties -Dorg.apache.jasper.compiler.disablejsr199=true -Djava.net.preferIPv4Stack=true -Djava.security.krb5.realm= -Djava.security.krb5.kdc= -Xmx256M alluxio.yarn.ApplicationMaster -num_workers 9 -master_address uat-node005 -resource_path hdfs://nameservice1/tmp
yarn 42705 0.0 0.0 106108 1212 ? Ss Jun15 0:00 /bin/bash -c ./alluxio-yarn-setup.sh alluxio-worker 1>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/stdout 2>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/stderr
yarn 42709 0.0 0.0 106104 1260 ? S Jun15 0:00 /bin/bash ./alluxio-yarn-setup.sh alluxio-worker
yarn 42737 0.0 0.0 106112 1328 ? S Jun15 0:00 /bin/bash ./integration/bin/alluxio-worker-yarn.sh
yarn 42847 0.1 0.4 33199372 612876 ? Sl Jun15 1:49 /usr/java/default//bin/java -cp /data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/conf/::/data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/assembly/target/alluxio-assemblies-1.1.0-jar-with-dependencies.jar -Dalluxio.home=/data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003 -Dalluxio.logs.dir=/data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/logs -Dalluxio.worker.tieredstore.level0.dirs.path=/tmp/ramdisk -Dalluxio.master.hostname=uat-node005 -Dalluxio.underfs.address=hdfs://nameservice1/ -Dalluxio.worker.memory.size=10GB -Dlog4j.configuration=file:/data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/conf/log4j.properties -Dorg.apache.jasper.compiler.disablejsr199=true -Djava.net.preferIPv4Stack=true -Djava.security.krb5.realm= -Djava.security.krb5.kdc= -Dalluxio.logger.type=WORKER_LOGGER -Dalluxio.home=/data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003 -Dalluxio.logger.type=WORKER_LOGGER -Dalluxio.logs.dir=/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003 -Dalluxio.master.hostname=uat-node005 alluxio.worker.AlluxioWorker
-bash-4.1$ cd /data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/conf/
-bash-4.1$ pwd
/data/disk0/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000003/conf
-bash-4.1$ ls
alluxio-env.sh alluxio-site.properties core-site.xml hdfs-site.xml log4j.properties mapred-site.xml topology.map topology.py workers yarn-site.xml
-bash-4.1$ cat alluxio-site.properties | grep type | grep write
alluxio.user.file.writetype.default=CACHE_THROUGH
Now lets check the node with running master (uat-node005)
-bash-4.1$ ps aux | grep alluxio
yarn 7732 0.0 0.0 106108 1212 ? Ss Jun15 0:00 /bin/bash -c ./alluxio-yarn-setup.sh alluxio-master 1>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/stdout 2>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/stderr
yarn 7736 0.0 0.0 106108 1264 ? S Jun15 0:00 /bin/bash ./alluxio-yarn-setup.sh alluxio-master
yarn 7748 0.0 0.0 106108 1212 ? Ss Jun15 0:00 /bin/bash -c ./alluxio-yarn-setup.sh alluxio-worker 1>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011/stdout 2>/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011/stderr
yarn 7752 0.0 0.0 106108 1264 ? S Jun15 0:00 /bin/bash ./alluxio-yarn-setup.sh alluxio-worker
yarn 7756 0.0 0.0 106112 1336 ? S Jun15 0:00 /bin/bash ./integration/bin/alluxio-master-yarn.sh
yarn 7812 0.0 0.0 106112 1336 ? S Jun15 0:00 /bin/bash ./integration/bin/alluxio-worker-yarn.sh
yarn 7912 0.3 0.5 33245016 686232 ? Sl Jun15 3:53 /usr/java/default//bin/java -cp /data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/conf/::/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/assembly/target/alluxio-assemblies-1.1.0-jar-with-dependencies.jar -Dalluxio.home=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002 -Dalluxio.logs.dir=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/logs -Dalluxio.worker.tieredstore.level0.dirs.path=/tmp/ramdisk -Dalluxio.master.hostname=uat-node005 -Dalluxio.underfs.address=hdfs://nameservice1/ -Dalluxio.worker.memory.size=10GB -Dlog4j.configuration=file:/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/conf/log4j.properties -Dorg.apache.jasper.compiler.disablejsr199=true -Djava.net.preferIPv4Stack=true -Djava.security.krb5.realm= -Djava.security.krb5.kdc= -Dalluxio.logger.type=MASTER_LOGGER -Dalluxio.home=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002 -Dalluxio.logger.type=MASTER_LOGGER -Dalluxio.logs.dir=/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002 alluxio.master.AlluxioMaster
yarn 7950 0.1 0.3 33160472 475616 ? Sl Jun15 1:30 /usr/java/default//bin/java -cp /data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011/conf/::/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011/assembly/target/alluxio-assemblies-1.1.0-jar-with-dependencies.jar -Dalluxio.home=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011 -Dalluxio.logs.dir=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011/logs -Dalluxio.worker.tieredstore.level0.dirs.path=/tmp/ramdisk -Dalluxio.master.hostname=uat-node005 -Dalluxio.underfs.address=hdfs://nameservice1/ -Dalluxio.worker.memory.size=10GB -Dlog4j.configuration=file:/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011/conf/log4j.properties -Dorg.apache.jasper.compiler.disablejsr199=true -Djava.net.preferIPv4Stack=true -Djava.security.krb5.realm= -Djava.security.krb5.kdc= -Dalluxio.logger.type=WORKER_LOGGER -Dalluxio.home=/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011 -Dalluxio.logger.type=WORKER_LOGGER -Dalluxio.logs.dir=/var/log/hadoop-yarn/container/application_1465799602059_0210/container_e19_1465799602059_0210_01_000011 -Dalluxio.master.hostname=uat-node005 alluxio.worker.AlluxioWorker
yarn 18961 0.0 0.0 103276 904 pts/1 S+ 09:58 0:00 grep alluxio
-bash-4.1$ cd /data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/conf/
-bash-4.1$ pwd
/data/disk7/yarn/nm/usercache/devops/appcache/application_1465799602059_0210/container_e19_1465799602059_0210_01_000002/conf
-bash-4.1$ ls
alluxio-env.sh alluxio-site.properties core-site.xml hdfs-site.xml log4j.properties mapred-site.xml topology.map topology.py workers yarn-site.xml
-bash-4.1$ cat alluxio-site.properties | grep write | grep type
alluxio.user.file.writetype.default=CACHE_THROUGH
So looks like a configuration on classpath for both (master and worker) and on both nodes we see alluxio.user.file.writetype.default=CACHE_THROUGH in alluxio-site.properties.
Best regards,
Denis
Hello Bin,Thanks for helping,You are correct.So I think we have a picture now:1. Spark application should have alluxio config files in its class path.We've copied these configs to /opt/spark/lib/ and they worked fine.But could you point us to documentation which describes how to do that? Or provide detailed description of your solution.So with everything works fine (including persisting _SUCCESS files)
2. About our problem with _SUCCESS, I think we did not see it on hdfs because it's generated by driver, not by a executor.