My cdap pipline logs :
2021-12-13 09:46:03,222 - DEBUG [provisioning-task-6:i.c.c.i.p.t.ProvisioningTask@125] - Executing PROVISION subtask REQUESTING_CREATE for program run program_run:default.test.-SNAPSHOT.workflow.DataPipelineWorkflow.25c89d8a-5bdc-11ec-90c5-2c768a554ecd. 2021-12-13 09:46:03,226 - DEBUG [provisioning-task-6:i.c.c.i.p.t.ProvisioningTask@129] - Completed PROVISION subtask REQUESTING_CREATE for program run program_run:default.test.-SNAPSHOT.workflow.DataPipelineWorkflow.25c89d8a-5bdc-11ec-90c5-2c768a554ecd. 2021-12-13 09:46:03,348 - DEBUG [provisioning-task-6:i.c.c.i.p.t.ProvisioningTask@116] - Completed PROVISION task for program run program_run:default.test.-SNAPSHOT.workflow.DataPipelineWorkflow.25c89d8a-5bdc-11ec-90c5-2c768a554ecd. 2021-12-13 09:46:06,001 - INFO [program-start-8:i.c.c.i.a.r.d.DistributedProgramRunner@479] - Starting Workflow Program 'DataPipelineWorkflow' with Arguments [logical.start.time=1639376162648,
system.profile.name=USER:hajvahid], with debugging false 2021-12-13 09:46:06,024 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.RemoteExecutionTwillPreparer@249] - Create and copy launcher.jar 2021-12-13 09:46:06,025 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.RemoteExecutionTwillPreparer@278] - Done launcher.jar 2021-12-13 09:46:06,027 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.RemoteExecutionTwillPreparer@232] - Create and copy twill.jar 2021-12-13 09:46:06,027 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.RemoteExecutionTwillPreparer@240] - Done twill.jar 2021-12-13 09:46:06,028 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@521] - Create and copy application.jar 2021-12-13 09:46:06,028 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@529] - Done application.jar 2021-12-13 09:46:06,028 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@542] - Create and copy resources.jar 2021-12-13 09:46:06,043 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@545] - Done resources.jar 2021-12-13 09:46:06,043 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@585] - Populating Runnable LocalFiles 2021-12-13 09:46:06,043 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/tmp/1639376165470-0/cConf.xml 2021-12-13 09:46:06,044 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/conf/logback-container.xml 2021-12-13 09:46:06,044 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/tmp/1639376165470-0/1639376165501-0/artifacts.jar 2021-12-13 09:46:06,044 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/tmp/1639376165470-0/appSpec1788042231698515657.json 2021-12-13 09:46:06,044 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/namespaces/system/artifacts/cdap-data-pipeline/6.4.1.746bf979-4052-4ab4-aeca-eae2e4763138.jar 2021-12-13 09:46:06,044 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/tmp/1639376165470-0/hConf.xml 2021-12-13 09:46:06,044 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/tmp/1639376165470-0/1639376165501-0/artifacts.jar 2021-12-13 09:46:06,045 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@592] - Added file file:/root/cdap-sandbox-6.4.1/data/tmp/1639376165470-0/program.options6623809234107657218.json 2021-12-13 09:46:06,045 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@595] - Done Runnable LocalFiles 2021-12-13 09:46:06,045 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@613] - Creating /root/cdap-sandbox-6.4.1/data/tmp/25c89d8a-5bdc-11ec-90c5-2c768a554ecd242575053672585158/runtime.config.jar2192106370460675325/twillSpec.json 2021-12-13 09:46:06,047 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@633] - Done /root/cdap-sandbox-6.4.1/data/tmp/25c89d8a-5bdc-11ec-90c5-2c768a554ecd242575053672585158/runtime.config.jar2192106370460675325/twillSpec.json 2021-12-13 09:46:06,048 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@644] - Creating /root/cdap-sandbox-6.4.1/data/tmp/25c89d8a-5bdc-11ec-90c5-2c768a554ecd242575053672585158/runtime.config.jar2192106370460675325/logback-template.xml 2021-12-13 09:46:06,058 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@648] - Done /root/cdap-sandbox-6.4.1/data/tmp/25c89d8a-5bdc-11ec-90c5-2c768a554ecd242575053672585158/runtime.config.jar2192106370460675325/logback-template.xml 2021-12-13 09:46:06,060 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@550] - Create and copy runtime.config.jar 2021-12-13 09:46:06,065 - DEBUG [runtime-scheduler-11:i.c.c.i.a.r.d.r.AbstractRuntimeTwillPreparer@572] - Done runtime.config.jar 2021-12-13 09:46:06,175 - ERROR [runtime-scheduler-11:i.c.c.i.a.r.d.r.RemoteExecutionTwillRunnerService@527] - Fail to start program run program_run:default.test.-SNAPSHOT.workflow.DataPipelineWorkflow.25c89d8a-5bdc-11ec-90c5-2c768a554ecd java.io.IOException: Failed to SSH to
ro...@172.16.10.2:22 at io.cdap.cdap.common.ssh.DefaultSSHSession.<init>(DefaultSSHSession.java:103) ~[na:na] at io.cdap.cdap.internal.app.runtime.distributed.remote.RemoteExecutionTwillPreparer.launch(RemoteExecutionTwillPreparer.java:117) ~[na:na] at io.cdap.cdap.internal.app.runtime.distributed.remote.AbstractRuntimeTwillPreparer.lambda$start$1(AbstractRuntimeTwillPreparer.java:466) ~[na:na] at io.cdap.cdap.internal.app.runtime.distributed.remote.RemoteExecutionTwillRunnerService$ControllerFactory.lambda$create$0(RemoteExecutionTwillRunnerService.java:503) ~[na:na] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[na:1.8.0_181] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_181] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180) ~[na:1.8.0_181] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) ~[na:1.8.0_181] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[na:1.8.0_181] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[na:1.8.0_181] at java.lang.Thread.run(Thread.java:748) ~[na:1.8.0_181] Caused by: com.jcraft.jsch.JSchException: Auth fail at com.jcraft.jsch.Session.connect(Session.java:519) ~[com.jcraft.jsch-0.1.54.jar:na] at com.jcraft.jsch.Session.connect(Session.java:183) ~[com.jcraft.jsch-0.1.54.jar:na] at io.cdap.cdap.common.ssh.DefaultSSHSession.<init>(DefaultSSHSession.java:100) ~[na:na] ... 10 common frames omitted 2021-12-13 09:46:06,185 - WARN [runtime-scheduler-11:i.c.c.i.a.r.d.r.RemoteExecutionTwillRunnerService@537] - Force termination of remote process for program_run:default.test.-SNAPSHOT.workflow.DataPipelineWorkflow.25c89d8a-5bdc-11ec-90c5-2c768a554ecd failed java.io.IOException: Failed to SSH to
ro...@172.16.10.2:22 at io.cdap.cdap.common.ssh.DefaultSSHSession.<init>(DefaultSSHSession.java:103) ~[na:na] at io.cdap.cdap.internal.app.runtime.distributed.remote.SSHRemoteProcessController.killProcess(SSHRemoteProcessController.java:107) ~[na:na] at io.cdap.cdap.internal.app.runtime.distributed.remote.SSHRemoteProcessController.kill(SSHRemoteProcessController.java:102) ~[na:na] at io.cdap.cdap.internal.app.runtime.distributed.remote.RemoteExecutionTwillRunnerService$ControllerFactory.lambda$create$2(RemoteExecutionTwillRunnerService.java:535) ~[na:na] at java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:760) ~[na:1.8.0_181] at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:736) ~[na:1.8.0_181] at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:474) ~[na:1.8.0_181] at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1977) ~[na:1.8.0_181] at io.cdap.cdap.internal.app.runtime.distributed.remote.RemoteExecutionTwillRunnerService$ControllerFactory.lambda$create$0(RemoteExecutionTwillRunnerService.java:505) ~[na:na] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[na:1.8.0_181] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_181] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180) ~[na:1.8.0_181] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) ~[na:1.8.0_181] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[na:1.8.0_181] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[na:1.8.0_181] at java.lang.Thread.run(Thread.java:748) ~[na:1.8.0_181] Caused by: com.jcraft.jsch.JSchException: socket is not established at com.jcraft.jsch.Util.createSocket(Util.java:394) ~[com.jcraft.jsch-0.1.54.jar:na] at com.jcraft.jsch.Session.connect(Session.java:215) ~[com.jcraft.jsch-0.1.54.jar:na] at com.jcraft.jsch.Session.connect(Session.java:183) ~[com.jcraft.jsch-0.1.54.jar:na] at io.cdap.cdap.common.ssh.DefaultSSHSession.<init>(DefaultSSHSession.java:100) ~[na:na] ... 15 common frames omitted 2021-12-13 09:46:07,564 - WARN [provisioning-task-8:i.c.c.r.s.p.r.RemoteHadoopProvisioner@148] - Unable to clean up resources for program DataPipelineWorkflow run 25c89d8a-5bdc-11ec-90c5-2c768a554ecd on the remote cluster. The run directory may need to be manually deleted on cluster node 172.16.10.2. java.io.IOException: Failed to SSH to
ro...@172.16.10.2:22 at io.cdap.cdap.common.ssh.DefaultSSHSession.<init>(DefaultSSHSession.java:103) ~[na:na] at io.cdap.cdap.internal.provision.DefaultSSHContext.createSSHSession(DefaultSSHContext.java:120) ~[na:na] at io.cdap.cdap.runtime.spi.ssh.SSHContext.createSSHSession(SSHContext.java:92) ~[na:na] at io.cdap.cdap.runtime.spi.ssh.SSHContext.createSSHSession(SSHContext.java:80) ~[na:na] at io.cdap.cdap.runtime.spi.provisioner.remote.RemoteHadoopProvisioner.createSSHSession(RemoteHadoopProvisioner.java:82) ~[na:na] at io.cdap.cdap.runtime.spi.provisioner.remote.RemoteHadoopProvisioner.deleteCluster(RemoteHadoopProvisioner.java:143) ~[na:na] at io.cdap.cdap.runtime.spi.provisioner.Provisioner.deleteClusterWithStatus(Provisioner.java:142) [na:na] at io.cdap.cdap.internal.provision.task.ClusterDeleteSubtask.execute(ClusterDeleteSubtask.java:42) [na:na] at io.cdap.cdap.internal.provision.task.ProvisioningSubtask.execute(ProvisioningSubtask.java:54) [na:na] at io.cdap.cdap.internal.provision.task.ProvisioningTask.lambda$executeOnce$1(ProvisioningTask.java:127) [na:na] at io.cdap.cdap.common.service.Retries.callWithRetries(Retries.java:185) ~[na:na] at io.cdap.cdap.common.service.Retries.callWithInterruptibleRetries(Retries.java:259) ~[na:na] at io.cdap.cdap.internal.provision.task.ProvisioningTask.executeOnce(ProvisioningTask.java:127) [na:na] at io.cdap.cdap.internal.provision.ProvisioningService.lambda$null$21(ProvisioningService.java:659) ~[na:na] at io.cdap.cdap.internal.provision.ProvisioningService.callWithProgramLogging(ProvisioningService.java:837) ~[na:na] at io.cdap.cdap.internal.provision.ProvisioningService.lambda$null$22(ProvisioningService.java:657) ~[na:na] at io.cdap.cdap.common.async.KeyedExecutor$2.run(KeyedExecutor.java:99) ~[na:na] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[na:1.8.0_181] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_181] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180) ~[na:1.8.0_181] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) ~[na:1.8.0_181] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[na:1.8.0_181] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[na:1.8.0_181] at java.lang.Thread.run(Thread.java:748) ~[na:1.8.0_181] Caused by: com.jcraft.jsch.JSchException: Auth fail at com.jcraft.jsch.Session.connect(Session.java:519) ~[com.jcraft.jsch-0.1.54.jar:na] at com.jcraft.jsch.Session.connect(Session.java:183) ~[com.jcraft.jsch-0.1.54.jar:na] at io.cdap.cdap.common.ssh.DefaultSSHSession.<init>(DefaultSSHSession.java:100) ~[na:na] ... 23 common frames omitted 2021-12-13 09:46:07,702 - DEBUG [provisioning-task-8:i.c.c.i.p.t.ProvisioningTask@116] - Completed DEPROVISION task for program run program_run:default.test.-SNAPSHOT.workflow.DataPipelineWorkflow.25c89d8a-5bdc-11ec-90c5-2c768a554ecd.