From: eeps...@marketshare.com
Sent: March 11, 2016 2:22:50pm PST
To: cascading-user
Subject: AWS S3 with IAM Assume Role Session access - not on EMR or EC2
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at https://groups.google.com/group/cascading-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/16a1824e-560e-413f-8df1-26e5bb93f22d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
From: eeps...@marketshare.com
Sent: March 11, 2016 2:58:37pm PST
To: cascading-user
Subject: Re: AWS S3 with IAM Assume Role Session access - not on EMR or EC2
We are not running on EMR.
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at https://groups.google.com/group/cascading-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/cd8d5180-e1dd-4d90-930c-fbee86f699ef%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hey Ken,
I did not want to hijack the other thread but was wondering if this aside was from your experience or if there are other such tips documented somewhere.
As an aside, normally you don't want to read directly from S3 in a workflow - it often leads to job failures when you've got lots of data.
So in our workflows we first use embedded distcp job (via the DistCp class) to copy files into HDFS.
-- Ken
From: sarma.t...@tubemogul.com
Sent: March 11, 2016 6:56:32pm PST
To: cascadi...@googlegroups.com
Subject: RE: AWS S3 with IAM Assume Role Session access - not on EMR or EC2
Hey Ken,
I did not want to hijack the other thread but was wondering if this aside was from your experience or if there are other such tips documented somewhere.
As an aside, normally you don't want to read directly from S3 in a workflow - it often leads to job failures when you've got lots of data.
So in our workflows we first use embedded distcp job (via the DistCp class) to copy files into HDFS.
-- Ken
From: eepstein@marketshare.com
Sent: March 11, 2016 2:22:50pm PST
To: cascading-user
Subject: AWS S3 with IAM Assume Role Session access - not on EMR or EC2
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at https://groups.google.com/group/cascading-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/76CFA679C7066E87.1-1ee2fbdf-ac0a-49fb-a172-e318c8ea2d63%40mail.outlook.com.
For more options, visit https://groups.google.com/d/optout.