--
You received this message because you are subscribed to the Google Groups "Luigi" group.
To unsubscribe from this group and stop receiving emails from it, send an email to luigi-user+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Thanks!
On Wednesday, February 25, 2015 at 1:58:40 AM UTC-8, Ron Reiter wrote:
> I'll let Erik answer this one, but IMHO you can do two things:
>
>
> 1) Run Luigi as a separate Python process which runs a Spark Python job, and then the targets would be files
> 2) Run Luigi on the cluster using the Spark Python interpreter, using the spark context wherever you need it, and create targets or task outputs which are actually RDDs
>
>
> Both should work I guess.
>
>
>
>
>
>
>
>
>
> Thanks,
> Ron
>
>
>
>
>
for path in all_paths:
target = S3FlagTarget(path)
if target.exists():
run_child_job(path)