--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/WCZygpwcvhIJ.
To post to this group, send email to cascadi...@googlegroups.com.
To unsubscribe from this group, send email to cascading-use...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
To post to this group, send email to cascading-user@googlegroups.com.
To unsubscribe from this group, send email to cascading-user+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/fL9yHv5Z1WQJ.
To post to this group, send email to cascadi...@googlegroups.com.
To unsubscribe from this group, send email to cascading-use...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/mdBisAgRNocJ.
To post to this group, send email to cascadi...@googlegroups.com.
To unsubscribe from this group, send email to cascading-use...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/t1OKvoEF_W4J.
To post to this group, send email to cascadi...@googlegroups.com.
To unsubscribe from this group, send email to cascading-use...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
It helps dealing with skewed data.
Right now, the replication is constant across all keys, but a smarter approach is to only replicate the keys with a lot of values (we have this on our roadmap: do a sampled cogroup + count, compute replication from count, then Join the replication to the non-sampled streams and apply the block join algorithm).
--
Oscar Boykin :: @posco :: https://twitter.com/intent/user?screen_name=posco
> > > > > > > > To post to this group, send email to cascadi...@googlegroups.com (mailto:cascadi...@googlegroups.com).
> > > > > > > > To unsubscribe from this group, send email to cascading-use...@googlegroups.com (mailto:cascading-use...@googlegroups.com).
> > > > > > > > For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
> > > > > > >
> > > > > > >
> > > > > > > --------------------------
> > > > > > > Ken Krugler
> > > > > > > http://www.scaleunlimited.com (http://www.scaleunlimited.com/)
> > > > > > > custom big data solutions & training
> > > > > > > Hadoop, Cascading, Mahout & Solr
> > > > > >
> > > > > >
> > > > > > --
> > > > > > You received this message because you are subscribed to the Google Groups "cascading-user" group.
> > > > > > To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/fL9yHv5Z1WQJ.
> > > > > > To post to this group, send email to cascadi...@googlegroups.com (mailto:cascadi...@googlegroups.com).
> > > > > > To unsubscribe from this group, send email to cascading-use...@googlegroups.com (mailto:cascading-use...@googlegroups.com).
> > > > > > For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
> > > > >
> > > > >
> > > > > --
> > > > > Chris K Wensel
> > > > > ch...@concurrentinc.com (mailto:ch...@concurrentinc.com)
> > > > > http://concurrentinc.com (http://concurrentinc.com/)
> > > >
> > >
> > >
> > > --
> > > You received this message because you are subscribed to the Google Groups "cascading-user" group.
> > > To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/t1OKvoEF_W4J.
> > > To post to this group, send email to cascadi...@googlegroups.com (mailto:cascadi...@googlegroups.com).
> > > To unsubscribe from this group, send email to cascading-use...@googlegroups.com (mailto:cascading-use...@googlegroups.com).
> > > For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
> >
> >
> > --
> > Chris K Wensel
> > ch...@concurrentinc.com (mailto:ch...@concurrentinc.com)
> > http://concurrentinc.com (http://concurrentinc.com/)
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> > --
> > You received this message because you are subscribed to the Google Groups "cascading-user" group.
> > To post to this group, send email to cascadi...@googlegroups.com (mailto:cascadi...@googlegroups.com).
> > To unsubscribe from this group, send email to cascading-use...@googlegroups.com (mailto:cascading-use...@googlegroups.com).
> > For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.
>
>
> --
> Chris K Wensel
> ch...@concurrentinc.com (mailto:ch...@concurrentinc.com)
> http://concurrentinc.com
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
> --
> You received this message because you are subscribed to the Google Groups "cascading-user" group.
> To post to this group, send email to cascadi...@googlegroups.com (mailto:cascadi...@googlegroups.com).
> To unsubscribe from this group, send email to cascading-use...@googlegroups.com (mailto:cascading-use...@googlegroups.com).