Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
cascading-user
Conversations
About
Groups keyboard shortcuts have been updated
Dismiss
See shortcuts
cascading-user
Contact owners and managers
1–30 of 3247
Mark all as read
Report group
0 selected
Chris K Wensel
2/16/23
Update
Hey all Quick status update. I have started working on a new project for developers that intersects
unread,
Update
Hey all Quick status update. I have started working on a new project for developers that intersects
2/16/23
Chris K Wensel
8/31/22
Cascading 4.5
Hey all Quick note that Cascading 4.5 has been released. This adds support for Hadoop 3.x. This
unread,
Cascading 4.5
Hey all Quick note that Cascading 4.5 has been released. This adds support for Hadoop 3.x. This
8/31/22
Chris K Wensel
8/30/22
Cascading 4.1
Hey all Quick note that Cascading 4.1 has been released. This will be the last Hadoop 2.x minor
unread,
Cascading 4.1
Hey all Quick note that Cascading 4.1 has been released. This will be the last Hadoop 2.x minor
8/30/22
Velkumar Neel
, …
Chris K Wensel
3
7/18/22
Cascading to spark
There are a lot of 'depends' here to work through.. First, I'd see what the gap is to
unread,
Cascading to spark
There are a lot of 'depends' here to work through.. First, I'd see what the gap is to
7/18/22
Chris K Wensel
1/14/22
Cascading 4.5 WIP = Hadoop 3 and Tez 0.10
Hey all Thanks to the team at Foursquare Labs, we have a new WIP with support for Hadoop 3. Special
unread,
Cascading 4.5 WIP = Hadoop 3 and Tez 0.10
Hey all Thanks to the team at Foursquare Labs, we have a new WIP with support for Hadoop 3. Special
1/14/22
Rakesh Iyer
,
Chris K Wensel
2
12/7/21
HashJoin results for a SelfJoin of a small File produces partial result.
Can you share what version of Hadoop you are running in the cluster? Also, is this test run on the
unread,
HashJoin results for a SelfJoin of a small File produces partial result.
Can you share what version of Hadoop you are running in the cluster? Also, is this test run on the
12/7/21
Chris K Wensel
5/31/21
Cascading 4.0.0 Released
Hey all Just a quick note to announce that Cascading 4.0.0 has been released to Maven Central. https:
unread,
Cascading 4.0.0 Released
Hey all Just a quick note to announce that Cascading 4.0.0 has been released to Maven Central. https:
5/31/21
Chris K Wensel
3
4/18/21
[Action Required] Conjars Repo
I made the switch.. For complicated reasons I had to redirect the name servers to reference a new
unread,
[Action Required] Conjars Repo
I made the switch.. For complicated reasons I had to redirect the name servers to reference a new
4/18/21
Velkumar Neel
, …
Chris K Wensel
17
2/10/21
Cascading counter
I'm unsure what outcome you expect? You can always test the sources for size before you plan the
unread,
Cascading counter
I'm unsure what outcome you expect? You can always test the sources for size before you plan the
2/10/21
ajay wisawe
,
Chris K Wensel
2
9/3/20
How Can I change the LOG LEVEL for Class : cascading.tap.hadoop.io.TapOutputCollector
If you stick log4j log declarations into your JobConf, that is then passed to the planner, they will
unread,
How Can I change the LOG LEVEL for Class : cascading.tap.hadoop.io.TapOutputCollector
If you stick log4j log declarations into your JobConf, that is then passed to the planner, they will
9/3/20
gc1888
,
Chris K Wensel
2
6/17/20
TextDelimited using quote string causes job to fail.
Sounds like you have a record the regex used by TextDelimited can't parse. If you enable traps
unread,
TextDelimited using quote string causes job to fail.
Sounds like you have a record the regex used by TextDelimited can't parse. If you enable traps
6/17/20
Guillaume Bibens
,
Chris K Wensel
2
6/16/20
Support Hadoop 3
Hi My plan is to release Cascading 4.0 for Hadoop 2 only. When C4 will be released, I'm unsure.
unread,
Support Hadoop 3
Hi My plan is to release Cascading 4.0 for Hadoop 2 only. When C4 will be released, I'm unsure.
6/16/20
Saravanabavagugan
,
Chris K Wensel
6
2/19/20
Cascading deletes some temporary files which MR job is trying to access in its mapper resulting in stale file handle error
Threading is not happening within the task. We use multithreading to create and run many MR jobs in
unread,
Cascading deletes some temporary files which MR job is trying to access in its mapper resulting in stale file handle error
Threading is not happening within the task. We use multithreading to create and run many MR jobs in
2/19/20
Chris K Wensel
,
Dusty OBrien
4
12/3/19
Nested Fields and Hierarchical Data
Yeah that does help. I figure I could reach into the JSON myself if I use the whole originalRecord
unread,
Nested Fields and Hierarchical Data
Yeah that does help. I figure I could reach into the JSON myself if I use the whole originalRecord
12/3/19
Chris K Wensel
10/24/19
Re: Reading GS files
If Hadoop supports gs:// then Cascading supports it. Just know the support may not ship native in
unread,
Re: Reading GS files
If Hadoop supports gs:// then Cascading supports it. Just know the support may not ship native in
10/24/19
Chris Schneider
10/16/19
cascading.utils 2.6.4 released
Hi Cascading Buddies, I just pushed a new version of cascading.utils to Conjars. Minor changes
unread,
cascading.utils 2.6.4 released
Hi Cascading Buddies, I just pushed a new version of cascading.utils to Conjars. Minor changes
10/16/19
PaulON
, …
Ben Podgursky
5
8/2/19
"Too many counters"
I vaguely recall (could very well be wrong, long time ago) that you have to set this on the
unread,
"Too many counters"
I vaguely recall (could very well be wrong, long time ago) that you have to set this on the
8/2/19
Jing Lu
,
Ken Krugler
5
7/29/19
Pipeline becomes very slow after I try to join one small data set using joinWithSmaller/Tiny
I tried joinWithSmaller, it's also not terminated. So, I was thinking to try something more
unread,
Pipeline becomes very slow after I try to join one small data set using joinWithSmaller/Tiny
I tried joinWithSmaller, it's also not terminated. So, I was thinking to try something more
7/29/19
Kunal Ghosh
,
Wang Zhong
2
7/18/19
Cascading hive metastore with Kerberos authentication
Hi, It seems that the version of your hive metastore service is older than that of your hive
unread,
Cascading hive metastore with Kerberos authentication
Hi, It seems that the version of your hive metastore service is older than that of your hive
7/18/19
Ben Podgursky
,
Chris K Wensel
2
6/29/19
Released OSS data pipeline orchestrator Workflow2, with Cascading integration
Hey, this looks great! Thanks for sharing this with everyone! chris On Jun 28, 2019, at 8:57 AM, Ben
unread,
Released OSS data pipeline orchestrator Workflow2, with Cascading integration
Hey, this looks great! Thanks for sharing this with everyone! chris On Jun 28, 2019, at 8:57 AM, Ben
6/29/19
gc1888
,
Chris K Wensel
3
2/26/19
Writing HFiles
I was able to do this by modifying the HFileoutFormat2 to work with the hadoop.mapred library. I then
unread,
Writing HFiles
I was able to do this by modifying the HFileoutFormat2 to work with the hadoop.mapred library. I then
2/26/19
HIMANSHU VERMA
,
Ken Krugler
2
12/3/18
Problem with avro part file compaction using Cascading
Why are you comparing Avro records? Are you using the record as part of a grouping key? Asking
unread,
Problem with avro part file compaction using Cascading
Why are you comparing Avro records? Are you using the record as part of a grouping key? Asking
12/3/18
duob...@homeaway.com
11/16/18
Joins and optimal partitioning
Hi all, I've never really had a good opportunity to deeply understand LARGE (inner) joins in
unread,
Joins and optimal partitioning
Hi all, I've never really had a good opportunity to deeply understand LARGE (inner) joins in
11/16/18
Srikanth Adibhatla
, …
minkymorgan
4
11/8/18
Cascading - strategic direction
1. For complex ETL pipelines where we had to apply multiple business rules on incoming data, we found
unread,
Cascading - strategic direction
1. For complex ETL pipelines where we had to apply multiple business rules on incoming data, we found
11/8/18
khedkarn...@gmail.com
10/24/18
Unable to read decimal and timestamp column values from a parquet file
Hello, I am facing an issue while reading decimal and timestamp type values from a parquet file. I am
unread,
Unable to read decimal and timestamp column values from a parquet file
Hello, I am facing an issue while reading decimal and timestamp type values from a parquet file. I am
10/24/18
PaulON
,
Chris K Wensel
4
10/15/18
Custom Partitioner not working for GroupBy
Basically we have some join operations that have a small set of keys (100's) and we want want one
unread,
Custom Partitioner not working for GroupBy
Basically we have some join operations that have a small set of keys (100's) and we want want one
10/15/18
Shivank Garg
,
Ken Krugler
2
7/4/18
Optimising Cascading Codes.
Hi Shivank, As Bill Atkinson told me when I first started working at Apple, "Measure, then
unread,
Optimising Cascading Codes.
Hi Shivank, As Bill Atkinson told me when I first started working at Apple, "Measure, then
7/4/18
Hagai Attias
6/26/18
Converting a pipe to a map using .toIterableExecution results in OutOfMemory
I'm using the following code to convert a pipe into a map and use it in a subsequent step: val
unread,
Converting a pipe to a map using .toIterableExecution results in OutOfMemory
I'm using the following code to convert a pipe into a map and use it in a subsequent step: val
6/26/18
Chris K Wensel
,
vinnov...@gmail.com
2
6/25/18
Re: AVAILABLE CONSULTANT HOTLIST
Title: Jr. ScrumMaster Location: Denver (DTC) - CO Duration: 6 month contract Experience : 4-5 yrs
unread,
Re: AVAILABLE CONSULTANT HOTLIST
Title: Jr. ScrumMaster Location: Denver (DTC) - CO Duration: 6 month contract Experience : 4-5 yrs
6/25/18
ranjan banerjee
, …
Chris K Wensel
8
6/6/18
Is retry policy available for individual hadoop jobs
see FlowStepJob#blockOnJob() that should get you going in the right direction. but keep in mind this
unread,
Is retry policy available for individual hadoop jobs
see FlowStepJob#blockOnJob() that should get you going in the right direction. but keep in mind this
6/6/18