You cannot post messages because only members can post, and you are not currently a member.
Description:
User group for Cascading users
|
|
|
Newbie problem about using scalding
|
| |
Sorry if this has been asked before. I am java developer trying scala for hadoop data processing. The scalding project looks really cool and easy. However, after I decided to run some test, I can't get it setup for days with my current project. Is there a way to use scalding like a jar library or include scalding in the dependency easily? Creating a new project just... more »
|
|
Fwd: Job stuck in compilation
|
| |
Anyone else see cascading take a long time in flow planning phase?
We've seen some very long cases when the graph gets up to 30-35 steps.
---------- Forwarded message ----------
...To: Scalding Users <scalding-us...@twitter.com>
In case anyone is interested, I think I figured out why this happened.... more »
|
|
Pattern Released
|
| |
Hey all
We finally got Pattern pushed up to GitHub
[link]
and updated the Pattern page [link]
Enjoy
chris
--
Chris K Wensel
ch...@concurrentinc.com
[link]
|
|
Baffling error message
|
| |
I am manipulating pairs of strings? I've created a PreferenceDatum type that encapsulated the pair of strings and use the fields from it pretty much all the way through the flow. But for some reason that baffles me I get errors throughout. What are the rules for field names?
I get exceptions for the following:... more »
|
|
Scalding: Column Reordering In a Write Statement
|
| |
Hi, I am currently running into an issue where after i perform the following chained groupBy statement below, I cannot change the ordering of the fields in the write statement. .... .groupBy(('id,'week,'year)) { g => g.sum('soldQuantity -> 'aggrQuantity) .average('soldQuantity -> 'avgQuantity)... more »
|
|
Can Cascading help me with my use case?
|
| |
Hi All, I am new to Cascading, so please be gentle... I am currently working on a project which involves populating a graph database. Below is some background informatie about my project. I am trying to create a graph the shows a relationship between entities (Person, Location, Organisations) and the document(s) they occur. I use the... more »
|
|
scalding: serialization issue during join
|
| |
I am attempting to join a TSV source with an avro source. I am using scalding 0.8.2 and the scalding.avro project (in this specific job, I'm using the PackedAvroSource). When I attempt to run the job on my cluster, it throws the following exception during initialization: Exception in thread "main" cascading.flow.planner.Planner Exception: could... more »
|
|
Announcing Pattern
|
| |
Hey all
We announced Pattern today. A new framework and library for machine learning on Cascading on Hadoop.
You can see the announcement here: [link]
The Pattern page is here:
[link]... more »
|
|
Dynamic Column Creating Functions?
|
| |
Hi-- I'm trying to create a Pivot-like function in which the stream accepts a set of tuples and introduces new column names based on the input tuples. Effectively, I'm creating my own data structure based on the operate() input and when a flush() is invoked, it creates the necessary tuples and flows them through the OutputCollector.... more »
|
|
|