Group categories:
Computers > Software
Discussions
Topics 1 - 10 of 1555

Description: User group for Cascading users
 

Newbie problem about using scalding 
  Sorry if this has been asked before. I am a Java developer trying Scala for Hadoop data processing. The Scalding project looks really cool and easy. However, after I decided to run some tests, I haven't been able to get it set up with my current project for days. Is there a way to use Scalding like a jar library or include Scalding in the dependencies easily? Creating a new project just... more »
By raas  - May 23 - 3 new of 3 messages    

Fwd: Job stuck in compilation 
  Has anyone else seen Cascading take a long time in the flow planning phase? We've seen some very long cases when the graph gets up to 30-35 steps. ---------- Forwarded message ---------- ...To: Scalding Users <scalding-us...@twitter.com> In case anyone is interested, I think I figured out why this happened.... more »
By Oscar Boykin  - May 23 - 1 new of 1 message    

Pattern Released 
  Hey all, we finally got Pattern pushed up to GitHub [link] and updated the Pattern page [link]. Enjoy, chris -- Chris K Wensel ch...@concurrentinc.com [link]
By Chris K Wensel  - May 23 - 1 new of 1 message    

Baffling error message 
  I am manipulating pairs of strings. I've created a PreferenceDatum type that encapsulates the pair of strings, and I use the fields from it pretty much all the way through the flow. But for some reason that baffles me, I get errors throughout. What are the rules for field names? I get exceptions for the following:... more »
By Pat Ferrel  - May 23 - 9 new of 9 messages    

Scalding: Column Reordering In a Write Statement 
  Hi, I am currently running into an issue where, after I perform the chained groupBy statement below, I cannot change the ordering of the fields in the write statement. .... .groupBy(('id,'week,'year)) { g => g.sum('soldQuantity -> 'aggrQuantity) .average('soldQuantity -> 'avgQuantity)... more »
By Brian Arnold  - May 23 - 2 new of 2 messages    

Can Cascading help me with my use case? 
  Hi All, I am new to Cascading, so please be gentle... I am currently working on a project which involves populating a graph database. Below is some background information about my project. I am trying to create a graph that shows the relationships between entities (Person, Location, Organisations) and the document(s) in which they occur. I use the... more »
By Martijn van Leeuwen  - May 23 - 3 new of 3 messages    

scalding: serialization issue during join 
  I am attempting to join a TSV source with an Avro source. I am using scalding 0.8.2 and the scalding.avro project (in this specific job, I'm using the PackedAvroSource). When I attempt to run the job on my cluster, it throws the following exception during initialization: Exception in thread "main" cascading.flow.planner.PlannerException: could... more »
By jeffo  - May 21 - 8 new of 8 messages    

Announcing Pattern 
  Hey all, we announced Pattern today: a new framework and library for machine learning on Cascading on Hadoop. You can see the announcement here: [link] The Pattern page is here: [link]... more »
By Chris K Wensel  - May 21 - 1 new of 1 message    

Using scalding to run MR jobs from reading Pail files with protobuf 
  Hey Helena, The current definitive reference on Pail is an article by Nathan Marz (Pail's author): [link] [link] PailSource is in serious alpha, and the brief answer is that we don't currently have great examples or tests of how to use Pail with more... more »
By Sam Ritchie  - May 21 - 2 new of 2 messages    

Dynamic Column Creating Functions? 
  Hi-- I'm trying to create a Pivot-like function in which the stream accepts a set of tuples and introduces new column names based on the input tuples. Effectively, I'm creating my own data structure based on the operate() input and when a flush() is invoked, it creates the necessary tuples and flows them through the OutputCollector.... more »
By Eric Y. Theriault  - May 19 - 3 new of 3 messages    


Send email to this group: cascading-user@googlegroups.com