Hi everybody,
I have been doing some random machine learning doodling and have always been haunted by this problem.
When I just want to find out whether my methodology works, I program in Clojure against a small dataset that fits in a single host's memory (or even just doodle in R or Python; I know, blasphemy).
When I want to put the results into use, I usually have to rewrite the whole process in Cascalog just to apply it to data in Hadoop. I know I could start out writing Hadoop functions directly, but that's overkill, since I have to run simulation studies to verify performance, correctness, etc. before even going into production development. Besides, having Hadoop sit in memory between my program and the machine just slows the computation down.
I've daydreamed a lot about some smart way to prototype so that, with the flip of a switch, my functions become easily applicable to Hadoop data. Is there any way to do that?
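To make it concrete, here is a minimal sketch (plain Clojure, no Cascalog dependency) of the kind of thing I mean: keep the core logic in pure functions, prototype on an in-memory seq, and later reuse the same function in a query. The Cascalog usage in the trailing comments is hypothetical (`numbers` is a made-up generator), just to show the idea.

```clojure
;; Core logic as a pure function: knows nothing about Hadoop or Cascalog.
(defn normalize
  "Scale x into [0, 1] given the observed min and max."
  [lo hi x]
  (/ (- x lo) (double (- hi lo))))

;; Prototype locally on an in-memory dataset.
(def sample [2.0 4.0 6.0 10.0])

(def normalized
  (let [lo (apply min sample)
        hi (apply max sample)]
    (mapv (partial normalize lo hi) sample)))
;; normalized => [0.0 0.25 0.5 1.0]

;; Later, the same var could (hypothetically) be dropped into a
;; Cascalog query, since Cascalog accepts plain Clojure fns as ops:
;; (?<- (stdout) [?norm]
;;      (numbers ?x)
;;      (normalize 2.0 10.0 ?x :> ?norm))
```

The point is that only the driver changes between prototyping and production; the function itself never learns where the data lives.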
I actually asked around, and someone I admire a lot suggested writing a Hadoop-emulator-like wrapper for my input data. Is there an easier way? I would appreciate any input. Thanks.
Hesen