Just wanted to point out a few recent blog posts and upcoming events.
First, if you are in the Atlanta area, check out
July 21, 2009 – Cloud Computing with Hadoop, Map/Reduce and Cascading.
Also, A new Cascading pipe – MultiGroupBy outlines a way to defer the joining of multiple streams during co-grouping to a subsequent operation (a Buffer).
This is definitely something we would like to adopt in some fashion for Cascading 1.1.
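For readers who have not seen the idea before, here is a rough sketch of what "deferring the join" could look like. It is plain Python, not the Cascading API and not the MultiGroupBy implementation from the linked post; the names co_group, inner_join_buffer and count_buffer, and the sample data, are made up for illustration. The grouping step only collects each stream's values per key, and a later function (standing in for a Buffer) decides whether and how to combine them.

```python
from collections import defaultdict
from itertools import product


def co_group(*streams):
    """Group (key, value) records from several streams by key, without joining.

    Returns {key: [values_from_stream_0, values_from_stream_1, ...]}.
    """
    grouped = defaultdict(lambda: [[] for _ in streams])
    for index, stream in enumerate(streams):
        for key, value in stream:
            grouped[key][index].append(value)
    return dict(grouped)


def inner_join_buffer(key, per_stream_values):
    """Hypothetical downstream step: emit the cross product (an inner join)."""
    for combo in product(*per_stream_values):
        yield (key, *combo)


def count_buffer(key, per_stream_values):
    """Hypothetical downstream step: count each side, never building the join."""
    yield (key, *(len(values) for values in per_stream_values))


# Illustrative data only.
users = [(1, "alice"), (2, "bob")]
orders = [(1, "book"), (1, "pen"), (3, "lamp")]

groups = co_group(users, orders)
joined = [row for key in sorted(groups)
          for row in inner_join_buffer(key, groups[key])]
counts = [row for key in sorted(groups)
          for row in count_buffer(key, groups[key])]
```

The point of the split is that count_buffer never has to materialize the joined rows, while inner_join_buffer reproduces an ordinary join from the same grouped input.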
Next, Cascading’s Logparser example in Clojure takes home the current meme on wrapping Cascading with Clojure. For those not in the loop, “Clojure is a dialect of Lisp, and shares with Lisp the code-as-data philosophy and a powerful macro system”.
I think this reinforces the idea of exposing a MapReduce query planner as an API and not as a syntax. I am very interested to see how this evolves.
Finally, the folks at Cloudscaling call out a recent presentation by Chris,
Hadoop 101, that covers Hadoop, Cascading, and some best practices.
You can reach the presentation directly here: Building Scale Free Applications with Hadoop and Cascading.