Welcome
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster.
The processing API lets the developer quickly assemble complex distributed processes without having to "think" in MapReduce. And to efficiently schedule them based on their dependencies and other available meta-data.
Read more about Cascadings features.
Also see our About page. For a simple primer on using Cascading, please read our Gentle Introduction. And for those already familiar with Hadoop and MapReduce, read our detailed Overview on how Cascading works.
With our Groovy extension, Cascading.groovy, it is even simpler for both advanced and casual Hadoop users to create complex processing workflows. Read more on our Groovy Scripting page.
Recent Events
Version 0.6.0 of Cascading is now available for download. For details on new features and bug fixes, see the CHANGES.txt file. For a quick summary, read on.
We are pleased to announce the 0.1.0 release of Cascading.groovy, our Groovy language interpreter extension. With Cascading.groovy, Hadoop applications can be scripted by both advanced and casual Hadoop users without thinking in MapReduce. Read our Groovy Scripting Overview for more details.
Version 0.5.0 of Cascading is now available for download. For details on new features and bug fixes, see the CHANGES.txt file. For a quick summary, read on.
Version 0.4.0 of Cascading is now available for download. See below for a review of the major changes. For more details, see the changes.txt file.
Cascading 0.3.0 has just been packaged and is available for download from our downloads page. It incorporates many great changes, read on for more.
A little note to let everyone know Cascading is now available for download and includes the full source. Please visit our project site for more information.