Cascading is the proven application development platform for building data applications on Hadoop.




Simplifies systems integration through ANSI SQL compatibility and a JDBC driver


Enables various machine learning scoring algorithms through PMML compatibility


Enables development with Scala, a powerful language for solving functional problems


Enables development with Clojure, a Lisp dialect

Latest News

Cascading 2.6

We are happy to announce that Cascading 2.6 is now publicly available for download. This release contains new features and bug fixes. Of note are the new DecoratorTap and DistCacheTap (itself a DecoratorTap subclass) classes. Working together, Flows can cache data directly into the Hadoop…

The Cascading 3.0 Query Planner

Cascading 1.0, when released, represented a huge milestone. An enterprise-friendly Java API (not a syntax) and a fail-fast planner allowed developers to build robust, maintainable, data-oriented applications that could execute reliably on Apache Hadoop for hours or days. Cascading 2.0 made a nod towards…

Cascading 3.0 WIP Now Supports Apache Tez

We are happy to announce that the latest Cascading 3.0 WIP now adds Apache Tez as a supported runtime platform. We are making this release available so interested parties can begin testing Tez deployments against existing Cascading applications. A downloadable version of Cascading 3.0 WIP…

Fluid — A Fluent API for Cascading

We have announced our new Fluid project and need your help in testing it out. Fluid is an API library exposing the Cascading library as a Fluent API. Fluid’s primary goal is not only to make hard things possible, but also to keep simple things…

IntelliJ Plugin for Cascading

We have published an initial IntelliJ plugin for Cascading designed to improve the experience of developing data-oriented applications in modern IDEs. The first version of the plugin allows developers to quickly visualize and debug their Cascading code with Driven when developing with IntelliJ. Project:…

Driven 1.0

We are happy to announce that Driven 1.0 is now generally available. Driven provides developers with operational visibility for Cascading applications, including those built with Cascading dynamic programming languages (e.g. Scalding, Cascalog, Lingual, and Pattern). Existing users should also make sure they have the latest…


We are happy to announce a new open-source project that we have been working on: Cascading-Hive. Now you can use Cascading and Apache Hive together. Key features: run Hive queries within a Cascade; read from Hive tables within a Cascading Flow; write/create Hive tables…
