Cascading is the proven application development platform for building data applications on Hadoop.

Get Cascading 3.0

Related Projects


Simplifies development of Cascading applications through an advanced auto-suggesting fluent API.


Simplifies systems integration through ANSI SQL compatibility and a JDBC driver.


Enables development with Scala, a powerful language for solving functional problems.


Enables development with Clojure, a Lisp dialect.

Latest News

Announcing Cascading 3.0 on Apache Flink

Thanks to our partners at data Artisans, Cascading users now have an additional compute fabric on which to execute Cascading 3.0 applications: Apache Flink. From the project site: “Apache Flink is a platform for scalable stream and batch processing. Flink’s execution engine features low-latency pipelined and scalable…

Cascading 3.0 Maintenance Release

We have just published Cascading 3.0.2, a minor maintenance release. Upgrading is recommended for all users. This release resolves the following issues: Updated Apache Tez to 0.6.2 to prevent deadlocks in complex DAGs. Note that this release is incompatible with Tez 0.6.1. Fixed issues concerning detailed…

Cascading 2.7 Maintenance Release

We have just published Cascading 2.7.1, a minor maintenance release. This release resolves the following issues: Fixed issue where c.p.GroupBy or c.p.CoGroup would fail if attempting to group or join incoming Fields.UNKNOWN tuple streams using relative positions in the grouping fields selectors. Fixed issue where c.u.ShutdownUtil…

July 2015

July 2015 Newsletter

Cascading 3.0 Maintenance Release

We have just published Cascading 3.0.1, a new maintenance release. This release resolves the following issue: Fixed issue in c.f.t.p.Hadoop2TezFlowStepJob where the LocalResources were not passed to the AppMaster correctly, causing a ClassNotFoundException during split calculation for custom InputFormats. It can be downloaded from…

June 2015

There has been a lot going on in the last month. The Cascading 3.0 release is now available. This release helps future-proof your data infrastructure investments by supporting newer compute fabrics as they become available. Also, a new EAP version for Driven is freely available for doing real time performance

Cascading-Hive 2.0 Release

We are happy to announce the release of Cascading-Hive 2.0. This release adds compatibility with Cascading 3.0. Furthermore, it contains a major contribution from the Cascading community: it is now possible to read and write ACID ORC tables with Cascading-Hive. This feature relies…
