Commercial Grade Open Source Technology for Big Data Applications

Cascading is a Java application framework that enables typical developers to quickly and easily build rich data analytics and data management applications that can be deployed and managed across a variety of computing environments. Cascading works seamlessly with Apache Hadoop 1.0 and API-compatible distributions.

75,000+ Downloads per Month

Cascading is the most widely used and deployed technology for Big Data applications, with more than 75,000 user downloads a month. Used by thousands of data-driven businesses, including Twitter, eBay, The Climate Corp and Etsy, Cascading is the de facto application framework for building and deploying large-scale data processing applications.

The Business of Big Data

Cascading was designed to fit into any Enterprise Java development environment. With its clear distinction between “data processing” and “data integration”, its clean Java API, and its JUnit testing framework, Cascading applications can be easily tested at any scale. Even the core Cascading development team runs 1,500 tests daily on a continuous integration server and deploys all the tested Java libraries into its own public Maven repository, conjars.org.