Commercial Grade Open Source Technology for Big Data Applications
Cascading is a Java application framework that enables typical developers to quickly and easily develop rich Data Analytics and Data Management applications that can be deployed and managed across a variety of computing environments. Cascading works seamlessly with Apache Hadoop 1.0 and API compatible distributions.
75,000+ Downloads per Month
Cascading is the most widely used and deployed technology for Big Data applications with more than 75,000+ user downloads a month. Used by thousands of data driven businesses including Twitter, eBay, The Climate Corp and Etsy, Cascading is the de-facto application framework for building and deploying large scale data processing applications.
The Business of Big Data
Cascading was designed to fit into any Enterprise Java development environment. With its clear distinction between “data processing” and “data integration”, its clean Java API, and JUnit testing framework, Cascading can be easily tested at any scale. Even the core Cascading development team runs 1,500 tests daily on an continuous Integration server and deploys all the tested Java libraries into our own public Maven repository, conjars.org.