Cascading Modules

Cascading Modules are user contributed code and extensions to Cascading.

Many projects are hosted on GitHub and may have multiple branches and forks as users enrich the original projects.


Tools

  • Bixo - A Cascading based web crawling and data mining toolkit. A more robust replacement for Apache Nutch maintained by Bixo Labs.

  • Cascading-DBMigrate - An alternative to Cascading.JDBC for relational data access and integration.

  • Cascading.Multitool - A command line tool for processing large text files and datasets. Source code


Programming Languages (DSLs)


Integration


Related

  • Riffle - Annotations and Classes for managing and executing dependent processes. A lightweight alternative to Cascading's Cascade class. For use by non-GPL licensed projects.


Many modules are still early stage and are being put forth so more users can test and improve them (so fork freely).

Modules are externally hosted and many are maintained by users. To have your module listed, send an email to support@cascading.org.