Cascading is an application framework that lets Java developers build robust data analytics and data management applications on Apache Hadoop. Learn more.

Lingual provides ANSI SQL for Cascading on Apache Hadoop through an API, a JDBC driver, and a command shell. Learn more.

Pattern provides machine learning for Cascading on Apache Hadoop through an API and standards-based PMML. Learn more.

Architecture Diagram

Latest News

Cascading Pattern – Machine Learning for Cascading and Hadoop

Announcing Pattern, a new library and framework that executes PMML workflows as Cascading applications on Apache Hadoop clusters. Read more about it on the Pattern project page, sign up for announcements on the mailing list, or read the press release.

Lingual Public Access

Lingual is now available to download or build. See the Lingual page for details, or visit the Lingual project page.

Lingual’s Architecture

Julian Hyde discusses how Optiq and Cascading work together to become Lingual.

Cascading Lingual – True SQL for Cascading and Hadoop

Announcing Lingual, a new framework that executes ANSI SQL queries as Cascading applications on Apache Hadoop clusters. Read more about it on the Lingual project page, sign up for announcements on the mailing list, or read the press release.

Cascading 2.2 WIP and CoercibleTypes

Cascading 2.2 is starting to take shape for those interested in test-driving emerging features. Of note is “field type” support, which allows fields read from an input file to retain their type information through to where the data is written (sunk) to a file. This…

Read more.