Category: Impatient

“Cascading for the Impatient” blog post series

Cascading for the Impatient, Part 6

In our fifth installment of this series we showed how to implement TF-IDF in Cascading application. If you haven’t read that yet, it’s probably best to start there. Today’s post extends the TF-IDF app to show best practices for test-driven development (TDD) at scale. We’ll… Continue reading

Cascading for the Impatient, Part 5

In our fourth installment of this series we showed how to use HashJoin on two pipes, to perform “stop words” filtering at scale in a Cascading 2.0 application. If you haven’t read that yet, it’s probably best to start there. Today’s lesson builds on that… Continue reading

Cascading for the Impatient, Part 4

In our third installment of this series we showed how to write a custom Operation for a Cascading 2.0 application. If you haven’t read that yet, it’s probably best to start there. Today’s lesson takes that same Word Count app and expands on it to… Continue reading

Cascading for the Impatient, Part 3

In our second installment of this series we showed how to implement Word Count as a Cascading 2.0 application. If you haven’t read that yet, it’s probably best to start there. Today’s lesson takes the same app and stretches it even more. We’ll show how… Continue reading

Cascading for the Impatient, Part 2

In our first installment of this series we showed how to create the simplest possible Cascading 2.0 application. If you haven’t read that yet, it’s probably best to start there. Today’s lesson takes the same app and stretches it a bit further. Undoubtedly you’ve seen… Continue reading

Cascading for the Impatient, Part 1

The lesson today is how to write a simple Cascading 2.0 app. The goal is clear and concise: create the simplest application possible in Cascading, while following best practices. No bangs, no whistles, just good solid code. https://github.com/Cascading/Impatient/tree/master/part1 Here’s a brief Java program, about a… Continue reading