<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
    <channel>
        <title>Cascading</title>
        <link>http://www.cascading.org/</link>
        <description></description>
        <language>en</language>
        <copyright>Copyright 2010</copyright>
        <lastBuildDate>Thu, 26 Aug 2010 11:17:18 -0800</lastBuildDate>
        <generator>http://www.sixapart.com/movabletype/</generator>
        <docs>http://www.rssboard.org/rss-specification</docs>
        
        <item>
            <title>Bixo Hackathon</title>
            <description><![CDATA[<p>There will be a <a href="http://openbixo.org/">Bixo</a> hackathon in Nevada City, CA this Sept 7th and 8th. Read more about it <a href="http://openbixo.org/2010/08/26/bixo-hackaton-september-7th-8th/">here</a>.</p>

<p>Note that even if you’re not a hard-core Bixo user, fringe benefits from participating include learning a lot about the very useful underlying technologies (Cascading, Hadoop, HttpClient) as well as getting an excuse to visit beautiful <a href="http://www.nevadacitychamber.com/">Nevada City</a>.</p>

<p>Hope to see you there.</p>]]></description>
            <link>http://www.cascading.org/2010/08/bixo-hackathon.html</link>
            <guid>http://www.cascading.org/2010/08/bixo-hackathon.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Thu, 26 Aug 2010 11:17:18 -0800</pubDate>
        </item>
        
        <item>
            <title>O&apos;Reilly Strata Conference</title>
            <description><![CDATA[<p>The new <a href="http://strataconf.com/strata2011">Strata Conference</a> has just been announced, with a Call for Proposals ending Sept 28.</p>
<p>This new conference is on the 'business of data' and is the sister conference to Velocity.</p>
<p>Hope to see lots of proposals coming in from Hadoop, Cascading, Bixo, and Cascalog users and developers.</p>]]></description>
            <link>http://www.cascading.org/2010/08/oreilly-strata-conference.html</link>
            <guid>http://www.cascading.org/2010/08/oreilly-strata-conference.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Wed, 25 Aug 2010 14:03:36 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading 1.1.2</title>
            <description><![CDATA[<p>We are happy to announce that Cascading 1.1.2 is now publicly available for <a href="/downloads.html">download</a>.</p>

<p>This release features many bug fixes.</p>

<p>For a detailed list of changes see:
<a href="http://github.com/concurrentinc/cascading/blob/1.1.2/CHANGES.txt">CHANGES.txt</a></p>

<p>This release runs against Hadoop 0.18.3, 0.19.x, and 0.20.x, including Amazon Elastic MapReduce.</p>

<p><em>Note that the tests will not compile or run against Hadoop 0.18.3 due to package changes since that version.</em></p>
]]></description>
            <link>http://www.cascading.org/2010/08/cascading-112.html</link>
            <guid>http://www.cascading.org/2010/08/cascading-112.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Mon, 02 Aug 2010 09:07:56 -0800</pubDate>
        </item>
        
        <item>
            <title>BigDataCamp 2010</title>
            <description><![CDATA[A quick note: Chris will be at <a href="http://bigdatacamp.org">BigDataCamp</a> on June 28, 2010, the night before the Hadoop Summit. Register now before all the seats are taken.]]></description>
            <link>http://www.cascading.org/2010/06/bigdatacamp-2010.html</link>
            <guid>http://www.cascading.org/2010/06/bigdatacamp-2010.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Articles</category>
            
            
            <pubDate>Mon, 14 Jun 2010 12:01:56 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading 1.1.0 Available</title>
            <description><![CDATA[<p>We are happy to announce that Cascading 1.1.0 is now publicly available for <a href="/downloads.html">download</a>.</p>

<p>This release features many performance and usability enhancements while remaining backwards compatible with 1.0.</p>

<p>Specifically:</p>
<ul>
 <li>Performance optimizations for all join types</li>
 <li>Numerous job planner optimizations</li>
 <li>Dynamic optimizations when running in Amazon Elastic MapReduce and S3</li>
 <li>API usability improvements for working with large numbers of field names</li>
 <li>Support for TSV, CSV, and custom delimited text files</li>
 <li>Support for manipulating and serializing non-Comparable custom Java types</li>
 <li>Debug levels supported by the job planner</li>
</ul>

<p>For a detailed list of changes see:
<a href="http://github.com/concurrentinc/cascading/blob/1.1.0/CHANGES.txt">CHANGES.txt</a></p>

<p>Along with this release are a number of <a href="/modules.html">extensions</a> created by the Cascading user community.</p>

<p>Among these extensions are:</p>
<ul>
 <li><a href="http://openbixo.org/">Bixo</a> - a data mining toolkit</li>
 <li><a href="http://github.com/backtype/cascading-dbmigrate">DBMigrate</a> - a tool for migrating data between RDBMSs and Hadoop</li>
 <li>Apache HBase, Amazon SimpleDB, and JDBC integration</li>
 <li>JRuby and Clojure based scripting languages for Cascading</li>
 <li><a href="http://github.com/nathanmarz/cascalog">Cascalog</a> - a robust interactive extensible query language</li>
</ul>

<p>This release runs against Hadoop 0.18.3, 0.19.x, and 0.20.x, including Amazon Elastic MapReduce.</p>

<p><em>Note that the tests will not compile or run against Hadoop 0.18.3 due to package changes since that version.</em></p>
]]></description>
            <link>http://www.cascading.org/2010/04/cascading-110-available.html</link>
            <guid>http://www.cascading.org/2010/04/cascading-110-available.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Thu, 22 Apr 2010 15:18:39 -0800</pubDate>
        </item>
        
        <item>
            <title>Interview on Parallel Programming</title>
            <description><![CDATA[<p>A very interesting <a href="http://www.infoq.com/interviews/billy-newport-parallel">interview with Billy Newport</a> on InfoQ about "the need for higher level abstraction to do parallel programming with multi-core systems effectively."</p>

<p>"Billy Newport is a Distinguished Engineer working on WebSphere eXtreme Scale (ObjectGrid) and on WebSphere high availability."</p>]]></description>
            <link>http://www.cascading.org/2010/04/interview-on-parallel-programm.html</link>
            <guid>http://www.cascading.org/2010/04/interview-on-parallel-programm.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Thu, 22 Apr 2010 14:40:55 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascalog: An Interactive Query Language</title>
            <description><![CDATA[<p>Nathan Marz has just announced and released Cascalog.</p>

<p>Cascalog is an interactive query language for Hadoop with a focus on simplicity, expressiveness, and flexibility, intended for use by analysts and developers alike.</p>

<p>Cascalog eschews the SQL syntax for a simpler and more expressive syntax based on <a href="http://en.wikipedia.org/wiki/Datalog">Datalog</a>.</p>

<p>With this added expressiveness, Cascalog can query existing data stores "out of the box", with no data "importing" or "under the hood" configuration required.</p>

<p>Because Cascalog sits on top of Clojure, a powerful JVM-based language and interactive shell, adding new operations to a query is as simple as defining a new function.</p>

<p>Cascalog also relies on Cascading, a robust data processing API and query planner.</p>

<p>Here is the canonical "word count" query in Cascalog:</p>

<blockquote>(?&lt;- (stdout) [?word ?count] (sentence ?s) (split ?s :&gt; ?word) (c/count ?count))</blockquote>

<p>You can check out an introductory blog post here:
<a href="http://nathanmarz.com/blog/introducing-cascalog/">http://nathanmarz.com/blog/introducing-cascalog/</a></p>

<p>The project is hosted here: <a href="http://github.com/nathanmarz/cascalog">http://github.com/nathanmarz/cascalog</a></p>]]></description>
            <link>http://www.cascading.org/2010/04/cascalog-an-interactive-query.html</link>
            <guid>http://www.cascading.org/2010/04/cascalog-an-interactive-query.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Fri, 16 Apr 2010 06:53:52 -0800</pubDate>
        </item>
        
        <item>
            <title>Karmasphere Studio Ships with Cascading</title>
            <description><![CDATA[<p>The recently released <a href="http://www.karmasphere.com/2010/04/12/new-version-of-karmasphere-studio/">Karmasphere Studio 1.2</a> now includes support for Cascading 1.0 in the free community download.</p>

<p>Karmasphere Studio is an IDE and Debugger for Hadoop MapReduce application developers that also includes integration with the Amazon Web Services platform.</p>

<p>With Cascading support built directly into the debugger and IDE, developers can develop and debug complex Hadoop jobs even more quickly.</p>

<p>Also worthy of note, <a href="http://www.karmasphere.com/2010/04/08/on-karmaspheres-first-funding/">Karmasphere recently received $5M Series A funding</a>.</p>]]></description>
            <link>http://www.cascading.org/2010/04/karmasphere-studio-ships-with.html</link>
            <guid>http://www.cascading.org/2010/04/karmasphere-studio-ships-with.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Mon, 12 Apr 2010 14:43:58 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading 1.1 RC3 Available</title>
            <description><![CDATA[<p>Cascading 1.1 RC3 is now available from the <a href="http://www.cascading.org/downloads.html">downloads page</a>.</p>

<p>Note that downloads are no longer served from Google Code; use the links on the downloads page instead.</p>]]></description>
            <link>http://www.cascading.org/2010/04/cascading-11-rc3-available.html</link>
            <guid>http://www.cascading.org/2010/04/cascading-11-rc3-available.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Sun, 04 Apr 2010 12:10:12 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading-DBMigrate</title>
            <description><![CDATA[<p><a href="http://nathanmarz.com/">Nathan</a> at <a href="http://www.backtype.com/">BackType</a> has <a href="http://tech.backtype.com/migrating-data-from-a-sql-database-to-hadoop">announced</a> and released <a href="http://github.com/backtype/cascading-dbmigrate">Cascading-DBMigrate</a>.</p>

<p>In short, DBMigrate is a more flexible and reliable alternative to <a href="http://www.cloudera.com/blog/2009/06/introducing-sqoop/" rel="nofollow">Sqoop</a> for moving data to/from a relational data store.</p>

<p><a href="http://www.cascading.org/modules.html">Cascading.JDBC</a> has been around for quite a while, but DBMigrate overcomes some of its limitations when dealing with MySQL servers and OFFSET/LIMIT queries (AsterData did not have the same limitations).</p>]]></description>
            <link>http://www.cascading.org/2010/03/cascadingdbmigrate.html</link>
            <guid>http://www.cascading.org/2010/03/cascadingdbmigrate.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Fri, 26 Mar 2010 14:43:03 -0800</pubDate>
        </item>
        
        <item>
            <title>Riffle: Lightweight Workflow</title>
            <description><![CDATA[<p><a href="http://n3.nabble.com/Fwd-riffle-small-scale-workflow-manager-td637952.html#a637952%23a637952">Riffle has been announced</a> on the <a href="http://lucene.apache.org/mahout/">Mahout</a> mailing list.</p>

<p><a href="http://github.com/cwensel/riffle">Riffle</a> is a lightweight Java library for executing collections of dependent processes as a single process. It is Apache-licensed, so it can be included in projects that are not GPL-compatible.</p>

<p>The next major version of Cascading (1.2) will support the Riffle annotations so that projects like Mahout and Pig can participate in a <a href="/documentation/features/topological-scheduler.html">Cascading Cascade</a> execution.</p>

<p>Riffle can be found on its GitHub <a href="http://github.com/cwensel/riffle">project page</a>.</p>]]></description>
            <link>http://www.cascading.org/2010/03/riffle-lightweight-workflow.html</link>
            <guid>http://www.cascading.org/2010/03/riffle-lightweight-workflow.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Fri, 26 Mar 2010 13:52:26 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading 1.1 RC1 Available</title>
            <description><![CDATA[<p>Cascading 1.1 RC1 is now available from the <a href="http://www.cascading.org/downloads.html">downloads page</a>.</p>

<p>You can read about all the changes in the <a href="http://github.com/concurrentinc/cascading/blob/1.1.rc1/CHANGES.txt">CHANGES.txt file</a>.</p>

<p>Note that downloads are no longer served from Google Code; use the links on the downloads page instead.</p>]]></description>
            <link>http://www.cascading.org/2010/03/cascading-11-rc1-available.html</link>
            <guid>http://www.cascading.org/2010/03/cascading-11-rc1-available.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Tue, 23 Mar 2010 11:08:28 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading at RazorFish and AWS</title>
            <description><![CDATA[Check out the new Case Study published by Amazon on <a href="http://www.concurrentinc.com/news-events/entry/case_study_razorfish_user_segmentation_with_cascading_and_amazon_elastic_ma/">User Segmentation at RazorFish</a>.]]></description>
            <link>http://www.cascading.org/2010/03/cascading-at-razorfish-and-aws.html</link>
            <guid>http://www.cascading.org/2010/03/cascading-at-razorfish-and-aws.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Sat, 20 Mar 2010 13:51:03 -0800</pubDate>
        </item>
        
        <item>
            <title>SimpleDB Support</title>
            <description><![CDATA[<p><a href="http://bixolabs.com/">Bixo Labs</a> has <a href="http://bixolabs.com/2010/03/16/simpledb-tap-for-cascading/">recently announced</a> a new project for integrating Hadoop and Cascading with Amazon SimpleDB. Check it out on GitHub at <a href="http://github.com/bixolabs/cascading.simpledb">cascading.simpledb</a>.</p>

<p>This is in part a result of their <a href="http://bixolabs.com/2009/11/01/announcing-the-public-terabyte-dataset-project/">Public Terabyte Dataset Project</a> in AWS.</p>]]></description>
            <link>http://www.cascading.org/2010/03/simpledb-support.html</link>
            <guid>http://www.cascading.org/2010/03/simpledb-support.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Sat, 20 Mar 2010 13:41:15 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading 1.1 User Guide Draft</title>
            <description><![CDATA[<p>In anticipation of the Cascading 1.1 release this month, we have published a draft of the <a href="http://www.cascading.org/documentation/userguide.html">1.1 User Guide</a>.</p>

<p>Please feel free to review it and email any comments or suggestions to the <a href="http://groups.google.com/group/cascading-user/topics">mailing list</a>.</p>

<p>To download the most recent build of Cascading 1.1, please visit the <a href="http://www.concurrentinc.com/downloads/">download page</a> at Concurrent. There are plans to have a 1.1 final release candidate available on the community site this week.</p>]]></description>
            <link>http://www.cascading.org/2010/03/cascading-11-user-guide-draft.html</link>
            <guid>http://www.cascading.org/2010/03/cascading-11-user-guide-draft.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">News</category>
            
            
            <pubDate>Sun, 07 Mar 2010 21:36:15 -0800</pubDate>
        </item>
        
    </channel>
</rss>
