<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Cascading</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/" />
    <link rel="self" type="application/atom+xml" href="http://www.cascading.org/atom.xml" />
    <id>tag:www.cascading.org,2008-04-05://2</id>
    <updated>2010-08-26T18:17:18Z</updated>
    
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type Open Source 4.1</generator>

<entry>
    <title>Bixo Hackathon</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/08/bixo-hackathon.html" />
    <id>tag:www.cascading.org,2010://2.502</id>

    <published>2010-08-26T18:17:18Z</published>
    <updated>2010-08-26T18:17:18Z</updated>

    <summary>There will be a Bixo hackathon in Nevada City, CA this Sept 7th and 8th. Read more about it here. Note that even if you’re not a hard-core Bixo user, fringe benefits from participating include learning a lot about the...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>There will be a <a href="http://openbixo.org/">Bixo</a> hackathon in Nevada City, CA this Sept 7th and 8th. Read more about it <a href="http://openbixo.org/2010/08/26/bixo-hackaton-september-7th-8th/">here</a>.</p>

<p>Note that even if you’re not a hard-core Bixo user, fringe benefits from participating include learning a lot about the very useful underlying technologies (Cascading, Hadoop, HttpClient) as well as getting an excuse to visit beautiful <a href="http://www.nevadacitychamber.com/">Nevada City</a>.</p>

<p>Hope to see you there.</p>]]>
        
    </content>
</entry>

<entry>
    <title>O&apos;Reilly Strata Conference</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/08/oreilly-strata-conference.html" />
    <id>tag:www.cascading.org,2010://2.501</id>

    <published>2010-08-25T21:03:36Z</published>
    <updated>2010-08-25T21:03:36Z</updated>

    <summary>The new Strata Conference has just been announced with a Call for Proposals ending Sept 28. This new conference is on the &apos;business of data&apos; and is the sister conference to Velocity. Hope to see lots of proposals coming in...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>The new <a href="http://strataconf.com/strata2011">Strata Conference</a> has just been announced with a Call for Proposals ending Sept 28. 
</p>
<p>This new conference is on the 'business of data' and is the sister conference to Velocity.</p>
<p>Hope to see lots of proposals coming in from Hadoop, Cascading, Bixo, and Cascalog users and developers.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascading 1.1.2</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/08/cascading-112.html" />
    <id>tag:www.cascading.org,2010://2.499</id>

    <published>2010-08-02T16:07:56Z</published>
    <updated>2010-08-02T16:07:56Z</updated>

    <summary>We are happy to announce that Cascading 1.1.2 is now publicly available for download. This release features many bug fixes. For a detailed list of changes see: CHANGES.txt This release will run against Hadoop 0.18.3, 0.19.x, and 0.20.x, including Amazon...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>We are happy to announce that Cascading 1.1.2 is now publicly available for <a href="/downloads.html">download</a>.</p>

<p>This release features many bug fixes.</p>

<p>For a detailed list of changes see:
<a href="http://github.com/concurrentinc/cascading/blob/1.1.2/CHANGES.txt">CHANGES.txt</a></p>

<p>This release will run against Hadoop 0.18.3, 0.19.x, and 0.20.x, including Amazon Elastic MapReduce.</p>

<p><em>Note the tests will not compile or run against Hadoop 0.18.3 due to package changes since that version.</em></p>
]]>
        
    </content>
</entry>

<entry>
    <title>BigDataCamp 2010</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/06/bigdatacamp-2010.html" />
    <id>tag:www.cascading.org,2010://2.497</id>

    <published>2010-06-14T19:01:56Z</published>
    <updated>2010-06-14T19:01:56Z</updated>

    <summary>Quick note that Chris will be at the BigDataCamp on June 28, 2010, the night before the Hadoop Summit. Register now before all the seats are taken....</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="Articles" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[Quick note that Chris will be at the <a href="http://bigdatacamp.org">BigDataCamp</a> on June 28, 2010, the night before the Hadoop Summit. Register now before all the seats are taken.]]>
        
    </content>
</entry>

<entry>
    <title>Cascading 1.1.0 Available</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/04/cascading-110-available.html" />
    <id>tag:www.cascading.org,2010://2.496</id>

    <published>2010-04-22T22:18:39Z</published>
    <updated>2010-04-22T22:18:39Z</updated>

    <summary>We are happy to announce that Cascading 1.1.0 is now publicly available for download. This release features many performance and usability enhancements while remaining backwards compatible with 1.0. Specifically: Performance optimizations with all join types Numerous job planner optimizations Dynamic...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>We are happy to announce that Cascading 1.1.0 is now publicly available for <a href="/downloads.html">download</a>.</p>

<p>This release features many performance and usability enhancements while remaining backwards compatible with 1.0.</p>

<p>Specifically:
<ul>
 <li>Performance optimizations with all join types</li>
 <li>Numerous job planner optimizations</li>
 <li>Dynamic optimizations when running in Amazon Elastic MapReduce and S3</li>
 <li>API usability improvements around large number of field names</li>
 <li>Support for TSV, CSV, and custom delimited text files</li>
 <li>Support for manipulating and serializing non-Comparable custom Java types</li>
 <li>Debug levels supported by the job planner</li>
</ul></p>

<p>For a detailed list of changes see:
<a href="http://github.com/concurrentinc/cascading/blob/1.1.0/CHANGES.txt">CHANGES.txt</a></p>

<p>Along with this release are a number of <a href="/modules.html">extensions</a> created by the Cascading user community.</p>

<p>Among these extensions are:
<ul>
 <li><a href="http://openbixo.org/">Bixo</a> - a data mining toolkit</li>
 <li><a href="http://github.com/backtype/cascading-dbmigrate">DBMigrate</a> - a tool for migrating data between RDBMSs and Hadoop</li>
 <li>Apache HBase, Amazon SimpleDB, and JDBC integration</li>
 <li>JRuby and Clojure based scripting languages for Cascading</li>
 <li><a href="http://github.com/nathanmarz/cascalog">Cascalog</a> - a robust interactive extensible query language</li>
</ul></p>

<p>This release will run against Hadoop 0.18.3, 0.19.x, and 0.20.x, including Amazon Elastic MapReduce.</p>

<p><em>Note the tests will not compile or run against Hadoop 0.18.3 due to package changes since that version.</em></p>
]]>
        
    </content>
</entry>

<entry>
    <title>Interview on Parallel Programming</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/04/interview-on-parallel-programm.html" />
    <id>tag:www.cascading.org,2010://2.495</id>

    <published>2010-04-22T21:40:55Z</published>
    <updated>2010-04-22T21:40:55Z</updated>

    <summary>A very interesting interview with Billy Newport on InfoQ about &quot;the need for higher level abstraction to do parallel programming with multi-core systems effectively.&quot; &quot;Billy Newport is a Distinguished Engineer working on WebSphere eXtreme Scale (ObjectGrid) and on WebSphere high...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>A very interesting <a href="http://www.infoq.com/interviews/billy-newport-parallel">interview with Billy Newport</a> on InfoQ about "the need for higher level abstraction to do parallel programming with multi-core systems effectively."</p>

<p>"Billy Newport is a Distinguished Engineer working on WebSphere eXtreme Scale (ObjectGrid) and on WebSphere high availability."</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascalog: An Interactive Query Language</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/04/cascalog-an-interactive-query.html" />
    <id>tag:www.cascading.org,2010://2.494</id>

    <published>2010-04-16T13:53:52Z</published>
    <updated>2010-04-16T13:53:52Z</updated>

    <summary>Nathan Marz has just announced and released Cascalog. Cascalog is an interactive query language for Hadoop with a focus on simplicity, expressiveness, and flexibility intended to be used by Analysts and Developers alike. Cascalog eschews the SQL syntax for a...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p><a href="http://nathanmarz.com/">Nathan Marz</a> has just announced and released Cascalog.</p>

<p>Cascalog is an interactive query language for Hadoop with a focus on simplicity, expressiveness, and flexibility intended to be used by Analysts and Developers alike.</p>

<p>Cascalog eschews the SQL syntax for a simpler and more expressive syntax based on <a href="http://en.wikipedia.org/wiki/Datalog">Datalog</a>.</p>

<p>With this added expressiveness, Cascalog can query existing data stores "out of the box", with no data "importing" or "under the hood" configuration required.</p>

<p>Because Cascalog sits on top of Clojure, a powerful JVM based language and interactive shell, adding new operations to a query is as simple as defining a new function.</p>

<p>Cascalog also relies on Cascading, a robust data processing API and query planner.</p>

<p>Here is the canonical "word count" query in Cascalog:

<blockquote>(?&lt;- (stdout) [?word ?count] (sentence ?s) (split ?s :&gt; ?word) (c/count ?count))</blockquote></p>

<p>You can check out an introductory blog post here:
<a href="http://nathanmarz.com/blog/introducing-cascalog/">http://nathanmarz.com/blog/introducing-cascalog/</a></p>

<p>The project is hosted here: <a href="http://github.com/nathanmarz/cascalog">http://github.com/nathanmarz/cascalog</a></p>]]>
        
    </content>
</entry>

<entry>
    <title>Karmasphere Studio Ships with Cascading</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/04/karmasphere-studio-ships-with.html" />
    <id>tag:www.cascading.org,2010://2.493</id>

    <published>2010-04-12T21:43:58Z</published>
    <updated>2010-04-12T21:43:58Z</updated>

    <summary>The recently released Karmasphere Studio 1.2 now includes support for Cascading 1.0 in the free community download. Karmasphere Studio is an IDE and Debugger for Hadoop MapReduce application developers that also includes integration with the Amazon Web Services platform. And...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>The recently released <a href="http://www.karmasphere.com/2010/04/12/new-version-of-karmasphere-studio/">Karmasphere Studio 1.2</a> now includes support for Cascading 1.0 in the free community download.</p>

<p>Karmasphere Studio is an IDE and Debugger for Hadoop MapReduce application developers that also includes integration with the Amazon Web Services platform.</p>

<p>With Cascading support built directly into the Debugger and IDE, developers can develop and debug complex Hadoop jobs even more quickly.</p>

<p>Also worthy of note, <a href="http://www.karmasphere.com/2010/04/08/on-karmaspheres-first-funding/">Karmasphere recently received $5M Series A funding</a>.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascading 1.1 RC3 Available</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/04/cascading-11-rc3-available.html" />
    <id>tag:www.cascading.org,2010://2.492</id>

    <published>2010-04-04T19:10:12Z</published>
    <updated>2010-04-04T19:10:12Z</updated>

    <summary>Cascading 1.1 RC3 is now available from the downloads page. Note that downloads are no longer served from Google Code but from links on the downloads page....</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>Cascading 1.1 RC3 is now available from the <a href="http://www.cascading.org/downloads.html">downloads page</a>.</p>

<p>Note that downloads are no longer served from Google Code but from links on the downloads page.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascading-DBMigrate</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/03/cascadingdbmigrate.html" />
    <id>tag:www.cascading.org,2010://2.490</id>

    <published>2010-03-26T21:43:03Z</published>
    <updated>2010-03-26T21:43:03Z</updated>

    <summary>Nathan at BackType has announced and released Cascading-DBMigrate. In short, DBMigrate is a more flexible and reliable alternative to Sqoop for moving data to/from a relational data store. Cascading.JDBC has been around for quite a while, but DBMigrate overcomes some...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p><a href="http://nathanmarz.com/">Nathan</a> at <a href="http://www.backtype.com/">BackType</a> has <a href="http://tech.backtype.com/migrating-data-from-a-sql-database-to-hadoop">announced</a> and released <a href="http://github.com/backtype/cascading-dbmigrate">Cascading-DBMigrate</a>.</p>

<p>In short, DBMigrate is a more flexible and reliable alternative to <a href="http://www.cloudera.com/blog/2009/06/introducing-sqoop/" rel="nofollow">Sqoop</a> for moving data to/from a relational data store.</p>

<p><a href="http://www.cascading.org/modules.html">Cascading.JDBC</a> has been around for quite a while, but DBMigrate overcomes some of the limitations when dealing with MySQL servers (AsterData did not have the same limitations) and OFFSET/LIMIT queries.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Riffle: Lightweight Workflow</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/03/riffle-lightweight-workflow.html" />
    <id>tag:www.cascading.org,2010://2.489</id>

    <published>2010-03-26T20:52:26Z</published>
    <updated>2010-03-26T20:52:26Z</updated>

    <summary>Riffle has been announced on the Mahout mailing list. Riffle is a lightweight Java library for executing collections of dependent processes as a single process. It is Apache licensed so it can be included in non-GPL compatible projects. The next...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p><a href="http://n3.nabble.com/Fwd-riffle-small-scale-workflow-manager-td637952.html#a637952%23a637952">Riffle has been announced</a> on the <a href="http://lucene.apache.org/mahout/">Mahout</a> mailing list.</p>

<p><a href="http://github.com/cwensel/riffle">Riffle</a> is a lightweight Java library for executing collections of dependent processes as a single process. It is Apache licensed so it can be included in non-GPL compatible projects.</p> 

<p>The next major version of Cascading (1.2) will support the Riffle annotations so that projects like Mahout and Pig can participate in a <a href="/documentation/features/topological-scheduler.html">Cascading Cascade</a> execution.</p>

<p>Riffle can be found on its GitHub <a href="http://github.com/cwensel/riffle">project page</a>.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascading 1.1 RC1 Available</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/03/cascading-11-rc1-available.html" />
    <id>tag:www.cascading.org,2010://2.488</id>

    <published>2010-03-23T18:08:28Z</published>
    <updated>2010-03-23T18:08:28Z</updated>

    <summary>Cascading 1.1 RC1 is now available from the downloads page. You can read about all the changes in the CHANGES.txt file. Note that downloads are no longer served from Google Code but from links on the downloads page....</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>Cascading 1.1 RC1 is now available from the <a href="http://www.cascading.org/downloads.html">downloads page</a>.</p>

<p>You can read about all the changes in the <a href="http://github.com/concurrentinc/cascading/blob/1.1.rc1/CHANGES.txt">CHANGES.txt file</a>.</p>

<p>Note that downloads are no longer served from Google Code but from links on the downloads page.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascading at RazorFish and AWS</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/03/cascading-at-razorfish-and-aws.html" />
    <id>tag:www.cascading.org,2010://2.487</id>

    <published>2010-03-20T20:51:03Z</published>
    <updated>2010-03-20T20:51:03Z</updated>

    <summary>Check out the new Case Study published by Amazon on User Segmentation at RazorFish....</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[Check out the new Case Study published by Amazon on <a href="http://www.concurrentinc.com/news-events/entry/case_study_razorfish_user_segmentation_with_cascading_and_amazon_elastic_ma/">User Segmentation at RazorFish</a>.]]>
        
    </content>
</entry>

<entry>
    <title>SimpleDB Support</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/03/simpledb-support.html" />
    <id>tag:www.cascading.org,2010://2.486</id>

    <published>2010-03-20T20:41:15Z</published>
    <updated>2010-03-20T20:41:15Z</updated>

    <summary>Bixo Labs has recently announced a new project for integrating Hadoop and Cascading with Amazon SimpleDB. Check it out on GitHub at cascading.simpledb. This is in part a result of their Public Terabyte Dataset Project in AWS....</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p><a href="http://bixolabs.com/">Bixo Labs</a> has <a href="http://bixolabs.com/2010/03/16/simpledb-tap-for-cascading/">recently announced</a> a new project for integrating Hadoop and Cascading with Amazon SimpleDB. Check it out on GitHub at <a href="http://github.com/bixolabs/cascading.simpledb">cascading.simpledb</a>.</p>

<p>This is in part a result of their <a href="http://bixolabs.com/2009/11/01/announcing-the-public-terabyte-dataset-project/">Public Terabyte Dataset Project</a> in AWS.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Cascading 1.1 User Guide Draft</title>
    <link rel="alternate" type="text/html" href="http://www.cascading.org/2010/03/cascading-11-user-guide-draft.html" />
    <id>tag:www.cascading.org,2010://2.484</id>

    <published>2010-03-08T05:36:15Z</published>
    <updated>2010-03-08T05:36:15Z</updated>

    <summary>In anticipation of the Cascading 1.1 release this month, we have published a draft of the 1.1 User Guide. Please feel free to review and email in any comments or suggestions to the mailing list. To download the most recent...</summary>
    <author>
        <name></name>
        <uri>http://chris.wensel.net/</uri>
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.cascading.org/">
        <![CDATA[<p>In anticipation of the Cascading 1.1 release this month, we have published a draft of the <a href="http://www.cascading.org/documentation/userguide.html">1.1 User Guide</a>.</p>

<p>Please feel free to review it and email any comments or suggestions to the <a href="http://groups.google.com/group/cascading-user/topics">mailing list</a>.</p>

<p>To download the most recent build of Cascading 1.1, please visit the <a href="http://www.concurrentinc.com/downloads/">download page</a> at Concurrent. There are plans to have a 1.1 final release candidate available on the community site this week.</p>]]>
        
    </content>
</entry>

</feed>
