by pretz on 10/29/10, 9:26 PM with 13 comments
by stevejohnson on 10/30/10, 12:31 AM
I can confirm that it is a great way to learn about MapReduce.
Link: http://github.com/irskep/mrjob/tree/py3k
I will likely totally restart the py3k port now that I know what I am doing a bit better. I've been writing Python 3 for about, oh, two weeks.
by ashika on 10/30/10, 3:10 AM
Disco (http://discoproject.org) is a really elegant MR framework implemented in erlang and python, with additional support for jobs in C and Java. I've used it for a little over a year and am convinced it is the superior MR platform (Hadoop's terasort victories notwithstanding). New features are being integrated very quickly, the core platform is rock solid, management is simple and it's extremely flexible.
by derwiki on 10/29/10, 10:10 PM
by deathflute on 10/30/10, 7:27 PM
by FraaJad on 10/29/10, 10:56 PM
by LiveTheDream on 10/30/10, 8:54 PM