by mxmxm on 5/12/15, 12:23 PM with 2 comments
by gopalv on 5/12/15, 6:58 PM
The page based inner loop makes it very predictable when it does data operations - they were doing this early last year and this mechanism made it very cpu cache friendly & didn't trigger the massive GC pauses.
I was playing with Flink a bit earlier, because Flink can be integrated into Tez, so that Tez could do DAG scheduling while Flink ran with its inner loops on turbo.
That inner loop can edge out even hand-written java code I wrote for page-rank (delta iterations are nice).
by jtagx on 5/12/15, 12:36 PM