by Oculus on 10/26/14, 3:03 PM with 124 comments
by KaiserPro on 10/26/14, 4:38 PM
Its grand that its worked out for them, but there are a few big drawbacks that backblasze have software'd their way around.
Nowadays its cheaper to use an engenio based array like the MD3260 (also sold/made by netapps/LSI)
First you can hot swap the disks. Second you don't need to engineer your own storage manager. Thirdly you get much much better performance(2gigabytes a second sustained, which is enough to saturate to 10 gig nics). Fourthly you can get 4 hour 24/7 response. Finally the air flow is a bit suspect.
we use a 1u two socket server with SAS to server the data.
If you're brave, you can skip the raid controller and the JBOD enclosure instead and ZFS over the top. However ZFS fragments like a bitch, so watch out if you're running at 75% plus
by 2close4comfort on 10/26/14, 3:10 PM
by jpalomaki on 10/26/14, 6:17 PM
by andyidsinga on 10/26/14, 4:02 PM
> But the intelligence of where to store data and how to encrypt it, deduplicate it, and index it is all at a higher level (outside the scope of this blog post).
I'm curious about their software that works outside the nodes too. I've been working on storage clusters over this past 9 months using the Ceph ( http://ceph.com/ ) open source storage software. Its pretty amazing -- and I suspect it could be deployed to a set of backblaze pods too.
It seems to be that for production environment where you wanted to maintain availability you would to build at least 3 of those pods for any deployment - enabling replication across pods/storage nodes.
by jdub on 10/26/14, 5:34 PM
by bkruse on 10/26/14, 7:31 PM
Playing with Swift now - but it has really opened my eyes to how much more difficult 2-3 petabytes of storage is (disk failures, number of disks in your infrastructure, the time to redeploy a datacenter on a 1gpbs connection). All the little problems become much bigger!
by harel on 10/26/14, 8:40 PM
by corv on 10/26/14, 3:41 PM
Are they checksumming on a higher level and is that cheaper than using ZFS with necessarily more expensive hardware?
by immortalx on 10/26/14, 7:29 PM
Seems to me like the design is backwards and doesn't make any sense.
by ciupicri on 10/26/14, 3:38 PM
by hendzen on 10/26/14, 3:25 PM
by fredsted on 10/26/14, 5:48 PM
In the meantime, does anyone have a link?
by aliakhtar on 10/26/14, 6:48 PM
by mschuster91 on 10/26/14, 3:18 PM
edit: just spotted it, their boot drive is PATA?! Why is this, given that PATA drives are slower and more expensive than SATA ones?
by ksec on 10/26/14, 3:49 PM
by tkinom on 10/26/14, 3:38 PM
Love to see more write up on software selection process, tradeoff, failure recovery process/methods and benchmark data.
by sidcool on 10/26/14, 4:01 PM
by iflyun on 10/26/14, 7:33 PM
by codeonfire on 10/26/14, 4:22 PM