
I have a lot of data to work with, and I want to do it with just my mismatched bunch of servers, desktops, SSDs, and spinning disks. My equipment is old, so I want a filesystem that is robust not only to the failure of any drive, but also to the failure of any one machine. My preference is to build a hyper-converged system, where each machine hosts data in addition to working on compute jobs. Following are reviews I found on the main open-source distributed filesystems out there:
Continue reading