EXPERT RESPONSE
There are several different approaches to configuring fault tolerance for a disk subsystem. If you're looking for protection against hard disk failures, RAID is generally the most common option. Some storage solutions provide snapshot and file system replication options, as well. The details of how to implement fault tolerance will vary based on your specific budget, performance, and capacity needs.
Similarly, the process of restoring from a failure will vary significantly based on the high-availability or fault tolerance method you select. For example, if you're using shared storage clustering, you'll need to restore the failed node and then perform a fail-back operations. The bottom line is that there's no single way to implement and manage fault tolerance. Good luck!
|