For example, this may be a small business server supporting an office of ten users or less—the storage system would be the hard drives that are inside of that server where user information is located. In large business environments, the storage systems can be the large SAN cabinet that is full of hard drives and the space has been sliced-and-diced in different ways to provide redundancy and performance.
The Ever-Changing Storage System Technology
Today’s storage technology encompasses all sorts of storage media. These could include WORM systems, tape library systems and virtual tape library systems. Over the past few years, SAN and NAS systems have provided excellent reliability. What is the difference between the two?
For some time, large-scale storage has been out reach of the small business. Serial ATA (SATA) hard disk drive-based SAN systems are becoming a cost-effective way of providing large amounts of storage space. These array units are also becoming mainstream for virtual tape backup systems—literally RAID arrays that are presented as tape machines; thereby removing the tape media element completely.
Other storage technologies such as iSCSI (learn more here), DAS (Direct Attached Storage), Near-Line Storage (data that is attached to removable media), and CAS (Content Attached Storage) are all methods for providing data availability. Storage Architects know that just having a ‘backup’ is not enough. In today’s high information environments, a normal nightly incremental or weekly full backup is obsolete in hours or even minutes after creation. In large data warehouse environments, backing up data that constantly changes is not even an option. The only method for those massive systems is to have storage system mirrors—literally identical servers with the exact same storage space.
How does one decide which system is best? (See Gartner Research-Disk Storage Overview in the reference section).
Careful analysis of the operation environment is required. Most would say that having no failures at all is the best environment—that is true for users and administrators alike! The harsh truth is that data disasters happen every day despite the implementation of risk mitigation policies and plans.
When reviewing your own or your client’s storage needs, consider these questions:
Unique Data Protection Schemes
Storage System manufacturers are pursuing unique ways of processing large amounts of data while still being able to provide redundancy in case of disaster. Some large SAN units incorporate intricate device block-level organisation, essentially creating a low-level file system from the RAID perspective. Other SAN units have an internal block-level transaction log in place so that the Control Processor of the SAN is tracking all of the block-level writes to the individual disks. Using this transaction log, the SAN unit can recover from unexpected power failures or shutdowns. (See Examples of some SAN systems - references section)
Some computer scientists specialising in the storage system field are proposing adding more intelligence to the RAID array controller card so that it is ‘file system aware.’ This technology would provide more recoverability in case disaster struck, the goal being the storage array would become more self-healing. (See On the Cutting Edge of Storage Technology-Smarter RAID Controller Cards - references section)
Other ideas along these lines are to have a heterogeneous storage pool where multiple computers can access information without being dependant on a specific system’s file system. In organisations where there are multiple hardware and system platforms, a transparent file system will provide access to data regardless of what system wrote the data. (See On the Cutting Edge of Storage Technology-SAN File System - references section)
Other computer scientists are approaching the redundancy of the storage array quite differently. The RAID concept is in use on a vast number of systems, yet computer scientists and engineers are looking for new ways to provide better data protection in case of failure. The goals that drive this type of RAID development are data protection and redundancy without sacrificing performance. (See On the Cutting Edge of Storage Technology-RAID Implementation Improvements - references section)
Reviewing the University of California, Berkeley (See How Much Information Produced in 2003?) report about the amount of digital information that was produced 2003 is staggering. Your or your client’s site may not have terabytes or petabytes of information, yet during a data disaster, every file is critically important.
Avoiding Storage System Failures
There are many ways to reduce or eliminate the impact of storage system failures. You may not be able to prevent a disaster from happening, but you may be able to minimise the disruption of service to your clients.
There are many ways to add redundancy to primary storage systems. Some of the options can be quite costly and only large business organisations can afford the investment. These options include duplicate storage systems or identical servers, known as ‘mirror sites’. Additionally, elaborate backup processes or file-system ‘snapshots’ that always have a checkpoint to restore to, provide another level of data protection.
Experience has shown there are usually multiple or rolling failures that happen when an organisation has a data disaster. Therefore, to rely on just one restoration protocol is shortsighted. (See Backup Basics - references section) A successful storage organisation will have multiple layers of restoration pathways.
Ontrack has heard thousands of IT horror stories of initial storage failures turning into complete data calamities. In an effort to bring back a system, some choices can permanently corrupt the data. Here are several risk mitigation policies that storage administrators can adopt that will help minimise data loss when a disaster happens:
The Leader in Storage System Recoveries Ontrack Data Recovery has been successfully recovering data from large storage systems for many years. Ontrack’s unique approach is what sets us apart from other data recovery companies.
A recovery of a data volume implementing a RAID configuration starts out with a Senior Engineer evaluating each hard disk involved and analysing the data structures to determine the proper recovery path. There is no standard configuration for these systems and each OEM implements RAID configurations differently, making every job unique and challenging. The final step is verifying the file system is correctly pointing to the data, validating the file system information and data.
These types of recoveries are the pinnacle of engineering challenges. It is amasing to see one of these systems come together after hours of hard work – going from a data disaster to a complete and successful recovery. Often times, these recoveries result in the original files being recovered and archived without any hardware or software manipulation required on the part of the customer.
The storage industry is continually developing new technologies in order to find better ways to preserve data and maintain business continuity. Some failures are beyond the soft recovery methods that the hardware can handle. This is where Ontrack fits into your or your client’s Data Availability plans. Ontrack has services available to accommodate your or your client’s time requirements for original data restoration.
With nearly 20 years experience, high R&D investment and highly trained data recovery engineers, Ontrack Data Recovery is the leader in storage system recoveries. Ontrack is the recovery company of choice for users, partners, and IT professionals who have high requirements for data recovery.
References
Examples of some SAN systems
EMC Clarion SAN
http://www.emc.com/products/systems/clariion_cx700.jsp
EMC Symmetrix SAN
http://www.emc.com/products/systems/DMX_series.jsp
Hitachi Lighting SAN
http://www.hds.com/products_services/storage_systems/enterprise_storage/
Hitachi Thunder SAN
http://www.hds.com/products_services/storage_systems/modular_storage/
On the Cutting Edge of Storage Technology
Smarter RAID Controller Cards
http://www.usenix.org/publications/library/proceedings/fast04/tech/sivathanu.html
RAID Implementation Improvements
http://www.usenix.org/publications/library/proceedings/fast04/tech/corbett.html
http://h18004.www1.hp.com/products/servers/proliantstorage/arraycontrollers/adg/index.html
How Much Information Produced in 2003?
http://www.sims.berkeley.edu/research/projects/how-much-info-2003/execsum.htm
Gartner Research – Storage Systems
http://www4.gartner.com/resources/116400/116480/2003_disk_stora.pdf
© 2004 Kroll Ontrack Inc. All rights reserved.