Chapter 4. A Journey to the Center of the Storage Brain
The Past: Refrigerators and Cards
- Skip
The Middle Ages: The Age of RAID Controllers
RAID controllers combined the basic functionality of disk controllers with the ability to group drives together for added performance and reliability
- Communication between the underlying disks and the attached host computer
- Protection against disk failure through the creation of one or more RAID groups
- Dual-controller systems, which could be used for added performance, availability and automated failover
The Modern Age: Storage Array Controllers
- The storage industry became very volatile during the 90s
- Modern-day networked storage was born.
- EMC and NetApp capitalized on increased storage intelligence and grew at record paces
Enterprise Array Controllers: A More Intelligent Brain
As the storage industry moved from the late 90s into the mid 00s, the market stabilized into a smaller albeit more mature set of suppliers
Table: Data Management Tasks Addressed by Modern Arrays
Function
|
Description
|
Performance
|
Storage arrays are required to quickly process a multitude of data I/O requests arriving simultaneously from hundreds (or thousands) of desktops and servers
|
Resiliency
|
Ø Automated RAID Rebuilds
Ø Data Integrity Checks
Ø “Phone Home” Alerts
Ø Environmental Monitoring
Ø Non-disruptive Software Updates
|
Virtualization
|
Ø Transparent Volume/LUN resizing
Ø Thin Provisioning
Ø Data Cloning
Ø Data Compression
Ø Data Deduplication
|
Chapter 5. Without Memory, You Don’t Have a Brain
Human
|
PC
|
Sensory memory
|
buffers and registers
|
Short-term memory
|
cache, Random Access Memory (RAM), flash and solid-state disks (SSDs)
|
Long-term memory
|
hard disk drives
|
Era
|
Memory Type
|
40s-50s
|
Cathode Ray Tubes
|
50s-60s
|
Magnetic Core Memory
|
70s-present
|
Random Access Memory (RAM) integrated circuits
|
NetApp add some interesting algorithm-level intelligence to its PAM cache
- Priority-based caching
- Non-redundant data caching
- Predictive caching
- Immediate caching
- Metadata caching
Here are the trends we’ll see in the industry’s efforts to reach this goal
- Application servers and workstations will cache more and more of their own data
- Storage networking switches will cache more and more data as it travels through their path
- Storage systems will cache more and more front-end data
- SSDs will become the first line of defense for large amounts of data that can’t be stored in cache
- Hybrid disk drives will be the final step
Chapter 6. The Storage Nervous System
NAS Communications Protocols
|
|
NFS
|
NFS stands for the Network File System
protocol used by UNIX-
based clients or servers
|
CIFS
|
CIFS stands for Common Internet File
System. CIFS is another
network-based protocol used for file
access communications
between Microsoft Windows clients and
servers.
|
SAN Communications Protocols
|
|
Fibre Channel Protocol (FCP)
|
Typically occurs via specialized Fibre
Channel cabling, Fibre Channel host bus adapters (HBAs) and Fibre Channel
switches operating between the SAN and its hosts
|
iSCSI
|
iSCSI stands for Internet Small Computer
Systems Interface. iSCSI is “a transport protocol that provides for the SCSI
protocol to
be carried over a TCP-based IP network.”
|
FcoE
|
FcoE stands for Fibre Channel over
Ethernet. FcoE allows Fibre Channel storage traffic to be sent over Ethernet networks
|
What is data package collision in network?
- A network collision occurs when more than one device attempts to send a packet on a network segment at the same time
- Collisions are resolved using carrier sense multiple access with collision detection in which the competing packets are discarded and re-sent one at a time
- This becomes a source of inefficiency in the network
- Collision domains are found in a hub environment
- Collision domains are also found in wireless networks such asWi-Fi.
- Modern wired networks use a network switch to eliminate collisions.
10 Megabit per second (Mbps)
Ethernet was the norm with 100 Mbps Ethernet just
emerging.
SANs came out of the gate with fiber optic cables and a protocol that could move data at 1 Gigabit per second (Gbps), a ten-fold improvement over the fastest Ethernet-based NAS transport- anyone with a “need for speed” simply had to use SAN storage
- Databases, transaction processing, analytics, and similar applications fell into this category
- NAS, although less costly and easier to implement than SAN, was usually relegated to “slower” applications
- User files, images and Web content
New Realities Point to SAN/NAS Convergence and Unification
10-Gigabit Ethernet (10-GbE or 10 Gbps) is common. 100-Gigabit Ethernet (100-GbE or 100 Gbps) devices have also been demonstrated. Fibre Channel networks operating at 4-Gbps are common today, with 8-Gbps having also been newly delivered.
- Unifying block and file data transport onto the same network fabric
- Unifying SAN and NAS data storage functionality onto the same storage system.
Advancing Intelligence for Internal Communications
SAN Virtual Storage
- add a second logical abstraction layer to the physical disk drives
- did not map LUNs to physical drives, it mapped LUNs to the logical blocks stored on these drives
- NetApp had created an intelligent internal communications network specifically designed for storage systems. The WAFL file system provided a communications network based on logical file-based objects
Next breakthrough in storage protocols and communication becomes object-based storage, SONET, RDMA
沒有留言:
張貼留言