ServeRAID disk drive error recovery
While a drive is in manufacturing, a set of low level tests is run against it to establish two internal sector lists. These tests create the "known good sectors" and the "known bad sectors" lists. The hard drive is then locked to their capacity by the firmware installed and defines which sectors become actively available. For example, a 36GB drive may actually accommodate 38GB of usable space. This extra space is listed in the NVRAM of the drive in another list called the "Known good reserved sectors".[@more@]
Sector sparingWhile a drive is in operation, the head may come across a sector with a weakened magnetic reading. The data is still readable but may fall below the preferred threshold for qualified good sector readings. This disk drive would consider this a failing sector and would "sector spare" this data to a new location available in the "known good reserve" list. Once the data is moved, the old sector address is added to the "Grown Defects" list, never to be used again. This process is a "recoverable" media error. The drive will give a Performance Failure Alert (PFA) once the drive "sector spares" the majority of its "known good reserved sectors". Hard drives do this as a routine, and PFAs are part of the Mean-Time-Between-Failures (MTBF) calculations for the drives.
Using this same example, a drive will only know to sector spare when it does a read, read-modify-write or a write-verify to a sector. This is important because if a drive does not read or write to a sector that is failing, the drive will never know to correct the problem, resulting in an "unrecoverable media error" on a future read or write before the disk can save the data. When an "unrecoverable media error" occurs, sector sparing still takes place, but no data can be moved.
Knowing this, you can use simple math to see that the risk of problems is doubled when you go from one drive to two drives in an array. If there are ten (10) to sixteen (16) drives in an array, media errors become more common.
Synchronization
Relating this to IBM's ServeRAID technology and synchronization, syncs are designed to force all the physical hard disks in a logical drive or array to do a read to each sector. This will cause the drives to sector spare "recoverable" media errors, hopefully before they become unrecoverable errors. If an "unrecoverable" media error occurs, it is corrected by the ServeRAID controller synchronizations operation on redundant logical drives, (RAID-1, RAID-1E, RAID-5, RAID-5E, RAID-5EE, RAID-10, RAID-1e0, and RAID-50) by rewriting the missing data.
Foreground syncs can be manually initiated two ways, by using the ServeRAID Manager GUI or using the IPSSEND command. The IPSSEND command can be used in a BAT or CMD file and then automated using most any scheduling utility.
引自:
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/743764/viewspace-1004010/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- mysql crash with disk sector errorMySqlError
- Troubleshooting ServeRAIDServerAI
- one drive
- Disk Drill Media Recovery for Mac(支援多種格式的資料恢復工具)Mac資料恢復
- parallels desktop安裝win10開機顯示non-system disk or disk error怎麼辦ParallelWin10Error
- Gradient_patch_recovery, Z_Z posteriori error estimatorError
- ServeRAID mismatched software levels can result in system problemsServerAI
- 執行SQL發生錯誤!錯誤:disk I/O errorSQLError
- Create Bootable USB Driveboot
- 【BUUCTF】Youngter-drive
- Common Causes and Solutions on ORA-376 Error Found in Backup & RecoveryError
- OGG-01028 Recovery record is missing ERRORError
- 多種格式的資料恢復軟體Disk Drill Media Recovery 4.5.972中文資料恢復
- OGG-01028 Recovery record is missing ERROR[續]Error
- Error 945 Database cannot be opened due to inaccessible files or insufficient memory or disk spaceErrorDatabase
- MRP0- Background Media Recovery terminated with error 1111Error
- DISK OPERATION ERROR,交換機級聯單模光纖鏈路故障解決Error
- 求助:安裝紅帽子重引導後出現GRUB Hard Disk Error(轉)Error
- 手工建立ASM Disk Groups、為 ASM Disk Groups 新增 diskASM
- drop asm disk、撤銷drop asm diskASM
- Causes and Solutions on ORA-1113 Error Found in Backup & Recovery_183367.1Error
- MRP0: Background Media Recovery terminated with error 328,ORA-00328Error
- NTFS Disk by Omi NTFS Mac;NTFS Disk by Omi NTFSMac
- PBOOTCMS網站程式提示“執行SQL發生錯誤!錯誤:DISK I/O ERROR”boot網站SQLError
- move linux os from disk A to disk B with 0 lossLinux
- Asm disk managerASM
- win10電腦開機出現a disk read error occurred錯誤修復方法Win10Error
- ttAdmin -Ramload Error TT0848: Recovery failed on 2 set(s) of data store filesErrorAI
- 利用 iCloud Drive 來同步 Xcode 配置CloudXCode
- Google Drive和Dropbox大比拼Go
- Oracle ASM on Linux with single Hard DriveOracleASMLinux
- win10系統開啟冰封王座出現錯誤提示CD-ROM drive error怎麼解決Win10Error
- Azure Managed Disk操作
- Oracle ASM Disk DirectoryOracleASM
- Oracle ASM Disk PartnerOracleASM
- disk_repair_timeAI
- ASM Disk Group TemplateASM
- Crash recovery和Media recovery的區別