LINUX系統磁碟FC-SAN ext3系統突然變位只讀了
日誌如下:
Jan 21 03:38:20 D0-LNXAPP03 kernel: SCSI error : <0 0 0 2> return code = 0x20000
Jan 21 03:38:20 D0-LNXAPP03 kernel: end_request: I/O error, dev sda, sector 3301127367
Jan 21 03:38:20 D0-LNXAPP03 kernel: EXT3-fs error (device sda1): ext3_readdir: directory #206293234 contains a hole at offset 0
Jan 21 03:38:20 D0-LNXAPP03 kernel: Aborting journal on device sda1.
Jan 21 03:38:20 D0-LNXAPP03 kernel: ext3_abort called.
Jan 21 03:38:20 D0-LNXAPP03 kernel: EXT3-fs error (device sda1): ext3_journal_start_sb: Detected aborted journal
Jan 21 03:38:20 D0-LNXAPP03 kernel: Remounting filesystem read-only
Jan 21 03:38:20 D0-LNXAPP03 kernel: EXT3-fs error (device sda1) in start_transaction: Journal has aborted
Why does the ext3 filesystems on my Storage Area Network (SAN) repeatedly become read-only?
by Chris Snook
When ext3 encounters possible corruption in filesystem metadata, it aborts the journal and remounts it as read-only to prevent causing damage to the metadata on disk. This can occur due to I/O errors while reading metadata, even if there is no metadata corruption on disk.
If filesystems on multiple disk arrays or accessed by multiple clients are repeatedly becoming read-only in a SAN environment, the most common cause is a SCSI timeout while the Fibre Channel HBA driver is handling an RSCN event on the Fibre Channel fabric.
An RSCN (Registered State Change Notification) is generated whenever the configuration of a Fibre Channel fabric changes, and is propagated to any HBA that shares a zone with the device that changed state. RSCNs may be generated when an HBA, switch, or LUN is added or removed, or when the zoning of the fabric is changed.
Resolution:
Some cases of this behavior. may be due to a known bug in the interaction between NFS and ext3. For this reason, it is recommended that users experiencing this problem on NFS servers update their kernel, at least to version 2.6.9-42.0.2.EL. Here is the link to the related bugzilla entry
The lpfc driver update in Red Hat Enterprise Linux 4 Update 4 includes a change to RSCN handling which prevents this problem in many environments. Users of Emulex HBAs experiencing this problem are advised to update their kernel, at least to version 2.6.9-42.EL. Here is the link to the related bugzilla entry
The lpfc and qla2xxx drivers also have configuration options which cause the driver to handle RSCNs in a less invasive manner, which often prevents timeouts during RSCN handling. These options must be set in the /etc/modprobe.conf file:
options lpfc lpfc_use_adisc=1
options qla2xxx ql2xprocessrscn=1
After making these changes, the initrd must be rebuilt and the system must be rebooted for the changes to take effect.
Recommendation:
This problem may be prevented or mitigated by applying SAN vendor recommended configurations and firmware updates to HBAs, switches, and disk arrays on the fabric, as well as recommended configurations and updates to multipathing software. This particularly applies to timeout and retry settings.
The architecture of Fibre Channel assumes that the fabric changes infrequently, so RSCNs can be disruptive even on properly configured fabrics. Events which generate RSCNs should be minimized, particularly at times of high activity, since this causes RSCN handling to take longer than it would on a mostly idle fabric.
In multipathed environments with separate fabrics for different paths, zone changes to the fabrics should be made far apart in time. It is not uncommon for complete handling of a zone change to take many minutes on a busy fabric with many systems and LUNs. Performing zone changes separately minimizes the risk of all paths timing out due to RSCN handling.
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/7728585/viewspace-684443/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- Linux檔案系統變成只讀Linux
- 檔案系統(十一):Linux Squashfs只讀檔案系統介紹Linux
- Linux系統磁碟管理Linux
- Linux系統磁碟IOLinux
- Linux 檔案系統-ext3 檔案系統介紹(轉)Linux
- ubuntu莫名的 系統出現檔案系統只讀Ubuntu
- 磁碟故障引起的系統變慢定位
- linux磁碟和檔案系統Linux
- Linux(centos)手動掛載系統磁碟和自動掛載系統磁碟教程LinuxCentOS
- Linux磁碟管理與檔案系統(關於磁碟方面)Linux
- Linux Centos系統 磁碟分割槽和檔案系統管理 (深入理解)LinuxCentOS
- 生產系統中只讀表的實現思路
- linux系統環境變數Linux變數
- win10系統磁碟全部被佔用怎麼辦 win10系統磁碟滿了怎麼辦Win10
- Linux系統中ext3和ext4有什麼不同?Linux
- VM虛擬機器 linux系統新增磁碟虛擬機Linux
- Linux 磁碟分割槽及檔案系統管理Linux
- Linux系統監控之磁碟I/O篇Linux
- 【LINUX】磁碟格式化 建立檔案系統Linux
- linux檢視核心版本、系統版本、系統位數(32or64)Linux
- 『學了就忘』Linux檔案系統管理 — 63、磁碟配額介紹Linux
- Windows下磁碟只讀Windows
- Linux系統硬碟讀寫測試Linux硬碟
- kubernetes高階之建立只讀檔案系統以及只讀asp.net core容器ASP.NET
- Linux系統管理-系統概述Linux
- 讀書系統
- ext3檔案系統基礎知識點
- 在Linux 64位系統下使用hugepageLinux
- linux 檢視作業系統位數Linux作業系統
- 『學了就忘』Linux基礎 — 16、Linux系統與Windows系統的不同LinuxWindows
- 用GParted給linux系統進行磁碟分割槽?Linux
- Linux系統磁碟和分割槽有哪些命令呢?Linux
- Unix/Linux 系統自動化管理: 磁碟管理篇Linux
- 【Linux】Linux檔案系統管理7 磁碟配額管理Linux
- windows10系統下基本磁碟變成動態磁碟瞭如何解決Windows
- 系統改變號和系統提交號
- 磁碟管理與檔案系統
- 儲存系統-cache-磁碟