Applicable Products
- QuTS hero h5.3.0 or later
- High Availability Manager
- Storage Manager
Scenario
While using a QNAP NAS with high availability (HA) enabled, I received a system warning indicating an SSD cache issue. One or more cache disks are reported as degraded or crashed. How can I repair or replace the faulty disks without interrupting ongoing services?
Procedure
Before you begin, please do the following:
- Back up important data.
Always back up your data before performing disk operations to mitigate the risk of data loss.
- Identify the faulty SSD(s).
- Go to Storage Manager > Disks > Disk.
- Identify the SSDs with “Cache” listed under the column header Usage Type.
- Identify the SSDs with “Error” or “Warning” listed under the column header Status.
- Determine whether hot-swapping is supported for the drive bays/slots where your faulty SSDs are installed.
Tip
- 3.5-inch or 2.5-inch drive bays usually support hot-swapping.
- Drive slots that require access to the system board (such as M.2) usually do not support hot-swapping.
- To determine hot-swapping support for drive bays/slots on your specific model, download and check the hardware user guide for your model in Download Center.
- Follow the relevant instructions depending on whether hot-swapping is supported:
Case 1: Hot-swapping supported
- Replace the faulty disk.
- Remove the faulty disk.
- Install a healthy disk of the same or larger capacity in the same slot.
The system automatically detects the new disk.
- If the SSD cache is degraded, the system will automatically start rebuilding the cache RAID.
Note
If the system does not automatically start rebuilding, try setting the new disk as a spare disk to trigger the rebuild process.
- If the SSD cache crashed, manually remove and recreate the SSD cache.
- Go to Storage Manager > Cache Acceleration.
- Remove the SSD cache.
For details, see Removing the SSD cache.
The system automatically flushes cached data back to the storage pool.
- Recreate the SSD cache.
For details, see Creating the SSD cache.
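The disk selection and the degraded/crashed decision described above can be summarized in a short sketch. This is an illustration only: on the NAS these steps are performed in the Storage Manager interface, and the `Disk` record, its field names, the sample values "Data" and "Good", and both functions below are hypothetical. They are not part of any QNAP API.

```python
from dataclasses import dataclass


@dataclass
class Disk:
    """Hypothetical record mirroring columns of the Storage Manager disk table."""
    slot: str
    usage_type: str  # "Usage Type" column, e.g. "Cache"
    status: str      # "Status" column, e.g. "Error" or "Warning"


def faulty_cache_disks(disks):
    """Return the cache SSDs whose status is Error or Warning."""
    return [d for d in disks
            if d.usage_type == "Cache" and d.status in ("Error", "Warning")]


def repair_action(cache_state):
    """Map the SSD cache state to the follow-up action after the disk swap."""
    if cache_state == "degraded":
        return ("wait for the automatic rebuild; if it does not start, "
                "set the new disk as a spare disk")
    if cache_state == "crashed":
        return "remove the SSD cache, then recreate it"
    raise ValueError(f"unexpected cache state: {cache_state!r}")


# Sample data; "Data" and "Good" are assumed placeholder values.
disks = [
    Disk("Bay 1", "Data", "Good"),
    Disk("Bay 5", "Cache", "Warning"),
    Disk("Bay 6", "Cache", "Good"),
]
for disk in faulty_cache_disks(disks):
    print(disk.slot, "->", repair_action("degraded"))
```

In this sample only the disk in Bay 5 is selected, because it is the only one that is both a cache disk and in a warning state.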
Case 2: Hot-swapping not supported
- If the faulty disk is on the active node, switch the node role to passive node by performing a switchover.
If the faulty disk is on the passive node, skip to the next step.
- Go to High Availability Manager > Cluster.
- Click Manage, and then select Switch Over.
The original active node becomes the passive node.
- Shut down the passive node.
- Go to High Availability Manager > Nodes.
- Identify the passive node.
- Click the action icon for that node, and then select Shut Down.
- Replace the faulty disk.
- Remove the faulty disk.
- Install a healthy disk of the same or larger capacity in the same slot.
- Power on the passive node.
After the passive node starts, it automatically rejoins the HA cluster.
- Switch the passive node’s role to active node.
SSD cache is managed by the active node. To repair the SSD cache, the host NAS must be in the active node role.
- Go to High Availability Manager > Cluster.
- Click Manage, and then select Switch Over.
The passive node is now the active node.
- If the SSD cache is degraded, the system will automatically start rebuilding the cache RAID.
Note
If the system does not automatically start rebuilding, try setting the new disk as a spare disk to trigger the rebuild process.
- If the SSD cache crashed, manually remove and recreate the SSD cache.
- Go to Storage Manager > Cache Acceleration.
- Remove the SSD cache.
For details, see Removing the SSD cache.
The system automatically flushes cached data back to the storage pool.
- Recreate the SSD cache.
For details, see Creating the SSD cache.
Tip
After replacing a disk, we recommend performing a switchover to verify that the switchover/failover mechanism works properly and that HA functionality has been restored.
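The order of operations in Case 2 depends on which node holds the faulty disk and on the cache state. The sketch below lays that ordering out as a checklist. It is a planning aid under stated assumptions, not an automation script: the function name `non_hot_swap_plan` and its parameters are hypothetical, and every step is still carried out manually in High Availability Manager and Storage Manager.

```python
def non_hot_swap_plan(faulty_node_role, cache_state):
    """Return the ordered Case 2 steps as plain strings (hypothetical helper).

    faulty_node_role: "active" or "passive", the role of the node with the faulty disk.
    cache_state: "degraded" or "crashed".
    """
    if faulty_node_role not in ("active", "passive"):
        raise ValueError(f"unexpected node role: {faulty_node_role!r}")
    if cache_state not in ("degraded", "crashed"):
        raise ValueError(f"unexpected cache state: {cache_state!r}")

    steps = []
    # A disk on the active node can only be serviced after a switchover.
    if faulty_node_role == "active":
        steps.append("switch over so the node with the faulty disk becomes passive")
    steps += [
        "shut down the passive node",
        "replace the faulty disk with a healthy disk of the same or larger capacity",
        "power on the passive node and let it rejoin the HA cluster",
        "switch over so the repaired node becomes active",
    ]
    # The cache is repaired from the active node.
    if cache_state == "degraded":
        steps.append("let the cache RAID rebuild; set the new disk as a spare if it does not start")
    else:
        steps.append("remove the SSD cache, then recreate it")
    steps.append("perform a switchover to confirm HA functionality (recommended)")
    return steps


for number, step in enumerate(non_hot_swap_plan("active", "crashed"), start=1):
    print(f"{number}. {step}")
```

The only branch on node role is the initial switchover, which matches the instruction to skip that step when the faulty disk is already on the passive node.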