Identify failed drive
zfs is reporting read and write errors:
pool: pool0
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://zfsonlinux.org/msg/ZFS-8000-9P
scan: scrub in progress since Sun Sep 29 02:42:02 2019
970G scanned out of 44.4T at 7.43M/s, (scan is slow, no estimated time)
0B repaired, 2.14% done
config:
NAME STATE READ WRITE CKSUM
pool0 ONLINE 0 0 0
raidz2-0 ONLINE 0 0 0
scsi-35000cca252119064 ONLINE 0 0 0
scsi-35000cca2525e92c8 ONLINE 0 0 0
scsi-35000cca2525e970c ONLINE 0 0 0
scsi-35000cca2525eb8bc ONLINE 37 768 0
scsi-35000cca2525ec8e8 ONLINE 0 0 0
scsi-35000cca2525ed36c ONLINE 0 0 0
scsi-35000cca2525eda08 ONLINE 0 0 0
scsi-35000cca2525ef9bc ONLINE 0 0 0
scsi-35000cca2525f4f6c ONLINE 0 0 0
scsi-35000cca2525fd754 ONLINE 0 0 0
dmesg can confirm these errors:
# dmesg -T | grep blk_update_request
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949800376
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949799600
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949800120
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949799088
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949798792
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949798808
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949798800
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949798688
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949798824
[Sat Sep 28 11:52:11 2019] blk_update_request: I/O error, dev sdf, sector 949798656
Time to change disk. 1/12 disks is Faulty. The Problem I have no clue which one.
ledctl to the rescue:
apt install -y ledmon
ledctl failure=/dev/disk/by-id/scsi-35000cca2525eb8bc
Now I can safely pull the faulty disk.
Last update:
March 22, 2021