I have a RAID-Z2 pool which seems to keep getting new permanent errors. I was nervous that it was related to swapping in larger disks to repalce functional smaller ones, with my ultimate goal being to expand the pool once all smaller disks are repalced. None of the disks show read or write errors, only checksum. If I wait 1-2 days after doing extensive I/O (like migrating VMware files into the NFS share running on top of one of the ZFS datastores) and run a scrub, I get typically one file each time, always a VMDK file, with permanent errors. I don't believe the motherboard is the problem. Unfortuntely I am not using ECC memory, but my understanding is that the type of error that protects you from should be very very uncommon, and not something happening every 1-2 days.
None of the devices are reporting SMART errors. I clear the error counters on the zpool each time I repalce a drive and successfully resilver or scrub. At this time I see checksum errors on every drive in the vdev holding the RAID-Z2 pool. Any suggestions?
This is the old XigmaNAS forum in read only mode,
it will taken offline by the end of march 2021!
I like to aks Users and Admins to rewrite/take over important post from here into the new fresh main forum!
Its not possible for us to export from here and import it to the main forum!
it will taken offline by the end of march 2021!
I like to aks Users and Admins to rewrite/take over important post from here into the new fresh main forum!
Its not possible for us to export from here and import it to the main forum!
RAID-Z2 pool keeps getting new permanent errors (SOLVED)
-
PaganGod
- Status: Offline
RAID-Z2 pool keeps getting new permanent errors (SOLVED)
Last edited by PaganGod on 09 Apr 2013 18:08, edited 1 time in total.
- b0ssman
- Forum Moderator

- Posts: 2438
- Joined: 14 Feb 2013 08:34
- Location: Munich, Germany
- Status: Offline
Re: RAID-Z2 pool keeps getting new permanent errors
just to be sure run a memtest for 24 hours
Nas4Free 11.1.0.4.4517. Supermicro X10SLL-F, 16gb ECC, i3 4130, IBM M1015 with IT firmware. 4x 3tb WD Red, 4x 2TB Samsung F4, both GEOM AES 256 encrypted.
-
PaganGod
- Status: Offline
Re: RAID-Z2 pool keeps getting new permanent errors
Thanks, b0ssman! The problem was indeed a bad stick of memory. Live and learn, I should have run a couple full passes with MemText x86+ on this box before putting any production data on it. Since getting the bad stick out (which also caused NFSd to stop responding yesterday) it is stable, and a scrub only found fixable errors. The one virtual disk file I lost is for an easily replacable VM.