This is the old XigmaNAS forum in read only mode,
it will taken offline by the end of march 2021!



I like to aks Users and Admins to rewrite/take over important post from here into the new fresh main forum!
Its not possible for us to export from here and import it to the main forum!

RAID-Z2 pool keeps getting new permanent errors (SOLVED)

Forum rules
Set-Up GuideFAQsForum Rules
Post Reply
PaganGod
Status: Offline

RAID-Z2 pool keeps getting new permanent errors (SOLVED)

Post by PaganGod »

I have a RAID-Z2 pool which seems to keep getting new permanent errors. I was nervous that it was related to swapping in larger disks to repalce functional smaller ones, with my ultimate goal being to expand the pool once all smaller disks are repalced. None of the disks show read or write errors, only checksum. If I wait 1-2 days after doing extensive I/O (like migrating VMware files into the NFS share running on top of one of the ZFS datastores) and run a scrub, I get typically one file each time, always a VMDK file, with permanent errors. I don't believe the motherboard is the problem. Unfortuntely I am not using ECC memory, but my understanding is that the type of error that protects you from should be very very uncommon, and not something happening every 1-2 days.

None of the devices are reporting SMART errors. I clear the error counters on the zpool each time I repalce a drive and successfully resilver or scrub. At this time I see checksum errors on every drive in the vdev holding the RAID-Z2 pool. Any suggestions?
Last edited by PaganGod on 09 Apr 2013 18:08, edited 1 time in total.

User avatar
b0ssman
Forum Moderator
Forum Moderator
Posts: 2438
Joined: 14 Feb 2013 08:34
Location: Munich, Germany
Status: Offline

Re: RAID-Z2 pool keeps getting new permanent errors

Post by b0ssman »

just to be sure run a memtest for 24 hours
Nas4Free 11.1.0.4.4517. Supermicro X10SLL-F, 16gb ECC, i3 4130, IBM M1015 with IT firmware. 4x 3tb WD Red, 4x 2TB Samsung F4, both GEOM AES 256 encrypted.

PaganGod
Status: Offline

Re: RAID-Z2 pool keeps getting new permanent errors

Post by PaganGod »

Thanks, b0ssman! The problem was indeed a bad stick of memory. Live and learn, I should have run a couple full passes with MemText x86+ on this box before putting any production data on it. Since getting the bad stick out (which also caused NFSd to stop responding yesterday) it is stable, and a scrub only found fixable errors. The one virtual disk file I lost is for an easily replacable VM.

Post Reply

Return to “ZFS (only!)”