This is an automated archive made by the Lemmit Bot.

The original was posted on /r/unraid by /u/valain on 2023-09-25 13:18:33.


[SOLVED, see below]

[Alas not solved]

Hello!

I moved my array to a new server (new mobo, CPU, RAM, etc.). Since the move, I have been getting on average one SMART “UDMA CRC” error per day on the parity drive. There are no other issues, and parity builds complete just fine.

Today I swapped the SATA cable of the parity drive with that of one of the other drives in the array, to find out whether the problem is disk-related or cable/controller-related. I then started a parity rebuild to check.

Now I have TWO of the above errors: one still on the parity drive, and the other on the drive I swapped cables with.

So, I’m almost 100% confident that it’s actually not the disks, but rather an issue with the cables or the controller… and cables would be weird because before the cable swap, it was only one drive exhibiting the issue. So, controller?

What should my next troubleshooting step be: replace the 2 cables on the now-affected drives, or swap these two cables with those of the other two drives in the array (4 drives in total)?
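For anyone doing the same kind of cable-swap test: it is easier to see which drives picked up new errors if you record the CRC counter for every drive before and after each change. Below is a minimal Python sketch of that idea, assuming smartmontools is installed and that the counter shows up as SMART attribute 199 (`UDMA_CRC_Error_Count`) in the `smartctl -A` table, which is the usual case for SATA drives. The sample output, the drive labels, and the helper names are made up for illustration.

```python
import subprocess

# Illustrative excerpt of `smartctl -A` output; the values are invented.
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  9 Power_On_Hours          0x0032   095   095   000    Old_age   Always       -       4321
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       3
"""


def parse_udma_crc(smartctl_output):
    """Return the raw value of SMART attribute 199, or None if it is absent."""
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows have 10 columns; the raw value is the last one.
        if len(fields) >= 10 and fields[0] == "199":
            return int(fields[9])
    return None


def read_udma_crc(device):
    """Run `smartctl -A` on a device (usually needs root) and parse the counter."""
    result = subprocess.run(
        ["smartctl", "-A", device], capture_output=True, text=True
    )
    return parse_udma_crc(result.stdout)


def crc_increases(before, after):
    """Map each drive whose counter grew to the number of new errors."""
    return {
        drive: after[drive] - before[drive]
        for drive in after
        if drive in before and after[drive] > before[drive]
    }
```

Take one snapshot per drive with `read_udma_crc` before the swap and one after the parity run, then pass both dicts to `crc_increases`. The counter is cumulative and never resets, so only the increase between snapshots says anything about the current cabling.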

I’ve read elsewhere that some people never got rid of these errors and simply disabled Unraid’s reporting of them… that doesn’t really sound like an option, because if errors get reported (even infrequently), something still seems to be wrong.

Thanks!

UPDATE 2: Alas, not solved… 30% into the parity check, a 3rd drive has now reported a UDMA CRC error. So I have now had CRC errors popping up on 3 different drives, not sharing any cables. This seems to point towards a problem with the built-in SATA controller on the mainboard (Asus Rog Strix Gaming Z690-G Wifi)?
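The elimination reasoning here can be written down as a small set intersection: list which components each drive hangs off, then keep only what all affected drives have in common. This is just a sketch in Python. The drive labels, cable names, and the assignment of drives to power cables are hypothetical (the layout assumes the power split from the first update, two drives per power cable), and a shared component is only a suspect, not proof.

```python
# Hypothetical wiring: four drives on the onboard controller,
# one SATA data cable each, two drives per power cable.
TOPOLOGY = {
    "parity": {"data_cable": "sata_a", "power_cable": "power1", "controller": "onboard"},
    "disk1": {"data_cable": "sata_b", "power_cable": "power1", "controller": "onboard"},
    "disk2": {"data_cable": "sata_c", "power_cable": "power2", "controller": "onboard"},
    "disk3": {"data_cable": "sata_d", "power_cable": "power2", "controller": "onboard"},
}


def shared_components(topology, affected):
    """Return the components that every affected drive has in common."""
    common = None
    for drive in affected:
        parts = set(topology[drive].items())
        # Keep only the (kind, name) pairs seen on all drives so far.
        common = parts if common is None else common & parts
    return dict(common or set())


# Two affected drives on the same power cable: cable and controller both remain.
print(shared_components(TOPOLOGY, ["parity", "disk1"]))
# Three affected drives across both power cables: only the controller remains.
print(shared_components(TOPOLOGY, ["parity", "disk1", "disk2"]))
```

With this layout, a third drive on the other power cable leaves only the controller in common, which matches the suspicion above. Anything not in the table (PSU, backplane, the board itself) is shared by all drives too and would have to be added to rule it in or out.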

UPDATE: SOLVED! [No, it’s not]

Typical case of error in front of the keyboard, or rather in front of the power cables. Initially I had 2 Y cables coming off a single SATA power cable, thus powering 4 drives on a single cable 🤯. I really didn’t notice this mistake when I built the machine. I have now added an additional power cable, so that each power cable feeds two drives, and now the CRC errors are gone. Feeling stupid, but at the same time smarter now 🥳