Linux - ServerThis forum is for the discussion of Linux Software used in a server related context.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
It started to reshape the array, but then the new drive started giving errors, and the machine hung. I think it was due to a problem with a PCI sata card.
I resolved that, but now I've somehow broken the array.
mdadm -D /dev/md1
/dev/md1:
Version : 01.00.03
Creation Time : Sun Dec 23 01:28:08 2007
Raid Level : raid5
Device Size : 488383744 (465.76 GiB 500.10 GB)
Raid Devices : 5
Total Devices : 3
Preferred Minor : 1
Persistence : Superblock is persistent
Update Time : Fri May 16 22:05:16 2008
State : clean, degraded, Not Started
Active Devices : 3
Working Devices : 3
Failed Devices : 0
Spare Devices : 0
Layout : left-symmetric
Chunk Size : 128K
Delta Devices : 1, (4->5)
Name : 'Fuckyfucky3':1
UUID : 43eff327:8d1aa506:c0df2849:005c003f
Events : 1420750
Number Major Minor RaidDevice State
5 8 49 0 active sync /dev/sdd1
1 8 33 1 active sync /dev/sdc1
3 8 17 2 active sync /dev/sdb1
3 0 0 3 removed
4 0 0 4 removed
mdadm -E /dev/sda1
/dev/sda1:
Magic : a92b4efc
Version : 01
Feature Map : 0x4
Array UUID : 43eff327:8d1aa506:c0df2849:005c003f
Name : 'Fuckyfucky3':1
Creation Time : Sun Dec 23 01:28:08 2007
Raid Level : raid5
Raid Devices : 5
What should I do next? I should zero the superblock on a drive?
If I try to force the array to start:
mdadm --assemble --force /dev/md1 /dev/sda1 /dev/sdb1 /dev/sdc1 /dev/sdd1 /dev/sde1
mdadm: forcing event count in /dev/sda1(3) from 1420746 upto 1420750
mdadm: clearing FAULTY flag for device 0 in /dev/md1 for /dev/sda1
mdadm: /dev/md1 has been started with 4 drives (out of 5).
Then in dmesg.
raid5:md1: read error not correctable (sector 96720 on sda1).
raid5:md1: read error not correctable (sector 96728 on sda1).
raid5:md1: read error not correctable (sector 96736 on sda1).
raid5:md1: read error not correctable (sector 96744 on sda1).
raid5:md1: read error not correctable (sector 96752 on sda1).
raid5:md1: read error not correctable (sector 96760 on sda1).
ata1: EH complete
md: md1: sync done.
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
ata1.00: (BMDMA stat 0x20)
ata1.00: tag 0 cmd 0xc8 Emask 0x9 stat 0x51 err 0x40 (media error)
ata1: EH complete
With 3 failed drives, I suspect you're learning the hard way that software RAID5 is not the best of choices for data. Probably whatever crashed the system is what screwed up the drives, which is the unfortunate nature of software RAID. Obviously if your OS goes down for any reason, and the OS is also controlling the array, the array has no controller, so data will probably get lost.
It is also possible you have a hardware error with some of the drives, but I think that is a remote possibility at best.
I would expect the data could be recovered, but it would probably need recovery experts. Their cost would be way above the cost of a hardware controller.
Hmm, yeah, that's probably not the answer I was looking for.
Thanks for your honesty though.
Now, the data is just a bunch of movies and music I'd ripped at home, so it's not overly critical that I get it all back. However I'd like to think that I didn't do anything rashly irreversable to lose it.
Can anyone point me in the direction of some documentation I can read in order to better understand software raid in order to maybe try and recover the array? Bear in mind that I have (within reason) all the time in the world to fiddle around with the nuts and bolts rather than looking for one magic command to rebuild the array?
I pulled each drive from the box seperately, and did a full SMART test. The new Seagate failed, as well as one of the Samsungs.
I then did a full surface scan on the Samsung, which showed 5 faulty blocks.
Both should be replaced under warranty, but that doesn't help me. Since the 'grow' operation only got a very small way through, almost all of my data should still be on the old array members, right?
So what I'm thinking is that I get a couple of new drives, and dd the faulty samsung onto a new one. Then do the same for the faulty seagate and assemble the array. That /should/ contiue the grow operation, or at least bring up the array in a degraded state, right?
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.