Go Back   Pro/Forums > ProCooling Technical Discussions > Snap Server / NAS / Storage Technical Goodies
Password
Register FAQ Members List Calendar Chat

Snap Server / NAS / Storage Technical Goodies The Home for Snap Server Hacking, Storage and NAS info. And NAS / Snap Classifides

Reply
Thread Tools
Unread 06-29-2009, 06:50 PM   #1
mikecrom
Cooling Neophyte
 
Join Date: Jun 2009
Location: Lisle
Posts: 2
Default 4500 RAID failure

I have a 4500 with 2 expansion units. We had a power failure for 2 hours on Saturday. The system came up to the recovery console, and I chose to reboot. It rebooted, and could only be pinged -- no other access, then rebooted again, and came up. But once the system was up, it indicated RAID failure on md0 -- the 4 internal drives. The other drives were fine.
Looking at the drives from the web interface, the status is OK on all 4 drives. All 4 drives show the same type, 245G MAXTOR, but in the size column it shows 230.67GB for 2 of the drives, and 533.00 MB for the other 2 drives.
Logging on to the system via ssh, I see that hde and hdg are the drives in question (onlly showing hde):

md100: former device hde2 is unavailable, removing from array!
md101: former device hde5 is unavailable, removing from array!
hde1: bad access: block=2, count=2
end_request: I/O error, dev 21:01 (hde), sector 2
hde: unknown partition table
hde: unknown partition table
md: trying to remove hde2 from md100 ...
md: trying to hot-add hde2 to md100 ...
md: hde2 has zero size, marking faulty!
md: hde2 has zero size, marking faulty!
md: trying to remove hde5 from md101 ...
md: trying to hot-add hde5 to md101 ...
md: hde5 has zero size, marking faulty!
md: hde5 has zero size, marking faulty!
hde: unknown partition table

Running fdisk, I see that neither hde nor hdg have any partitioning info.
Running fdisk or sfdisk to partition the drives works -- it shows the same info (cyls, etc.) as hda & hdc, but then the partition map is wiped by something.

So it appears the drives are OK, and just need partitioned. I'm presuming they are the same partition map as hda hdc -- but don't know for sure.

Any ideas how to get these drives back in the RAID array?
Thanks. Mike
mikecrom is offline   Reply With Quote
Unread 06-30-2009, 08:02 AM   #2
blue68f100
Thermophile
 
blue68f100's Avatar
 
Join Date: Jul 2005
Location: Plano, TX
Posts: 3,135
Default Re: 4500 RAID failure

If you have lost 2 HD you may be sol. Run SpinRite on the failed HD's, it may beable to repair the broken table. This has worked for some. Then replace the Maxtor HD's with some good Interprise WD HD. I have seen many times where Maxtor fail in buckets. I don't trust them or use them.

You should have your units on UPS BU with auto shut down...... and restart. Just needs to have a UPS that has the network card. The latest OS supports UPB cable too.
__________________
1 Snap 4500 - 1.0T (4 x 250gig WD2500SB RE), Raid5,
1 Snap 4500 - 1.6T (4 x 400gig Seagates), Raid5,
1 Snap 4200 - 4.0T (4 x 2gig Seagates), Raid5, Using SATA converts from Andy

Link to SnapOS FAQ's http://forums.procooling.com/vbb/showthread.php?t=13820
blue68f100 is offline   Reply With Quote
Unread 06-30-2009, 09:04 AM   #3
mikecrom
Cooling Neophyte
 
Join Date: Jun 2009
Location: Lisle
Posts: 2
Default Re: 4500 RAID failure

Thanks. Actually the system is on UPS. It's in a server room with a huge UPS, and got hit with the rolling brown-out on Saturday. The UPS, however, only ran 12 minutes!

I'll give SpinRite a try. The drives seem fine, I just don't see why the partition map disappears even after I install new ones. Unfortunately, I'm not in the same locality as the 4500. Do you know if SpinRite can be booted from grub, or even a USB key? (I know, look in the docs...).

Also, Adaptec used to have the latest O/S for these for free with registration.
Now that Overland Storage has taken over -- and dropped the products -- I can't seem to find any upgades. Any ideas? I'm more interested in getting the drives working again at this point, though, so just an aside.

Thanks again.
mikecrom is offline   Reply With Quote
Unread 06-30-2009, 12:50 PM   #4
blue68f100
Thermophile
 
blue68f100's Avatar
 
Join Date: Jul 2005
Location: Plano, TX
Posts: 3,135
Default Re: 4500 RAID failure

The GOS is only free if you have a service contract.. Other wise you have to purchase it. Overland has released the old SnapOS for free downloads. But it is only for 10yr hardware that does not support Vist or OSX.

No generator at the site for backup power? Where I use to work we had huge UPS and diesel powered generators. When I left we only had 3 gen and another was in the budget for live spare. These Cat generators burned 120gal/hr, 12 cyl quad trubos locomotive engines. The startup seq had these on line and sync in less than 3 min. Power had to be restored for 5 min and stable before it would switch back to provider power. On your ups setup do you have it set to do an auto shutdown and restart. On restart I have mine set not to restart till the batteries have a min of 25% battery level. This normally takes about 15-20 min, but it eliminates failures when the power is bounced. If bounced normally it will die due to weak batteries. Remember the 4500 take 5-6 min to boot, so you need at least that amount before restarting.

Spinrite is setup to run from a CD or Floppy, both are bootable. It can be run from a usb drive if its bootable. This program is written in assembly language. You will need to be at the machine since you must power down and remove the HD's to be checked to a desktop pc. I know it's a pain, I recheck mine once a year. So I detect when a HD is starting to have problems.
__________________
1 Snap 4500 - 1.0T (4 x 250gig WD2500SB RE), Raid5,
1 Snap 4500 - 1.6T (4 x 400gig Seagates), Raid5,
1 Snap 4200 - 4.0T (4 x 2gig Seagates), Raid5, Using SATA converts from Andy

Link to SnapOS FAQ's http://forums.procooling.com/vbb/showthread.php?t=13820
blue68f100 is offline   Reply With Quote
Reply

Tags
4500, failure, raid


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -5. The time now is 06:18 PM.


Powered by vBulletin® Version 3.7.4
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
(C) 2005 ProCooling.com
If we in some way offend you, insult you or your people, screw your mom, beat up your dad, or poop on your porch... we're sorry... we were probably really drunk...
Oh and dont steal our content bitches! Don't give us a reason to pee in your open car window this summer...