Go Back   Pro/Forums > ProCooling Technical Discussions > Snap Server / NAS / Storage Technical Goodies
Password
Register FAQ Members List Calendar Chat

Snap Server / NAS / Storage Technical Goodies The Home for Snap Server Hacking, Storage and NAS info. And NAS / Snap Classifides

Reply
Thread Tools
Unread 04-26-2006, 09:45 PM   #1
cestes
Cooling Neophyte
 
Join Date: Apr 2006
Location: DC
Posts: 4
Default Snap 4000 fsck'ing after power hit

Hi All,

I stumbled across this forum after a problem and I've learned a lot just reading.

We've got a problem and a plan of attack and I'd like some advice and/or suggestions. Anything I learn I'll gladly report back.

The Situation: We're moving to a new building. We moved our Snap Server 4000 (bought on eBay with 4*250GB disks configured RAID 5) to the new building so users could access it. The computer room isn't completed (no UPS yet) and the Snap Server was plugged into wall power. Guess what? We took a power hit! The box didn't come back after the power did. It kept giving errors. I called Adaptec and paid the one-time incident fee. I learned a lot and their techs were really helpful. We tried lots of things but no luck. It looks like the hardware is fine, but the file system is not reparable by the fsck. Finallly they told me today to call some data recovery places and gave me a list of numbers.

I spoke with one place that started me thinking. The sales person got an engineer on the phone to ask some questions. Finally he says, "Well, I'm thinking I can work this remote. No need to ship it anywhere. What you'd have to do is get an IDE RAID controller like a Promise, hook the drives up to that and put it in a spare PC you've got laying around."

The Plan: So, I start thinking to myself. I can certainly hook everything up as requested. I started reading and see that the Snap is running XFS inside. Gee, so does Knoppix. So, I went on eBay and bought an Adaptec 2400A RAID card that does IDE RAID 5. I'll slap the card and the 4 drives into a PC and boot Knoppix. Next I should be able to do 'xfs_repair' and I'm home free, right?

OK, maybe not so simple. How do I hook the drives up? What order? Master vs. Slave? Etc. Do I tell the controller that the 4 drives are RAID5 or will it figure it out? Will Knoppix see it right away? Etc.? Finally, will xfs_repair do the job?

Basically, has anyone pulled their RAID5 disks out, repaired them, and put them back again?

Any advice, suggestions, etc. welcome.

Thanks and I hope I'll have some good news early next week!

-Chris-
cestes is offline   Reply With Quote
Unread 04-27-2006, 01:29 AM   #2
re3dyb0y
Cooling Savant
 
re3dyb0y's Avatar
 
Join Date: Aug 2004
Location: UK
Posts: 909
Default Re: Snap 4000 fsck'ing after power hit

Is possible, but not probable

Knoppix will see the drives, but it wont let you read them - ie the files off it (well, no one has so far)

At best youu will be able to make an image, but doesnt mean you can get the datas

I think i've got it on the wiki, but, there is a 4000 service manual about which reference the drive id's to their positions
__________________
Snap Server Help Wiki - http://wiki.procooling.com/index.php/Snap_Server

Snap Server 2200 v3.4.807
2x 250GB Seagate Barracuda 7200.9
w/ UNIDFC601512M Replacement Fan

"Did you really think it would be that easy??"


Other NAS's
1x NSLU2 w/ 512mb Corsair Flash Voyager
Running Unslung 6.8b

1x NSLU2 w/ 8Gb LaCie Carte Orange
Running Debian/NSLU2 Stable 4.0r0


250GB LaCie Ethernet Disk
Running Windows XP Embedded
re3dyb0y is offline   Reply With Quote
Unread 04-27-2006, 07:25 AM   #3
rpmurray
Cooling Savant
 
Join Date: Apr 2006
Location: Tennessee
Posts: 157
Default Re: Snap 4000 fsck'ing after power hit

"I think i've got it on the wiki, but, there is a 4000 service manual about which reference the drive id's to their positions"

If you can find that in the wiki, you've got sharper eyes than I do. I still haven't found anything that shows which drives are which in a 4000.
rpmurray is offline   Reply With Quote
Unread 04-27-2006, 07:48 AM   #4
blue68f100
Thermophile
 
blue68f100's Avatar
 
Join Date: Jul 2005
Location: Plano, TX
Posts: 3,135
Default Re: Snap 4000 fsck'ing after power hit

Your best bet in repairing the unit is.????
If data must be recovered. BACKUP ALL DRIVES USING DD. It can make a image of the drives, for backup only. You can not read it. If you do image the drives with dd. You will be looking at 24 hrs +. To make a raw copy of the drives.

Do you have a copy of SpinRite from GRC.com. It works with most any drive and OS, XFS is not listed, though, raids are. It is the best option if you are going to need data recovery, with out sending in to a recovery service, which cost $$$$$.

If you hook the drives up in a attempt to read the drives you will proably loose all data. It will not reconize the File System. If you want to try use FreeBSD it is what the snap OS is build off of.

Only a 4000 will be able to read the data from a raid 5. Install into another 4000. But with the boot damged ????? Don't know.....

I would try to determine if you have a hardware failure or Drive/os failure.

Since the boot drive and Flashram (?) are damage. I would mark all drives as for position. I would then install a new drive in to position 1 (first). See if the unit formats the drive ands loads the OS. If so you will be able to recover. If not it gets more difficult.

If you are able to boot with 1 drive. The unit should have held the raid configuration. If so, see if you can set drive 1 up as a spare. Install the remaining drives. With any luck. It should comeup and start resync /rebuild the raid 5 set.

If Not.

Install a drive with the boot image file allready pre-loaded on the the drive. This will test the hardware. If it fails to boot. You have a hardware problem. Either the MB, Ram, or PS is bad. Get you multimeter out and check the PS. These are limited on power. And 250gig drives seam to be close the the max it can handle.

This should be a starting point.

If you do image the drives with dd. You will be looking at 24 hrs +. To make a raw copy of the drives.
__________________
1 Snap 4500 - 1.0T (4 x 250gig WD2500SB RE), Raid5,
1 Snap 4500 - 1.6T (4 x 400gig Seagates), Raid5,
1 Snap 4200 - 4.0T (4 x 2gig Seagates), Raid5, Using SATA converts from Andy

Link to SnapOS FAQ's http://forums.procooling.com/vbb/showthread.php?t=13820

Last edited by blue68f100; 04-27-2006 at 08:35 AM.
blue68f100 is offline   Reply With Quote
Unread 04-27-2006, 11:34 AM   #5
re3dyb0y
Cooling Savant
 
re3dyb0y's Avatar
 
Join Date: Aug 2004
Location: UK
Posts: 909
Default Re: Snap 4000 fsck'ing after power hit

Quote:
Originally Posted by rpmurray
"I think i've got it on the wiki, but, there is a 4000 service manual about which reference the drive id's to their positions"

If you can find that in the wiki, you've got sharper eyes than I do. I still haven't found anything that shows which drives are which in a 4000.
Lol, yeah, i thought it was....

Well, i had it my files, so have uploaded it for you http://rapidshare.de/files/19065196/...guide.pdf.html



Cheers


Sam
__________________
Snap Server Help Wiki - http://wiki.procooling.com/index.php/Snap_Server

Snap Server 2200 v3.4.807
2x 250GB Seagate Barracuda 7200.9
w/ UNIDFC601512M Replacement Fan

"Did you really think it would be that easy??"


Other NAS's
1x NSLU2 w/ 512mb Corsair Flash Voyager
Running Unslung 6.8b

1x NSLU2 w/ 8Gb LaCie Carte Orange
Running Debian/NSLU2 Stable 4.0r0


250GB LaCie Ethernet Disk
Running Windows XP Embedded
re3dyb0y is offline   Reply With Quote
Unread 04-27-2006, 12:18 PM   #6
cestes
Cooling Neophyte
 
Join Date: Apr 2006
Location: DC
Posts: 4
Default Re: Snap 4000 fsck'ing after power hit

Thanks to all for the info...

To be clear, it's not a hardware problem... everything appears to power up fine. Its not a raid problem (AFIK). It's not a boot/OS problem, since the thing boots up and I can talk to it. It's a file system (XFS) problem that I'm having.

The machine comes up and starts doing a fsck on device 60000, which is my RAID 5 device and that's where it runs into trouble.

I'll take the advice to make copies of everything before I start messing around with it!

Question: Do I need to know the chunk size, parity scheme, etc. that defines the RAID 5 if I'm going to try and read it, or will my controller be able to figure that out?

Thanks again!

-Chris-
cestes is offline   Reply With Quote
Unread 04-27-2006, 09:41 PM   #7
blue68f100
Thermophile
 
blue68f100's Avatar
 
Join Date: Jul 2005
Location: Plano, TX
Posts: 3,135
Default Re: Snap 4000 fsck'ing after power hit

You are not going to be able to read it without another snap. Due to the XFS file system. Moving it to another non snap will be detramental. The XFS is unique to the snaps, only.

Does the error log indicates which drive is having a problem?

Do you have a copy of SpinRite? It is the best disk recovery program on the market.
__________________
1 Snap 4500 - 1.0T (4 x 250gig WD2500SB RE), Raid5,
1 Snap 4500 - 1.6T (4 x 400gig Seagates), Raid5,
1 Snap 4200 - 4.0T (4 x 2gig Seagates), Raid5, Using SATA converts from Andy

Link to SnapOS FAQ's http://forums.procooling.com/vbb/showthread.php?t=13820
blue68f100 is offline   Reply With Quote
Unread 04-28-2006, 06:50 AM   #8
cestes
Cooling Neophyte
 
Join Date: Apr 2006
Location: DC
Posts: 4
Default Re: Snap 4000 fsck'ing after power hit

Quote:
Originally Posted by blue68f100
The XFS is unique to the snaps, only.
I'm not sure about that at all. XFS was developed by SGI and is used on a lot of systems:

http://en.wikipedia.org/wiki/Xfs

One of the engineers I spoke to at one of the big data recovery shops basically spelled out the approach. I've got to figure that he's somewhat competent as he was getting ready to charge me between $7,500 and $15,000 for the recovery job!

I'm going to pick up some identical drives and images each drive and play with those to avoid breaking my real ones.

Thanks again for all your input!

-Chris-
cestes is offline   Reply With Quote
Unread 04-28-2006, 08:20 AM   #9
blue68f100
Thermophile
 
blue68f100's Avatar
 
Join Date: Jul 2005
Location: Plano, TX
Posts: 3,135
Default Re: Snap 4000 fsck'ing after power hit

Quote:
Older versions of XFS suffered from out-of-order write hazards, which can result in problems such as files being appended to during a crash gaining a tail of garbage on the next mount.
Snap has been using this/modified since 98. So this may explain the error. When snap went from v1 to 2, it required a hd reinit as part of the process. So they are using a Modified XFS.

While you are at it. SpinRite supports raid drives. It's a $90 program. It may be worth the try. It's written in assembly language, and run from a dos (x86). It is well know for its recovery process. It's a lot cheaper than $15k.

DD on 250gig drive will take about ??????
dd defaults to 512k bytes, for it raw copy. If I recall you can change it to 1024K and it will be faster.

Have you looked closely at the Error Log? Generally it tells you which drive.
__________________
1 Snap 4500 - 1.0T (4 x 250gig WD2500SB RE), Raid5,
1 Snap 4500 - 1.6T (4 x 400gig Seagates), Raid5,
1 Snap 4200 - 4.0T (4 x 2gig Seagates), Raid5, Using SATA converts from Andy

Link to SnapOS FAQ's http://forums.procooling.com/vbb/showthread.php?t=13820

Last edited by blue68f100; 04-28-2006 at 08:29 AM.
blue68f100 is offline   Reply With Quote
Unread 04-28-2006, 01:47 PM   #10
re3dyb0y
Cooling Savant
 
re3dyb0y's Avatar
 
Join Date: Aug 2004
Location: UK
Posts: 909
Default Re: Snap 4000 fsck'ing after power hit

You can change it to 1024K

If they are gonna charge that much, buying a new unit would work probably better, and then allow you to have 2 units!
__________________
Snap Server Help Wiki - http://wiki.procooling.com/index.php/Snap_Server

Snap Server 2200 v3.4.807
2x 250GB Seagate Barracuda 7200.9
w/ UNIDFC601512M Replacement Fan

"Did you really think it would be that easy??"


Other NAS's
1x NSLU2 w/ 512mb Corsair Flash Voyager
Running Unslung 6.8b

1x NSLU2 w/ 8Gb LaCie Carte Orange
Running Debian/NSLU2 Stable 4.0r0


250GB LaCie Ethernet Disk
Running Windows XP Embedded
re3dyb0y is offline   Reply With Quote
Unread 04-28-2006, 01:55 PM   #11
re3dyb0y
Cooling Savant
 
re3dyb0y's Avatar
 
Join Date: Aug 2004
Location: UK
Posts: 909
Default Re: Snap 4000 fsck'ing after power hit

I know on Mandrake/Mandrivia that also does support XFS/it is in the list of hard drives you could mount

I just wasn't able to get it to install at the time

Quote:
XFS has been merged into the mainline Linux 2.4 (as of 2.4.25, when Marcelo Tosatti judged it stable enough) and 2.6 kernels, making it almost universally available on Linux systems. Installation programs for the SuSE, Gentoo, Mandriva, Slackware, Ubuntu and Debian Linux distributions all offer XFS as a choice of filesystem. FreeBSD gained read-only support for XFS in December 2005.

Seems most of the newer OS's also include it.....
__________________
Snap Server Help Wiki - http://wiki.procooling.com/index.php/Snap_Server

Snap Server 2200 v3.4.807
2x 250GB Seagate Barracuda 7200.9
w/ UNIDFC601512M Replacement Fan

"Did you really think it would be that easy??"


Other NAS's
1x NSLU2 w/ 512mb Corsair Flash Voyager
Running Unslung 6.8b

1x NSLU2 w/ 8Gb LaCie Carte Orange
Running Debian/NSLU2 Stable 4.0r0


250GB LaCie Ethernet Disk
Running Windows XP Embedded
re3dyb0y is offline   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -5. The time now is 06:17 PM.


Powered by vBulletin® Version 3.7.4
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
(C) 2005 ProCooling.com
If we in some way offend you, insult you or your people, screw your mom, beat up your dad, or poop on your porch... we're sorry... we were probably really drunk...
Oh and dont steal our content bitches! Don't give us a reason to pee in your open car window this summer...