Corrupted RAIDFrame device

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Paul M
Date: Wednesday, October 29, 2008 - 9:51 pm

Hi all

I have a simple 2 disk RAID 1 array which has become corrupted by a 
faulty memory module.

If I repeatedly generate an MD5 hash on the same file, I consistantly 
get 1 of 2 values back, roughly alternating, so I assume that the 2 
disks have different versions of the same file and they are accessed 
more-or-less alternately. 'raidclt -s' tells me that all is well with 
the array.
It appears that the likelyhood of corruption is greater with larger 
files - >approx 1/2 gig are pretty much all corrupt while small files 
are pretty much all ok. All this sounds reasonable under the 
circumstances.

My idea on recovering as much as possible was to disconnect 1 drive, 
copy all the data off, switch to the other drive and do the same, then 
run an anaysis on the 2 copies - if a file is the same on both copys, 
the it's probably ok, if they differ, then one or both will be bad.

So, I did the first copy, but when I swap to the other disk, RAIDFrame 
has remembered that this has 'failed' so will not configure it into the 
set (as I feared it would(nt)).

Does anyone know how I can tell RAIDFrame that the first drive is 
actually ok, or is my reasoning just nonsense anyway?
What would a parity re-write do in this case?

Ironicaly this computer is in the process of being configured as backup 
storage, so while I have the originals of most of the data, there is 
some that I dont, and I haven't yet set up the secondary (off site) 
backups. And yes I did test the backups were ok, the first ones at 
least. It appears the module failed some time during the process. I 
know, I should have been anal and checked every single one, but it was 
all brand new hardware ...
Actually, that's when failure rates are high.


paulm
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Longest Uptime?, new_guy, (Tue Oct 28, 5:54 pm)
Re: Longest Uptime?, Stephane Lapie, (Tue Oct 28, 6:11 pm)
Re: Longest Uptime?, Antoine Jacoutot, (Tue Oct 28, 6:12 pm)
Re: Longest Uptime?, Jason Crawford, (Tue Oct 28, 6:21 pm)
Re: Longest Uptime?, William Boshuck, (Tue Oct 28, 6:27 pm)
Re: Longest Uptime?, bofh, (Tue Oct 28, 6:58 pm)
Re: Longest Uptime?, J.C. Roberts, (Tue Oct 28, 7:29 pm)
Re: Longest Uptime?, Chris Lawder, (Tue Oct 28, 7:43 pm)
Re: Longest Uptime?, Guido Tschakert, (Tue Oct 28, 11:45 pm)
Re: Longest Uptime?, Artur Grabowski, (Tue Oct 28, 11:56 pm)
Re: Longest Uptime?, Mike Swanson, (Wed Oct 29, 12:25 am)
Re: Longest Uptime?, Gilles Chehade, (Wed Oct 29, 2:15 am)
Re: Longest Uptime?, guilherme m. schroeder, (Wed Oct 29, 10:49 am)
Re: Longest Uptime?, bofh, (Wed Oct 29, 11:15 am)
Corrupted RAIDFrame device, Paul M, (Wed Oct 29, 9:51 pm)
Re: Longest Uptime?, Andres Genovez, (Wed Oct 29, 10:27 pm)
Re: Longest Uptime?, Gilles Chehade, (Thu Oct 30, 2:09 am)
Re: Longest Uptime?, Han Boetes, (Thu Oct 30, 2:39 am)
Re: Longest Uptime?, Pete Vickers, (Thu Oct 30, 3:25 am)
Re: Longest Uptime?, Marco Peereboom, (Thu Oct 30, 6:50 am)
Re: Longest Uptime?, Laurent CARON, (Thu Oct 30, 8:41 am)
Re: Longest Uptime?, Lori Barfield, (Mon Nov 3, 8:55 am)
Re: Longest Uptime?, new_guy, (Mon Nov 3, 9:43 am)