3 Replies Latest reply on Mar 10, 2012 11:53 AM by coolbone

    D975XBX2 Missing (Bad) Drive Oddness ICH7R/DH

    Amarand

      A few days ago, I noticed that my photography ingest process was going extremely slow.  My images are stored on the Intel RAID controller of the D975XBX2.  I have four, 1.0TB WD Caviar Black drives, connected in a RAID 1+0 configuration.  One of the drives (serial number ending in 0769) started flaking out, and after a reboot, showed as missing.  After another reboot, the same drive showed as available, and the array started to rebuild, but very slowly.  I patiently let it sit there for a few days (yes, very slowly) and after it got to about 30-something percent, it came back as failed.  I then shut the system down and replaced the 0769 with an identical cold-spare (serial number ending 8555, showing Available).  The array still shows Failed, even though it's a RAID 1+0, and the other three drives are still showing okay.  0769 does not show Failed, but shows Missing.  I'm confused as to how I should proceed.  I'm including my System Report below, and would prefer (obviously) not to have to rebuild the array and restore from backup, as it's over a terabyte and that's not much fun to transfer all of that over the network (NAS) at once.

       

      I have tried, of course, to place the original disk back in (0769) and it causes the Intel RAID BIOS to hang at the banner, seemingly forever (waited an hour).  Right now, my system is booted (drives in the Marvell controller [boot] are fine), and I can play around with the Intel RST, but the only option I have is to delete the array (not ideal) and mark the Available disk as a Spare...which doesn't do anything.  My fear is that the original flaky disk is now bad, but the Intel RAID didn't detect it as such, so it's just calling it Missing instead.  Is there any way to fail-out the Missing drive and force it to rebuild using the new disk?

       

       

      System Report

       

      System Information
      OS name:  Microsoft Windows XP Professional
      OS version:  5.1.2600 Service Pack 3 2600
      System name:  FLANDERS
      System manufacturer:  INTEL_
      System model:  D975XBX2
      Processor:  GenuineIntel x86 Family 6 Model 15 Stepping 11 2.4  GHz
      BIOS:  Intel Corp., BX97520J.86A.2838.2008.0903.1859

       

      Intel® Rapid Storage Technology
      Kit installed:  10.1.0.1008
      User interface version:  10.1.0.1008
      Language:  English (United States)
      Intel RAID controller:  Intel(R) ICH7R/DH SATA RAID Controller
      Number of SATA ports:  4
      RAID option ROM version:  5.6.2.1002
      Driver version:  10.1.0.1008
      ISDI version:  10.1.0.1008

       

      Device Information
      Name:  Array_0000
      Size:  3,726 GB
      Available space:  0 GB
      Disk data cache:  Enabled
      Number of volumes:  1
      Volume member:  Ginormous2
      Number of disks:  4
      Array disk:  WD-WCATR0230904
      Array disk:  WD-WCATR0234925
      Array disk:  WD-WCATR0230252
      Array disk:  D-WCATR0230769:0

       

      Name:  Ginormous2
      Status:  Failed
      Type:  RAID 10
      Size:  1,863 GB
      Data strip size:  64 KB
      Write-back cache:  Enabled
      System volume:  No
      Initialized:  Yes 
      Verification errors found:  0
      Blocks with media errors:  0
      Physical sector size:  512 Bytes
      Logical sector size:  512 Bytes
      Parent array:  Array_0000
      Number of disks:  4
      Array disk:  WD-WCATR0230904
      Array disk:  WD-WCATR0234925
      Array disk:  WD-WCATR0230252
      Array disk:  D-WCATR0230769:0

       

      Disk on port 0
      Port location:  Internal
      Status:  Normal
      Usage:  Array disk
      Size:  932 GB
      Serial number:  WD-WCATR0230904
      Model:  WDC WD1002FAEX-00Z3A0
      Firmware:  05.01D05
      System disk:  No
      Password protected:  No
      Disk data cache:  Enabled
      Native command queuing:  Yes 
      SATA transfer rate:  3 Gb/s
      Physical sector size:  512 Bytes
      Logical sector size:  512 Bytes

       

      Disk on port 1
      Port location:  Internal
      Status:  Normal
      Usage:  Array disk
      Size:  932 GB
      Serial number:  WD-WCATR0234925
      Model:  WDC WD1002FAEX-00Z3A0
      Firmware:  05.01D05
      System disk:  No
      Password protected:  No
      Disk data cache:  Enabled
      Native command queuing:  Yes 
      SATA transfer rate:  3 Gb/s
      Physical sector size:  512 Bytes
      Logical sector size:  512 Bytes

       

      Disk on port 2
      Port location:  Internal
      Status:  Normal
      Usage:  Available
      Size:  932 GB
      Serial number:  WD-WCATR0228555
      Model:  WDC WD1002FAEX-00Z3A0
      Firmware:  05.01D05
      System disk:  No
      Password protected:  No
      Disk data cache:  Enabled
      Native command queuing:  Yes 
      SATA transfer rate:  3 Gb/s
      Physical sector size:  512 Bytes
      Logical sector size:  512 Bytes

       

      Disk on port 3
      Port location:  Internal
      Status:  Normal
      Usage:  Array disk
      Size:  932 GB
      Serial number:  WD-WCATR0230252
      Model:  WDC WD1002FAEX-00Z3A0
      Firmware:  05.01D05
      System disk:  No
      Password protected:  No
      Disk data cache:  Enabled
      Native command queuing:  Yes 
      SATA transfer rate:  3 Gb/s
      Physical sector size:  512 Bytes
      Logical sector size:  512 Bytes

       

      Disk on port Unknown
      Port location:  Unknown
      Status:  Missing
      Usage:  Array disk
      Size:  0 GB
      Serial number:  D-WCATR0230769:0
      Model: 
      Firmware: 
      System disk:  No
      Password protected:  No
      Disk data cache:  Enabled
      Native command queuing:  No
      SATA transfer rate:  Unknown
      Physical sector size:  0 Bytes
      Logical sector size:  0 Bytes

        • 1. Re: D975XBX2 Missing (Bad) Drive Oddness ICH7R/DH
          Amarand

          Well, I'm a little bit disappointed with the Intel Communities as far as a help resource.  135 people read this post over the past four days and not one person replied.

           

          I've since ordered a pair of 2TB drives, and will be finding out if my full backup is any good.  I originally started out with RAID 5, switched to RAID 10, and now I'm going to a mirrored pair, because it scares me a little less with this particular Intel RAID chipset.

           

          I've left my photography workstation open and virtually unusable just in case someone had any ideas, and once the new drives come in, I will be deleting the RAID 10 to make way for the new mirror.

           

          I'm also disappointed with Intel's ability to rebuild and/or diagnose an array.  I specifically switched to a RAID 10 thinking that, by design, it would allow for at least one failure.  With the drop-outs on this chipset, and the apparent lack of controls (timeout flexibility, etc.), it's a bit scary.  I'm also not using cheap drives: I went for the Western Digital Caviar Black, and even have a spare.

           

          I work in the support industry, and the documentation on this particular RAID chipset is sparse, which is not something I'd expect from Intel. 

          • 2. Re: D975XBX2 Missing (Bad) Drive Oddness ICH7R/DH
            Amarand

            07-07-2011 6-44 AM.png

            Here's the "Advanced" management.  Wait, there's nothing for me to click?  No way to test the array, or delete "missing" drives from the array to replace them?  How bogus is this?

            • 3. Re: D975XBX2 Missing (Bad) Drive Oddness ICH7R/DH
              coolbone

              I'm dealing with this same problem.

               

              I had a 3 disk raid5 that I wanted to add another drive to. Added it and then around 13% through the data migration, one of my drives got bumped out. I can mount it in windows as a whole new drive, and I even see it outside of the array view, but there's no way to get it back in (there's a copy of it with a slightly corrupt serial still inside the array)

               

              I'll attach a screenshot and system report soon.