2 Replies Latest reply on Dec 26, 2013 5:00 AM by gerferra

    DH67BL - Help with failing RAID

    gerferra

      Hello.

       

      About a year ago I set up a RAID 5 array with three 2 TB Seagate SATA II disks, using the RAID controller built into the DH67BL board. It is for a home server running Ubuntu Server 12.10.

      Yesterday the server stopped booting.
      After booting from a live CD, I could see that the SMART check of one of the disks was reporting bad sectors, and that the disk had overheated in the past.

      I ran the SMART self-test on all three disks: two reported OK and one reported as failed (read errors).
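
For anyone wanting to repeat the self-test from a live CD, here is a minimal sketch of the commands involved. It assumes the smartmontools `smartctl` utility is installed and that the three disks appear as /dev/sda, /dev/sdb and /dev/sdc; those device names are hypothetical, and the post does not say which tool was actually used. The script only prints the commands, it does not run them.

```python
def smart_check_commands(device, test_type="long"):
    """Return the smartctl invocations for one disk as argument lists.

    The device path is supplied by the caller; nothing is executed here.
    """
    if test_type not in ("short", "long"):
        raise ValueError("test_type must be 'short' or 'long'")
    return [
        # Start the self-test (it runs in the background on the drive).
        ["smartctl", "-t", test_type, device],
        # Once the test has finished, show the self-test log.
        ["smartctl", "-l", "selftest", device],
        # Overall health verdict plus the raw attribute table.
        ["smartctl", "-H", "-A", device],
    ]


# Hypothetical device names; adjust to what the live CD actually shows.
for device in ("/dev/sda", "/dev/sdb", "/dev/sdc"):
    for argv in smart_check_commands(device):
        print(" ".join(argv))
```

The commands normally need root privileges, and the long test can take several hours on a 2 TB disk.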

      From the live CD I can mount the RAID, but when I try to access some files I get a read error.

      My questions are:
      1. If only one drive is reporting errors, why is the machine not booting, and why can't I access the data? Shouldn't a RAID 5 keep working, albeit in degraded mode?
      1b. Does this necessarily mean that one of the other disks also has problems?

      2. How can I be sure that the data on the two disks whose SMART self-test reports OK is really OK?

      3. What would be the steps to restore the RAID (if it is possible at all)? Is it enough to remove the failed disk and install another one?

      Thank you very much.

      Regards,
      Germán

       

        • 1. Re: DH67BL - Help with failing RAID
          sylvia_intel

          Germán, you are not able to boot into your operating system because the OS installed some boot files on the defective hard drive.

          I would recommend that you access the BIOS, check the SATA mode, and make sure it is set to RAID, not AHCI.

           

          Please access the onboard RAID BIOS and check the status of the RAID. If it is degraded you should be able to replace the defective hard drive and rebuild the RAID 5. There is a 50/50 chance the RAID could be rebuilt without problems.
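
To illustrate what "rebuild" means here: in the usual description of RAID 5, each stripe holds data strips plus one parity strip that is the XOR of the data, so any single missing strip can be recomputed from the others. The sketch below shows that idea on a few bytes. It is a toy model under that assumption, not the Intel controller's actual implementation, and the names `xor_blocks` and `rebuild_missing` are made up for the example.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte strings together, byte by byte."""
    if not blocks:
        raise ValueError("need at least one block")
    length = len(blocks[0])
    if any(len(block) != length for block in blocks):
        raise ValueError("all blocks must have the same length")
    return bytes(reduce(lambda x, y: x ^ y, column) for column in zip(*blocks))


def rebuild_missing(surviving_blocks):
    """Recompute the single missing strip of a stripe from all survivors."""
    return xor_blocks(surviving_blocks)


# One stripe of a hypothetical three-disk array: two data strips and parity.
d0 = b"home"
d1 = b"data"
parity = xor_blocks([d0, d1])

# The disk holding d1 is removed: d1 is recovered from d0 and the parity.
recovered = rebuild_missing([d0, parity])
assert recovered == d1
```

The recomputation needs every surviving strip of the stripe to be readable. A read error on one of the remaining disks during the rebuild leaves that stripe unrecoverable, which is one reason a rebuild is not guaranteed to succeed.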

           

          If the status of the RAID is Failed, that means the structure of the RAID has been corrupt since the beginning.

          • 2. Re: DH67BL - Help with failing RAID
            gerferra

            Hello.

             

            Thank you for your reply.

             

            "I would recommend that you access the BIOS, check the SATA mode, and make sure it is set to RAID, not AHCI."

            Chipset SATA Mode = <RAID>

             

            "Please access the onboard RAID BIOS and check the status of the RAID."

            I don't know exactly how to do this. Pressing Ctrl+I on boot, I can enter the "Intel Rapid Storage Technology - Option ROM - 11.0.0.1339". Is this what you're referring to?

             

            The information this program gives me is this:

             

            ID  Name       Level          Strip  Size   Status  Bootable
            0   MAIN_RAID  RAID5(Parity)  64KB   3.6TB  Normal  Yes
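
As a sanity check on the 3.6TB figure: a RAID 5 array gives up one disk's worth of space to parity, so three 2 TB disks leave two disks of usable capacity. The sketch below does the arithmetic, assuming each disk is exactly 2 x 10^12 bytes and that the Option ROM reports sizes in binary units. Both are assumptions; real drives differ slightly and the controller also reserves a little space for metadata.

```python
def raid5_usable_bytes(disk_count, disk_bytes):
    """Usable capacity of a RAID 5 array: one disk's worth goes to parity."""
    if disk_count < 3:
        raise ValueError("RAID 5 needs at least three disks")
    return (disk_count - 1) * disk_bytes


def to_tib(byte_count):
    """Convert bytes to binary terabytes (TiB, 2**40 bytes)."""
    return byte_count / 2**40


# Three disks of nominally 2 TB (decimal) each, as in the original post.
usable = raid5_usable_bytes(3, 2 * 10**12)
print(f"{usable / 10**12:.1f} TB decimal, {to_tib(usable):.2f} TiB binary")
# prints: 4.0 TB decimal, 3.64 TiB binary
```

So the reported size is what a healthy three-disk volume would show; the size alone says nothing about whether a member disk has problems.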

             

            After that, there is some information about the physical devices, but I think it's not relevant here. If it is, please tell me and I will write the details here.

             

            Apparently the "Status" of the volume is "Normal".

             

            So, this makes me ask one more question:

            4. How can the BIOS report the RAID as "Normal" when the SMART self-test of one drive reports FAILED (read errors)?

             

            Thank you.