Home > Intel Communities > Support Community > General Discussion > Discussions
23 Replies Last post: Oct 25, 2009 5:48 PM by Dathon   1 2 Previous Next
theshowmecanuck 9 posts since
Jun 5, 2009
 
Currently Being Moderated

Jun 13, 2009 1:14 AM

matrix manager say drive failed, seagate drive test software says no it isn't

I asked this in graphics and chipsets about a week ago but no-one answered it. I noticed more RAID related questions in this topic, so I am reposting here instead of bumping the old one.

 

Intel RAID Controller: Intel(R) ICH8R/ICH9R/ICH10R/DO/PCH SATA RAID Controller On an Asus P6T Mobo using Windows 7 RC1. I have updated the Mobo bios to latest firmware.

 

I have two volumes defined: one for system disks and one for storage. The system volume is 2 x 640 GB WD drives in RAID1. The storage volume is 4 x 1 TB Seagate Barracuda drives in RAID10. All drives were brand new as of two weeks ago.

 

  • Periodically, the Matrix RAID is telling me that a drive has failed.
  • The first time I replaced the drive with a spare and tested the supposed bad hard drive. The drive was good.
  • Periodically the Matrix Manager would report that different drives had failed.
    • That is, it wasn't necessarily the same drive that was reported as failed each time. It was more or less random.
    • When it happened it would bring the system to its knees until I was able to coax the system to shut down and reboot... it could take a half hour for the system to close open apps before shutting down.
  • I stopped the (re)boot process at the RAID configuration menu (not the board bios)
    • I 'removed' the bad drive and then added it back and resumed the boot process
      • I did NOT physically remove the drive and replace it.
  • After Windows started, the Matrix Manager told me it was rebuilding the array which it was. 
    • The system performance was pretty close to normal until the rebuild completed.
  • After this happened about ten times, I went to the Seagate web site and downloaded their diagnostic software.
    • This was happening mostly to the Seagate drives, but occasionally to the WD drives.

 

The Seagate diagnositic software came on an iso file to create a bootable CD with the software on it.

  • You need to go into the mobo bios and tell the system to treat the drives like regular single drives again.
  • You reboot the computer with the CD in the drive and it boots to FreeDos and then runs the diagnostic software.
  • The program is an old school crude dos type graphical interface with support for the mouse.
  • The Diagnostic software discovers your drives and allows you to do a number of tests including a 'Short Test' and a 'Long Test'.
  • I only tested the Seagate drives since they made up 4 of 6 drives
    • good enough sample size to evaluate whether the problem was the drives or the RAID system since the RAID reported failures happened on more or less random drives when it happened.

 

Drive Test Results

 

Short Test

  • The Short Test checks thing like the SMART status, whether it had ever failed or marked as failed, operating temperatures, etc.
  • The Short Test reported all of the Seagate drives to be GOOD. None had ever failed or were marked as bad.

 

Long Test

  • The Long Test, does the short test plus checks all the blocks on the disk and takes a couple of hours to run.
  • The Long Test reported the drives as GOOD.... no failures, errors, etc.

 

OK People, What's The Scoop?

 

Why would the Intel Matrix Manager etc. continuously and ERRONEOUSLY tell me my drives are failing or failed and bring my system performance down to its knees, and be absolutely a pain in ths _ss?  The drives are good.

 

Anyone tell me how to fix this other than return the Mobo or buy a real RAID card (which I am 3ware 9559SXU-8PL... but I still want to know what is wrong with my Mobo). BTW, now I know why Linux fanboys refer to cards like the 3ware card I am buying as 'real' (the one I am buying has an actual controller risc chip and its own onboard memory (128MB)), and this Intel RAID as FAKERAID. This experience almost has me in the Linux fanboy club as I am very angry about this.

 

Any help is appreciated.

Average User Rating
(0 ratings)




chrisb   1 posts since
Jun 13, 2009
Currently Being Moderated
1. Jun 13, 2009 2:54 AM in response to: theshowmecanuck
Re: matrix manager say drive failed, seagate drive test software says no it isn't

I have had same problem

 

Had Intel matrix storage manager (IMSM) 6.2.  Did not have administrator privileges.

 

Downloaded IMSM 8.x.  Warning came up.  Opened IMSM.  Now able to double click icon in left pane to see two drives in array with one marked as failing.

 

Right click failing drive and 'mark as normal'

 

Hoping this was software and not failing drive but backed up just in case.

amunaor   13 posts since
Jun 5, 2009
Currently Being Moderated
2. Jun 15, 2009 10:28 PM in response to: theshowmecanuck
Re: matrix manager say drive failed, seagate drive test software says no it isn't

I too just experienced a 'degraded' disk failure in a 3x disk RAID 5 configuration - Western Digital RE3 WD2502ABYS-01B7A0 250GB - which incorporated the boot system. Worked fine for about 4 or 5 days and then this.

 

Intel RAID Controller: Intel(R) ICH8R/ICH9R/ICH10R/DO/PCH SATA RAID Controller On DX58SO. I too was using 64-bit Windows 7 RC1. DX58SO (AA# E30149-503) motherboard BIOS is the latest version: 4014. Processor i7-920 D0 stepping.

.

I shut down and removed the failing drive, now I'm awaiting an RMA replacement to test further.

.

Although Intel has released a handful of 'Beta Drivers' for evaluation within Windows 7 RC1, regretfully their latest release of the BIOS - version 4014 - was not one of them. In which case, I suspect ICH10R Matrix Manager and W7 RC1 may be stumbling over each others toes. Once the replacement disk arrives, I may need to limit my RAID affairs until Intel releases a Win7 bonafide BIOS for this set of chips.

.

Since you have an Asus mobo, this may not be the case for you. Anyway, lets hope a few others chime in on this!

.

Best Wishes

Speed_Demon   2 posts since
Jun 26, 2009
Currently Being Moderated
6. Jun 27, 2009 12:17 AM in response to: theshowmecanuck
Re: matrix manager say drive failed, seagate drive test software says no it isn't

I too just experienced a 'Degraded Volume' disk failure in a 4 x disk RAID 10 configuration - Seagate ST3320620AS - which incorporated the boot system. Worked fine for the past few months on Raid Driver 8.6.0.1007.  Seqence of events as follows:  loaded updated driver version 8.9.0.1023 three days ago.  First drive marked as failed occured two days ago.  Drive replaced and rebuilt.  Second drive failed today.  I attempted to plug the first failed drive into an open port on an Intel DG965WH motherboard in order to test it.  Raid hardware did not recognize the drive, much to my surprise.  Plugged in a backup drive to the other open port to see what would happen.  Much to my surprise again, the drive is not recognized by the hardware.  This is interesting as this backup drive has been plugged in previously when I needed to reload or transfer files for backup purposes.  Never had any recognition issues with this drive being powered up and plugged in on a rare occasional basis.   My suspicion at this point is either the 8.9.0.1023 driver itself or possibly the driver load, which occured without any warning of a problem occuring.  Looking forward to sorting this situation out finally, hopefully without losing anything on the drive and reverting back to driver V 8.6.0.1007.  If anyone has any ideas on why the hardware isn't recognizing a single "non-raid" drive being plugged into the motherboard I'd be interested.

mark7   8 posts since
Jun 25, 2009
Crazy_Train   30 posts since
Jul 1, 2009

It might be an issue with error handling in SATA hard drives. There's a good article on Wikipedia about it:

 

Time-Limited Error Recovery

 

"Modern hard drives feature an ability to recover from some read/write errors by internally remapping sectors and other forms of self test and recovery. The process for this can sometimes take several seconds or (under heavy usage) minutes, during which time the drive is unresponsive. RAID controllers are designed to recognize a drive which does not respond within a few seconds, and mark it as unreliable, indicating that it should be withdrawn from use and the array rebuilt from parity data. This is a long process, degrades performance, and if a second drive should fail under the resulting additional workload, it can be catastrophic.

 

If the drive itself is inherently reliable but has some bad sectors, then TLER and similar features prevent a disk from being unnecessarily marked as 'failed' by limiting the time spent on correcting detected errors before advising the array controller of a failed operation. The array controller can then handle the data recovery for the limited amount involved, rather than marking the entire drive as faulty."

 

http://en.wikipedia.org/wiki/Time-Limited_Error_Recovery

 

This might be the root of your problem.

DonN   3 posts since
Jul 8, 2009

I have an Intel DG965WH motherboard board with Windows Vista SP1.  The computer is fairly clean in terms of applications (latest version of Norton Internet Security, MS office, Winamp, and an FTP tool) and has been rock solid for the last year.  The comptuer is generally rebooted once a month after Microsoft releases patches, otherwise the computer is always on and connected to an APC UPS backup power supply.  I updated the Intel Matrix manager from 8.7 to 8.9 less than a week ago.  Computer has since frozen twice requiring a hard reboot and today one of the 3 hard drives was listed as failed in the RAID 5 array.  Also received a few errors in the Windows Event log with regards to "A request to write to the a file succeeded, but took an abnormally long time to be serviced by the OS." I went into the Intel Matrix Manager, right clicked on the failed hard drive, set it to normal and the RAID array rebuilt successfully.   Found other postings scattered across various blogs and support forums with regards to failed drives being reported via the Intel Matrix Manager after upgrading to 8.8 or 8.9.  Users also mentioned that they were testing the reported failed hard drives using the MFG hard drive test utilty and the drives were okay.  I found a copy of the Intel Matrix Manager 8.7 on Intel's website and downloaded + installed.  I'll try to get back here in a week or so to let you know if I'm back on a stable platform. If anyone's interested in reverting to 8.7, Google STOR_allOS_8.7.0.1007_PV.exe.  Appears Intel has removed most of the 8.7 downloads from thier site, however, I did manage to find it listed under the downloads section for one of the Intel boards. 

Speed_Demon   2 posts since
Jun 26, 2009
Currently Being Moderated
11. Jul 8, 2009 11:39 PM in response to: DonN
Re: matrix manager say drive failed, seagate drive test software says no it isn't

I've reverted back to 8.6.0.1007, running stable, as it was before the update. Sorted out the spare drive situation, it was a power cable issue and so far the array appears to be running without any issues.  I see that I'm not the only one with issues after updating to 8.9 including array rebuilds.  Using Seagate's Seatools, I did not find any problems with the drives that were reported as failed.  Interesting to say the least.......

Crazy_Train   30 posts since
Jul 1, 2009
Currently Being Moderated
12. Jul 9, 2009 9:17 AM in response to: DonN
Re: matrix manager say drive failed, seagate drive test software says no it isn't

It looks like Intel tweaked a timing setting in the Matrix Storage Manager 8.8/8.9, and it occasionally causes the software to drop a drive out of the array. Nice....  I haven't seen this issue myself, but the Intel RAID setups that I support are running 8.6 or earlier. Here are the URLs for downloading 8.7 from Intel:

 

Intel Matrix Storage Manager 8.7.0.1007 - executable
http://downloadcenter.intel.com/Detail_Desc.aspx?strState=LIVE&ProductID=2101&DwnldID=17268&agr=Y&lang=eng&PrdMap=2101

 

32-bit floppy configuration utility
http://downloadcenter.intel.com/Detail_Desc.aspx?strState=LIVE&ProductID=2101&DwnldID=17269&agr=Y&lang=eng&PrdMap=2101

 

64-bit floppy configuration utility
http://downloadcenter.intel.com/Detail_Desc.aspx?strState=LIVE&ProductID=2101&DwnldID=17270&agr=Y&lang=eng&PrdMap=2101

DonN   3 posts since
Jul 8, 2009
Currently Being Moderated
13. Jul 9, 2009 9:22 PM in response to: Crazy_Train
Re: matrix manager say drive failed, seagate drive test software says no it isn't

Thanks for posting the link - yes, that was the file downloaded and used to revert to 8.7.  Hopefully this is helpful to others.

mark7   8 posts since
Jun 25, 2009
Currently Being Moderated
14. Jul 12, 2009 1:23 PM in response to: DonN
Re: matrix manager say drive failed, seagate drive test software says no it isn't

driveragent2_2009-02-11_en[1].exei updated all my chipset drivers bar 1 i got the usb mass storage device im buying a new sound card will , driver update download what i need if i buy it thanks drivers out of date , i need the smbus contoller 9 and these usb 2.0

More Like This

  • Retrieving data ...