1) Make sure you are running the most current code stack. (BMC in particulate since it sounds like you may be locking up the BMC)
2) Update (or reflash) the RMM2 firmware. (look under maintance - Device Information to see what version you have) http://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=18116&lang=eng
IPMI sensor commands access the BMC to report the vaules logged in the FRUSDR.
The RMM2 is a interface to the same path.
Can you try:
Remove AC power for 30 seconds and then power on again
Boot to BIOS set-up (F2) and navagate to the Server Management - System Infomation
Verfy that the board part number , serial number and BMC firmware revision all have data or not. (If data is present, BMC is running)
Verfy the RMM2 to see if it is working now after the AC cycle.
I've got exactly the same problem, No Sensor + IPMI CH#3 is not working most of the time (there is some time it's working and stop working later on). I also saw "Unknown Error" in BMC's event log from time to time.
Right now, I removed RMM2 and there is no "Unknown Error" in BMC's event log anymore.
Would Intel's engineer help me conclude the source of problem ?
- it's the defective of my RMM2 ..
- or -
- it's because of the difference of stepping of the two CPUs on my system ? -- @mini , does your system the same as me ?
1) Information please.
What FW is on your RMM2?
What BMC (& BIOS) version are you using?
What motherboard or system are you using?
Update both the RMM2 firmware and the BMC (and BIOS) to the most current.
RMM2 & the BMC are very closely tight together, A miss match between them can cause issues.
Use the SELVIEW tool you your product and dump the SEL log in Text and HEX format and attach. Without the HEX SEL data, the Unknow error is Unknown.
The BMC \ RMM2 comunications is IPMI based. Are you running any IPMI software?
Also What OS are you running?
Miss matched processors.
Not advised but if the same speed and cache size will likly not have issues.
Thanks for your reply ..
All the firmware are the lastest one available on download site of intel.com. The system is SR1500SAS (S5000PAL) with 2x E5310 Xeon (stepping is differ).
The problem occured even the OS has not been loaded but the KVM functionality is working fine all the time.
I will put the RMM2 back some later time (as soon as possible) and get the HEX dump per your advice.
I'm really appreciated with your help.
Mother board firmware = BIOS 101 / BMC 64/ SDR 48? (You may want to reflash just to insure everything is correct.)
RMM2 firmware =6717 (You may want to reflash just to insure everything is correct.)
Selview for Windows & Linux http://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=18871&lang=eng
You do not need to install the RMM2 to get the SEL log. You have to get it using one of the viewer tools.
These tools report the entire log data (in Text or Hex) where the RMM log is trunketed.