8 Replies Latest reply on Aug 2, 2011 6:37 AM by symie

    processor temperature

    symie

      Hello,

       

      I have a Server Board S3420GPLX with a Xeon X3450, BIOS 49, BMC124, FRUSDR09,

      with the RMM3 module.

      I cannot find a CPU temperature reading either in the BIOS or through the RMM3.
      Where can I see the CPU temperature on the S3420GPLX?

       

      greetings

        • 1. Re: processor temperature
          symie

          The Remote Management Module 3 shows the following under Sensor Readings - Temperature Sensors:

          Name:

          CPU Therm Trip

          Status:

          reports it has been deasserted, reports it has been asserted

          Reading:

          0x0080

           

          What does this mean?

          There must be a way to find out the temperature of the CPU ...

          Can someone help me?

          • 2. Re: processor temperature
            Doc_SilverCreek

            The actual temperature of the CPU is not reported, as it is a meaningless number without knowing the maximum temperature rating for your processor.

            P1 Therm Margin is the current CPU temperature minus the maximum allowed CPU temperature.

            Therm Trip assert indicates the CPU overheated and the system shut down.

            Therm Trip deassert indicates it is no longer in a critical thermal range.

            • 3. Re: processor temperature
              symie

              Thank you for responding.

              Can you help me interpret the following from my RMM3 health page:

               

              Name              Status    Reading
              Baseboard Temp    Normal    25 degrees C
              P1 Mem Margin     Normal    -47 degrees C
              P1 Therm Margin   Normal    -79 degrees C
              P1 Therm Ctrl %   Normal    Not Available
              CPU Therm Trip    reports it has been deasserted, reports it has been asserted    0x0080

               

              I have a Xeon X3450 (TDP 95W) on an S3420GP with a large cooler (Noctua NH-C12P SE14).

              The "CPU Therm Trip" row with "reports ..." is always there and always the same. Is this normal,

              or might it indicate an error?

              Is there a way to read the CPU temperature under Debian Linux?


              • 4. Re: processor temperature
                Doc_SilverCreek

                At first glance I was thinking you must have one heck of a CPU cooler to get a margin of -79C. Looking at a photo of this cooler, it appears that you do!

                 

                Even parsec should be happy with this one.

                 

                This is the thermal limit on an X3450 processor: http://ark.intel.com/products/42929/Intel-Xeon-Processor-X3450-(8M-Cache-2_66-GHz)

                 

                TCASE
                72.7°C

                 

                Tcore will run slightly higher (usually 5 to 10 degrees).

                Therm Margin is based off Tcore, so a -79 therm margin would indicate the CPU core temperature is somewhere between -1C and 5C.

                These numbers are too low to be believable.

                Unless you have ice forming on this heat sink, it must be above 0C.

                The ambient air read by the Baseboard Temp sensor is 25C (which is a little cooler than normal, but not that much!)

                The data I found on this heat sink indicates you should expect an 8.9 to 14.1 degree rise over ambient. http://www.frostytech.com/articleview.cfm?articleid=2562&page=5

                Best case, your therm margin should be around -45C.
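
The estimate above can be reproduced as a small calculation. This is only a sketch of the reasoning in this post: the function names are made up for illustration, and it assumes, as the post does, that the margin is measured against a Tcore limit 5 to 10 degrees above the 72.7°C TCASE figure.

```python
# Sketch of the sanity check in this post; all names are illustrative only.
TCASE_MAX_C = 72.7            # TCASE from the ARK page quoted above
TCORE_OFFSET_C = (5.0, 10.0)  # assumption from the post: Tcore runs 5-10 degrees above TCASE


def implied_core_temp(margin_c):
    """Return the (low, high) core temperature implied by a therm margin.

    margin = core temperature - maximum allowed core temperature,
    so core temperature = maximum + margin.
    """
    low = TCASE_MAX_C + TCORE_OFFSET_C[0] + margin_c
    high = TCASE_MAX_C + TCORE_OFFSET_C[1] + margin_c
    return low, high


def expected_margin(ambient_c, rise_c):
    """Return the (best, worst) margin expected for a cooler's (low, high) rise over ambient."""
    # Best case: smallest rise measured against the highest assumed limit.
    best = ambient_c + rise_c[0] - (TCASE_MAX_C + TCORE_OFFSET_C[1])
    # Worst case: largest rise measured against the lowest assumed limit.
    worst = ambient_c + rise_c[1] - (TCASE_MAX_C + TCORE_OFFSET_C[0])
    return best, worst


if __name__ == "__main__":
    low, high = implied_core_temp(-79.0)
    print(f"implied core temperature: {low:.1f} to {high:.1f} C")
    best, worst = expected_margin(25.0, (8.9, 14.1))
    print(f"expected margin: {best:.1f} to {worst:.1f} C")
```

Under these assumptions a -79 margin implies a core temperature of roughly -1°C to 4°C, while 25°C ambient plus the quoted rise gives an expected margin of roughly -49 to -39, the same region as the -45 figure above.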

                 

                So, the conclusion is that something is wrong.

                My first guess: check your CPU socket for bent pins.

                Second, what is the part number printed on the top of the CPU?

                It should be a number starting with S or Q (Sxxxx or Qxxxx).

                 

                 

                P1 Mem Margin is the thermal headroom until memory throttling. -- Looks great

                For the CPU Therm Trip, try clearing the SEL log before you check the CPU and see if this comes back.

                • 5. Re: processor temperature
                  Doc_SilverCreek

                  Missed the Linux part.

                  Must be a mental block.

                  Linux and I don't communicate too well.

                  I am sure there are tools to read the temperature directly in Linux, I just don't know what they are.

                   

                  CPUID hardware monitor will do it in Windows  http://www.cpuid.com/softwares/hwmonitor.html

                  (Great company & software by the way.)

                   

                  ipmitool will read the sensors, but that should be the same as what you get on the RMM3.

                  Looks like lm-sensors can give you something. I don't know if it is a direct read of the processor MSR or if it also gets it from the BMC SDR sensors.

                   

                  http://www.howtoforge.com/forums/showthread.php?t=2790
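
For reading the temperature directly under Linux, here is a minimal sketch. It assumes a kernel that exposes sensors through the hwmon sysfs tree under /sys/class/hwmon, and that a CPU sensor driver such as coretemp is loaded. The function name is made up for illustration, and older kernels may lay the files out differently.

```python
from pathlib import Path


def read_hwmon_temps(root="/sys/class/hwmon"):
    """Return {"chip (hwmonN) label": degrees_C} for every readable temperature input."""
    temps = {}
    for chip_dir in sorted(Path(root).glob("hwmon*")):
        name_file = chip_dir / "name"
        chip = name_file.read_text().strip() if name_file.exists() else "unknown"
        for input_file in sorted(chip_dir.glob("temp*_input")):
            # A matching tempN_label file is optional; fall back to the file name.
            label_file = input_file.with_name(input_file.name.replace("_input", "_label"))
            label = label_file.read_text().strip() if label_file.exists() else input_file.name
            try:
                # hwmon reports temperatures as integer millidegrees Celsius.
                value = int(input_file.read_text().strip()) / 1000.0
            except (OSError, ValueError):
                continue  # sensor exists but cannot be read right now
            temps[f"{chip} ({chip_dir.name}) {label}"] = value
    return temps


if __name__ == "__main__":
    for sensor, value in read_hwmon_temps().items():
        print(f"{sensor}: {value:.1f} C")
```

If the script prints nothing, the sensor driver is probably not loaded. The hwmon directory name is part of each key so that several chips with the same name do not overwrite each other.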

                  • 6. Re: processor temperature
                    symie

                    Doc_SilverCreek wrote:


                    So, the conclusion is that something is wrong.

                    My first guess: check your CPU socket for bent pins.

                    Second, what is the part number printed on the top of the CPU?

                    It should be a number starting with S or Q (Sxxxx or Qxxxx).

                    P1 Mem Margin is the thermal headroom until memory throttling. -- Looks great

                    For the CPU Therm Trip, try clearing the SEL log before you check the CPU and see if this comes back.

                     

                    Hello Doc_SilverCreek,

                    thanks for your helpful answer.

                    At present I cannot believe that there is a physical fault.

                    I worked very carefully ...

                    The number starting with S on the box is the S-spec: SLBLD

                    Last night I tested the system (CPU, HDD, etc.) with a Linux stress test called "stress" (it is in the repositories),

                    and within a few seconds the temperature changed from -76°C to -53°C.

                    Once the test stops, the temperature goes back to between -74°C and -76°C within a few seconds.

                    And the stress test always completed successfully.

                    I have to think about it ...

                     

                    thank you

                    • 7. Re: processor temperature
                      Doc_SilverCreek

                      Well, 13 deg rise under stress is about right so I am stumped.

                      • 8. Re: processor temperature
                        symie

                        Hello,

                         

                        There is a Linux live CD called stresslinux: http://www.stresslinux.org/sl/wiki

                        It can search for temperature sensors and found 8:

                        1-4: Core0 to Core3

                        5-8: jc42.. - temp1

                        Idle: core0 26°C, core3 19°C

                        You can run the stress test on console 1 (Alt+F1) and watch the sensors on console 12 (Alt+F12).

                        When I run the stress test, within the same second the temperature rises to: core0 48°C, core3 36°C

                        After the test it also drops back quickly to ~26°C.

                        (Sensors 5-8 read around 27 - 32°C.)
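
Putting these core readings next to the RMM3 margins reported earlier in the thread gives a rough cross-check. This is only a sketch. It assumes the margin and the core0 reading describe the same sensor under comparable load, which the thread does not confirm, and the function name is made up for illustration.

```python
# Rough cross-check using numbers quoted in this thread; names are illustrative.
# Assumption: margin = core temperature - limit, so limit = core temperature - margin.


def implied_limit(core_temp_c, margin_c):
    """Return the temperature limit implied by a core reading and a therm margin."""
    return core_temp_c - margin_c


if __name__ == "__main__":
    # Idle: core0 26 C (stresslinux), margin about -76 C (RMM3).
    print("idle:", implied_limit(26.0, -76.0))
    # Under stress: core0 48 C (stresslinux), margin about -53 C (RMM3).
    print("load:", implied_limit(48.0, -53.0))
```

Both cases land at about 101 to 102°C. Under these assumptions the margin would be measured against a limit near 100°C rather than a figure close to the 72.7°C TCASE, which would make the large negative margins compatible with the plausible core readings above.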

                         

                        I think everything is okay

                         

                        I'll ignore the persistent message in the RMM3:

                        CPU Therm Trip    reports it has been deasserted, reports it has been asserted    0x0080

                         

                        greetings