1 2 Previous Next 15 Replies Latest reply on Dec 11, 2017 9:34 AM by GeraldP

    NUC7I3BNH freezes - iwlwifi Microcode SW error detected

    cville

      For 2 months I've tried to deploy a NUC7I3BNH without success as a result of random system freezes. I'm running Ubuntu 17.04 (GNU/Linux 4.10.0-28-generic x86_64)  - server edition in a headless configuration.

      The first task I tried to run on this system is loading data via Bluetooth from my PV inverter. On the system's first day, I encountered Bluetooth problems, i.e. corrupted messages and a frozen Bluetooth stack that forced me to restart the BT service over and over again.

       

      I tried the following:

      • upgraded the BIOS to the (at the time) latest version: 0047
      • upgraded the iwlwifi firmware to version 22.361476.0

      • configured Monit to automatically check and if necessary restart the Bluetooth service
      • log the CPU temperature in a database as the temperature increases significantly during a freeze:
        2017-07-21_14-39-56.jpg

       

       

      All of the above did not result in any improvements as the system usually freezes (no longer responds to terminal commands or ping requests) after a few days but sometimes only after minutes. Today I noticed for the first time an anomaly in the dmesg output (see below): +0.001852] iwlwifi 0000:3a:00.0: Microcode SW error detected.  Restarting 0x2000000. (occurs multiple times).

       

      I now have the following questions:

      1. is this most likely a hardware or a software problem (i.e. iwlwifi driver)?
      2. what else can I do to find the root cause?

       

      Any help will be much appreciated.

       

      Output from dmesg:

      pi@eon:[~]: dmesg|grep iwlwifi

      [  +0.000048] iwlwifi 0000:3a:00.0: enabling device (0000 -> 0002)

      [  +0.004545] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-26.ucode failed with error -2

      [  +0.000019] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-25.ucode failed with error -2

      [  +0.000102] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-24.ucode failed with error -2

      [  +0.000013] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-23.ucode failed with error -2

      [  +0.003230] iwlwifi 0000:3a:00.0: loaded firmware version 22.361476.0 op_mode iwlmvm

      [  +0.008316] iwlwifi 0000:3a:00.0: Detected Intel(R) Dual Band Wireless AC 8265, REV=0x230

      [  +0.002038] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000618] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.001193] iwlwifi 0000:3a:00.0 wlp58s0: renamed from wlan0

      [  +0.001127] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000254] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.115551] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.001171] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.006814] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000630] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.115211] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000261] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [Jul21 01:38] iwlwifi 0000:3a:00.0: Microcode SW error detected.  Restarting 0x82000000.

      [  +0.000105] iwlwifi 0000:3a:00.0: CSR values:

      [  +0.000050] iwlwifi 0000:3a:00.0: (2nd byte of CSR_INT_COALESCING is CSR_INT_PERIODIC_REG)

      [  +0.000090] iwlwifi 0000:3a:00.0:        CSR_HW_IF_CONFIG_REG: 0X00c89000

      [  +0.000087] iwlwifi 0000:3a:00.0:          CSR_INT_COALESCING: 0X00000040

      [  +0.000106] iwlwifi 0000:3a:00.0:                     CSR_INT: 0X00000000

      [  +0.000085] iwlwifi 0000:3a:00.0:                CSR_INT_MASK: 0X00000000

      [  +0.000074] iwlwifi 0000:3a:00.0:           CSR_FH_INT_STATUS: 0X00000000

      [  +0.000075] iwlwifi 0000:3a:00.0:                 CSR_GPIO_IN: 0X00000010

      [  +0.000074] iwlwifi 0000:3a:00.0:                   CSR_RESET: 0X00000000

      [  +0.000074] iwlwifi 0000:3a:00.0:                CSR_GP_CNTRL: 0X08040005

      [  +0.000075] iwlwifi 0000:3a:00.0:                  CSR_HW_REV: 0X00000230

      [  +0.000074] iwlwifi 0000:3a:00.0:              CSR_EEPROM_REG: 0Xd55555d5

      [  +0.000074] iwlwifi 0000:3a:00.0:               CSR_EEPROM_GP: 0Xd55555d5

      [  +0.000075] iwlwifi 0000:3a:00.0:              CSR_OTP_GP_REG: 0Xd55555d5

      [  +0.000074] iwlwifi 0000:3a:00.0:                 CSR_GIO_REG: 0X001f0044

      [  +0.000076] iwlwifi 0000:3a:00.0:            CSR_GP_UCODE_REG: 0X00000000

      [  +0.000109] iwlwifi 0000:3a:00.0:           CSR_GP_DRIVER_REG: 0X00000000

      [  +0.000126] iwlwifi 0000:3a:00.0:           CSR_UCODE_DRV_GP1: 0X00000000

      [  +0.000074] iwlwifi 0000:3a:00.0:           CSR_UCODE_DRV_GP2: 0X00000000

      [  +0.000074] iwlwifi 0000:3a:00.0:                 CSR_LED_REG: 0X00000060

      [  +0.000075] iwlwifi 0000:3a:00.0:        CSR_DRAM_INT_TBL_REG: 0X8826b7cc

      [  +0.000074] iwlwifi 0000:3a:00.0:        CSR_GIO_CHICKEN_BITS: 0X07800200

      [  +0.000074] iwlwifi 0000:3a:00.0:             CSR_ANA_PLL_CFG: 0Xd55555d5

      [  +0.000074] iwlwifi 0000:3a:00.0:      CSR_MONITOR_STATUS_REG: 0Xc03803c0

      [  +0.000074] iwlwifi 0000:3a:00.0:           CSR_HW_REV_WA_REG: 0X0001001a

      [  +0.000074] iwlwifi 0000:3a:00.0:        CSR_DBG_HPET_MEM_REG: 0Xffff0010

      [  +0.000072] iwlwifi 0000:3a:00.0: FH register values:

      [  +0.000071] iwlwifi 0000:3a:00.0:         FH_RSCSR_CHNL0_STTS_WPTR_REG: 0X27004b00

      [  +0.000092] iwlwifi 0000:3a:00.0:        FH_RSCSR_CHNL0_RBDCB_BASE_REG: 0X027358c0

      [  +0.000092] iwlwifi 0000:3a:00.0:                  FH_RSCSR_CHNL0_WPTR: 0X00000008

      [  +0.000092] iwlwifi 0000:3a:00.0:         FH_MEM_RCSR_CHNL0_CONFIG_REG: 0X80801054

      [  +0.000092] iwlwifi 0000:3a:00.0:          FH_MEM_RSSR_SHARED_CTRL_REG: 0X000000fc

      [  +0.000094] iwlwifi 0000:3a:00.0:            FH_MEM_RSSR_RX_STATUS_REG: 0X07830000

      [  +0.000092] iwlwifi 0000:3a:00.0:    FH_MEM_RSSR_RX_ENABLE_ERR_IRQ2DRV: 0X00000000

      [  +0.000092] iwlwifi 0000:3a:00.0:                FH_TSSR_TX_STATUS_REG: 0X07ff0000

      [  +0.000092] iwlwifi 0000:3a:00.0:                 FH_TSSR_TX_ERROR_REG: 0X00000000

      [  +0.000198] iwlwifi 0000:3a:00.0: Start IWL Error Log Dump:

      [  +0.000062] iwlwifi 0000:3a:00.0: Status: 0x00000000, count: 6

      [  +0.000063] iwlwifi 0000:3a:00.0: Loaded firmware version: 22.361476.0

      [  +0.000071] iwlwifi 0000:3a:00.0: 0x000028AA | ADVANCED_SYSASSERT

      [  +0.000074] iwlwifi 0000:3a:00.0: 0x000002F0 | trm_hw_status0

      [  +0.000062] iwlwifi 0000:3a:00.0: 0x00000000 | trm_hw_status1

      [  +0.000061] iwlwifi 0000:3a:00.0: 0x0000E258 | branchlink2

      [  +0.000061] iwlwifi 0000:3a:00.0: 0x0002730C | interruptlink1

      [  +0.001435] iwlwifi 0000:3a:00.0: 0x00000000 | interruptlink2

      [  +0.000686] iwlwifi 0000:3a:00.0: 0x00000003 | data1

      [  +0.000706] iwlwifi 0000:3a:00.0: 0x00000000 | data2

      [  +0.000671] iwlwifi 0000:3a:00.0: 0x000332AD | data3

      [  +0.000651] iwlwifi 0000:3a:00.0: 0x7EC123A1 | beacon time

      [  +0.000657] iwlwifi 0000:3a:00.0: 0xAF17DC46 | tsf low

      [  +0.000626] iwlwifi 0000:3a:00.0: 0x00000113 | tsf hi

      [  +0.000621] iwlwifi 0000:3a:00.0: 0x00000000 | time gp1

      [  +0.000616] iwlwifi 0000:3a:00.0: 0x4EE65329 | time gp2

      [  +0.000614] iwlwifi 0000:3a:00.0: 0x00000000 | uCode revision type

      [  +0.000625] iwlwifi 0000:3a:00.0: 0x00000016 | uCode version major

      [  +0.000499] iwlwifi 0000:3a:00.0: 0x00058404 | uCode version minor

      [  +0.000431] iwlwifi 0000:3a:00.0: 0x00000230 | hw version

      [  +0.000375] iwlwifi 0000:3a:00.0: 0x00C89000 | board version

      [  +0.000377] iwlwifi 0000:3a:00.0: 0x0000001C | hcmd

      [  +0.000376] iwlwifi 0000:3a:00.0: 0x00022000 | isr0

      [  +0.000374] iwlwifi 0000:3a:00.0: 0x00000000 | isr1

      [  +0.000374] iwlwifi 0000:3a:00.0: 0x08001802 | isr2

      [  +0.000371] iwlwifi 0000:3a:00.0: 0x004154C0 | isr3

      [  +0.000366] iwlwifi 0000:3a:00.0: 0x00000000 | isr4

      [  +0.000375] iwlwifi 0000:3a:00.0: 0x00000110 | last cmd Id

      [  +0.000372] iwlwifi 0000:3a:00.0: 0x00000000 | wait_event

      [  +0.000392] iwlwifi 0000:3a:00.0: 0x0000E22F | l2p_control

      [  +0.000610] iwlwifi 0000:3a:00.0: 0x00000000 | l2p_duration

      [  +0.000610] iwlwifi 0000:3a:00.0: 0x00000003 | l2p_mhvalid

      [  +0.000498] iwlwifi 0000:3a:00.0: 0x00000020 | l2p_addr_match

      [  +0.000363] iwlwifi 0000:3a:00.0: 0x0000000D | lmpm_pmg_sel

      [  +0.000362] iwlwifi 0000:3a:00.0: 0x03071928 | timestamp

      [  +0.000364] iwlwifi 0000:3a:00.0: 0x00000818 | flow_handler

      [  +0.000408] iwlwifi 0000:3a:00.0: Start IWL Error Log Dump:

      [  +0.000356] iwlwifi 0000:3a:00.0: Status: 0x00000000, count: 7

      [  +0.000366] iwlwifi 0000:3a:00.0: 0x00000070 | ADVANCED_SYSASSERT

      [  +0.000569] iwlwifi 0000:3a:00.0: 0x00000000 | umac branchlink1

      [  +0.000501] iwlwifi 0000:3a:00.0: 0xC0082F64 | umac branchlink2

      [  +0.000343] iwlwifi 0000:3a:00.0: 0xC0081000 | umac interruptlink1

      [  +0.000335] iwlwifi 0000:3a:00.0: 0xC0081000 | umac interruptlink2

      [  +0.000333] iwlwifi 0000:3a:00.0: 0x00000800 | umac data1

      [  +0.000322] iwlwifi 0000:3a:00.0: 0xC0081000 | umac data2

      [  +0.000314] iwlwifi 0000:3a:00.0: 0xDEADBEEF | umac data3

      [  +0.000318] iwlwifi 0000:3a:00.0: 0x00000016 | umac major

      [  +0.000304] iwlwifi 0000:3a:00.0: 0x00058404 | umac minor

      [  +0.000301] iwlwifi 0000:3a:00.0: 0xC0886280 | frame pointer

      [  +0.000313] iwlwifi 0000:3a:00.0: 0xC0886280 | stack pointer

      [  +0.000474] iwlwifi 0000:3a:00.0: 0x0033010D | last host cmd

      [  +0.000474] iwlwifi 0000:3a:00.0: 0x00000000 | isr status reg

      [  +0.492710] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000255] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.116244] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000264] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [ +19.276917] iwlwifi 0000:3a:00.0: regular scan timed out

      [  +0.001852] iwlwifi 0000:3a:00.0: Microcode SW error detected.  Restarting 0x2000000.

      [  +0.001073] iwlwifi 0000:3a:00.0: CSR values:

      [  +0.001127] iwlwifi 0000:3a:00.0: (2nd byte of CSR_INT_COALESCING is CSR_INT_PERIODIC_REG)

      [  +0.002015] iwlwifi 0000:3a:00.0:        CSR_HW_IF_CONFIG_REG: 0X00c89000

      [  +0.001192] iwlwifi 0000:3a:00.0:          CSR_INT_COALESCING: 0X00000040

      [  +0.000350] iwlwifi 0000:3a:00.0:                     CSR_INT: 0X00000000

      [  +0.000366] iwlwifi 0000:3a:00.0:                CSR_INT_MASK: 0X00000000

      [  +0.000497] iwlwifi 0000:3a:00.0:           CSR_FH_INT_STATUS: 0X00000000

      [  +0.000355] iwlwifi 0000:3a:00.0:                 CSR_GPIO_IN: 0X00000010

      [  +0.000352] iwlwifi 0000:3a:00.0:                   CSR_RESET: 0X00000000

      [  +0.000352] iwlwifi 0000:3a:00.0:                CSR_GP_CNTRL: 0X08040005

      [  +0.000353] iwlwifi 0000:3a:00.0:                  CSR_HW_REV: 0X00000230

      [  +0.000350] iwlwifi 0000:3a:00.0:              CSR_EEPROM_REG: 0Xd55555d5

      [  +0.000359] iwlwifi 0000:3a:00.0:               CSR_EEPROM_GP: 0Xd55555d5

      [  +0.000353] iwlwifi 0000:3a:00.0:              CSR_OTP_GP_REG: 0Xd55555d5

      [  +0.000346] iwlwifi 0000:3a:00.0:                 CSR_GIO_REG: 0X001f0044

      [  +0.000354] iwlwifi 0000:3a:00.0:            CSR_GP_UCODE_REG: 0X00000000

      [  +0.000362] iwlwifi 0000:3a:00.0:           CSR_GP_DRIVER_REG: 0X00000000

      [  +0.000578] iwlwifi 0000:3a:00.0:           CSR_UCODE_DRV_GP1: 0X00000000

      [  +0.000576] iwlwifi 0000:3a:00.0:           CSR_UCODE_DRV_GP2: 0X00000000

      [  +0.000505] iwlwifi 0000:3a:00.0:                 CSR_LED_REG: 0X00000060

      [  +0.000358] iwlwifi 0000:3a:00.0:        CSR_DRAM_INT_TBL_REG: 0X8826b7cc

      [  +0.000355] iwlwifi 0000:3a:00.0:        CSR_GIO_CHICKEN_BITS: 0X07800200

      [  +0.000362] iwlwifi 0000:3a:00.0:             CSR_ANA_PLL_CFG: 0Xd55555d5

      [  +0.000362] iwlwifi 0000:3a:00.0:      CSR_MONITOR_STATUS_REG: 0Xc03803c0

      [  +0.000363] iwlwifi 0000:3a:00.0:           CSR_HW_REV_WA_REG: 0X0001001a

      [  +0.000373] iwlwifi 0000:3a:00.0:        CSR_DBG_HPET_MEM_REG: 0Xffff0010

      [  +0.000610] iwlwifi 0000:3a:00.0: FH register values:

      [  +0.000528] iwlwifi 0000:3a:00.0:         FH_RSCSR_CHNL0_STTS_WPTR_REG: 0X27004b00

      [  +0.000394] iwlwifi 0000:3a:00.0:        FH_RSCSR_CHNL0_RBDCB_BASE_REG: 0X027358c0

      [  +0.000397] iwlwifi 0000:3a:00.0:                  FH_RSCSR_CHNL0_WPTR: 0X00000060

      [  +0.000400] iwlwifi 0000:3a:00.0:         FH_MEM_RCSR_CHNL0_CONFIG_REG: 0X00801054

      [  +0.000403] iwlwifi 0000:3a:00.0:          FH_MEM_RSSR_SHARED_CTRL_REG: 0X000000fc

      [  +0.000404] iwlwifi 0000:3a:00.0:            FH_MEM_RSSR_RX_STATUS_REG: 0X03030000

      [  +0.000406] iwlwifi 0000:3a:00.0:    FH_MEM_RSSR_RX_ENABLE_ERR_IRQ2DRV: 0X00000000

      [  +0.000414] iwlwifi 0000:3a:00.0:                FH_TSSR_TX_STATUS_REG: 0X07ff0000

      [  +0.000426] iwlwifi 0000:3a:00.0:                 FH_TSSR_TX_ERROR_REG: 0X00000000

      [  +0.000791] iwlwifi 0000:3a:00.0: Start IWL Error Log Dump:

      [  +0.000571] iwlwifi 0000:3a:00.0: Status: 0x00000000, count: 6

      [  +0.000417] iwlwifi 0000:3a:00.0: Loaded firmware version: 22.361476.0

      [  +0.000422] iwlwifi 0000:3a:00.0: 0x00000084 | NMI_INTERRUPT_UNKNOWN

      [  +0.000430] iwlwifi 0000:3a:00.0: 0x000002F0 | trm_hw_status0

      [  +0.000429] iwlwifi 0000:3a:00.0: 0x00000000 | trm_hw_status1

      [  +0.000425] iwlwifi 0000:3a:00.0: 0x0000E258 | branchlink2

      [  +0.000445] iwlwifi 0000:3a:00.0: 0x00CCA90E | interruptlink1

      [  +0.000712] iwlwifi 0000:3a:00.0: 0x0002730C | interruptlink2

      [  +0.000518] iwlwifi 0000:3a:00.0: 0x00000000 | data1

      [  +0.000418] iwlwifi 0000:3a:00.0: 0x00000080 | data2

      [  +0.000408] iwlwifi 0000:3a:00.0: 0x07830000 | data3

      [  +0.000401] iwlwifi 0000:3a:00.0: 0x1EC067F8 | beacon time

      [  +0.000395] iwlwifi 0000:3a:00.0: 0xB047B7EA | tsf low

      [  +0.000380] iwlwifi 0000:3a:00.0: 0x00000113 | tsf hi

      [  +0.000382] iwlwifi 0000:3a:00.0: 0x00000000 | time gp1

      [  +0.000385] iwlwifi 0000:3a:00.0: 0x01257EAF | time gp2

      [  +0.000620] iwlwifi 0000:3a:00.0: 0x00000000 | uCode revision type

      [  +0.000557] iwlwifi 0000:3a:00.0: 0x00000016 | uCode version major

      [  +0.000380] iwlwifi 0000:3a:00.0: 0x00058404 | uCode version minor

      [  +0.000378] iwlwifi 0000:3a:00.0: 0x00000230 | hw version

      [  +0.000380] iwlwifi 0000:3a:00.0: 0x00C89000 | board version

      [  +0.000378] iwlwifi 0000:3a:00.0: 0x0052019C | hcmd

      [  +0.000374] iwlwifi 0000:3a:00.0: 0x00022000 | isr0

      [  +0.000374] iwlwifi 0000:3a:00.0: 0x00800000 | isr1

      [  +0.000394] iwlwifi 0000:3a:00.0: 0x08001802 | isr2

      [  +0.000608] iwlwifi 0000:3a:00.0: 0x40400082 | isr3

      [  +0.000557] iwlwifi 0000:3a:00.0: 0x00000000 | isr4

      [  +0.000373] iwlwifi 0000:3a:00.0: 0x00800110 | last cmd Id

      [  +0.000372] iwlwifi 0000:3a:00.0: 0x00000000 | wait_event

      [  +0.000375] iwlwifi 0000:3a:00.0: 0x0000537E | l2p_control

      [  +0.000372] iwlwifi 0000:3a:00.0: 0x00000020 | l2p_duration

      [  +0.000374] iwlwifi 0000:3a:00.0: 0x00000000 | l2p_mhvalid

      [  +0.000372] iwlwifi 0000:3a:00.0: 0x00000030 | l2p_addr_match

      [  +0.000368] iwlwifi 0000:3a:00.0: 0x0000000D | lmpm_pmg_sel

      [  +0.000372] iwlwifi 0000:3a:00.0: 0x03071928 | timestamp

      [  +0.000586] iwlwifi 0000:3a:00.0: 0x00346070 | flow_handler

      [  +0.000640] iwlwifi 0000:3a:00.0: Start IWL Error Log Dump:

      [  +0.000357] iwlwifi 0000:3a:00.0: Status: 0x00000000, count: 7

      [  +0.000352] iwlwifi 0000:3a:00.0: 0x00000070 | ADVANCED_SYSASSERT

      [  +0.000352] iwlwifi 0000:3a:00.0: 0x00000000 | umac branchlink1

      [  +0.000349] iwlwifi 0000:3a:00.0: 0xC0082F64 | umac branchlink2

      [  +0.000341] iwlwifi 0000:3a:00.0: 0xC008940A | umac interruptlink1

      [  +0.000335] iwlwifi 0000:3a:00.0: 0xC0081000 | umac interruptlink2

      [  +0.000329] iwlwifi 0000:3a:00.0: 0x00000800 | umac data1

      [  +0.000343] iwlwifi 0000:3a:00.0: 0xC0081000 | umac data2

      [  +0.000516] iwlwifi 0000:3a:00.0: 0xDEADBEEF | umac data3

      [  +0.000515] iwlwifi 0000:3a:00.0: 0x00000016 | umac major

      [  +0.000420] iwlwifi 0000:3a:00.0: 0x00058404 | umac minor

      [  +0.000302] iwlwifi 0000:3a:00.0: 0xC0886280 | frame pointer

      [  +0.000309] iwlwifi 0000:3a:00.0: 0xC0886280 | stack pointer

      [  +0.000299] iwlwifi 0000:3a:00.0: 0x0052019C | last host cmd

      [  +0.000297] iwlwifi 0000:3a:00.0: 0x00000000 | isr status reg

      [  +0.493849] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000255] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.116124] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      [  +0.000264] iwlwifi 0000:3a:00.0: L1 Disabled - LTR Enabled

      pi@eon:[~]:

        • 1. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
          Intel Corporation
          This message was posted on behalf of Intel Corporation

          : Thank you very much for joining the Intel® NUC communities.

           

          We will start a research on this matter in order to try to find a possible solution for this problem. We will try to replicate the issue, as soon as I get any updates I wil post all the detail on this thread.

           

          Just to confirm, could you please provide the model of the memory RAM you are using and the model of the SSD/Hard drive?

           

          Any further questions, please let me know.

           

          Regards,

          Alberto R

          • 2. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
            cville

            Alberto,

             

            Thanks for your reply. Here's the info you requested:

            RAM: Kingston HyperX Impact - DDR4 [8Gb]

            SSD: Samsung SSD 960 EVO 250GB

            Hard disk: HGST Travelstar 7K1000 HTS721010A9E630

             

            Best regards, Alfred

            • 3. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
              Intel Corporation
              This message was posted on behalf of Intel Corporation

              : Thank you very much for providing that information. As soon as I get any updates I will let you know.
               
              Any questions, please let me know.
               
              Regards,
              Alberto R
               

              • 4. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                Intel Corporation
                This message was posted on behalf of Intel Corporation

                : I just received some updates on this matter. Remember that all the testings Intel did on the NUC were done using Windows, so all the information we have about this configuration is related to that operating system. Now we will do our best in order to provide further details.

                As an option you can always submit this same inquiry on the link below, in order to get further assistance:
                https://01.org/community

                Any questions, please let me know.

                Regards,
                Alberto R

                • 5. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                  cville

                  Alberto,

                   

                  I realize this system isn't running Windows. However, the reported problem seems to originate from the Intel supplied iwlwifi driver and firmware or possibly from the Intel supplied hardware. I would be helpful to know if the problem is more likely to be caused by software or by the hardware (in which case I will have to send the system back).

                   

                  Best regards, Alfred

                  • 6. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                    rguevara

                    Hi :

                     

                    I have been looking into this issue (I work for Intel Customer Support, same as Alberto) and I also tried to replicate it, I loaded Ubuntu 16.04.2 LTS as it is with no modifications and I am not experiencing this error message, my system has been only running for a couple of days though.

                    I also did some research about it and this issue was reported to the Linux Community as a bug, see the following article: kernel/git/firmware/linux-firmware.git - Repository of firmware blobs for use with the Linux kernel

                    In addition to that, I have to tell you that we have not officially validated Linux on NUC systems and we try to provide our best effort to help our customers but the support we offered may be limited, same thing goes for the Drivers, we don’t really provide drivers for Linux for NUC, however you are probably aware of Linux* Support for Intel® Wireless Adapters  - these packages are available for customers to try but we recommend to check with the Linux community if you need support.


                    I hope this helps,

                    Ronny G

                    • 7. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                      cville

                      Hi Ronny,

                       

                      Thanks for getting back to me. I have the following observations/questions after reading your message:

                      1. you're using Ubuntu 16.04 while I'm using Ubuntu 17.04. Is there any reason you installed one over the other?
                      2. the problem I reported seems to occur or occur much more frequently when using Bluetooth. I was pinging a BT device every minute until I stopped this practice 5 days ago because it was freezing the system. The system has been stable since then but this also means it's useless to me since I need the BT.
                      3. is there any chance this could be a hardware problem? I'm bringing this up since there's some evidence that a higher ambient temperature causes more crashes. How do I determine that this is not a hardware malfunction?

                       

                      Your thoughts are very welcome.

                       

                      Beste regards, Alfred

                      • 8. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                        cville

                        The following - seen for the first time - message appeared on July 26:

                         

                        [Jul26 11:18] iwlwifi 0000:3a:00.0: Getting the temperature timed out

                        • 9. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                          rguevara

                          Hi Alfred,

                           

                          See my answers below:

                           

                          1. You're using Ubuntu 16.04 while I'm using Ubuntu 17.04. Is there any reason you installed one over the other? No specific reason, it was the version I had available to test. I see no reason for version 17.04 to behave differently.

                           

                          2. The problem I reported seems to occur or occur much more frequently when using Bluetooth. I was pinging a BT device every minute until I stopped this practice 5 days ago because it was freezing the system. The system has been stable since then but this also means it's useless to me since I need the BT. I didnt test this combination thoroughly, meaning that I made it run and put a Bluetooth headset to work for a while and I had no issues but I didnt test it intensively, we offer very limited support for this combination as we dont validate Linux* Operating System on NUC. Can you tell me what is the usage model and if by any chance you are able to test Windows 10? I understand that this is probably not an option for you but it will tell us if we have a driver issue.

                           

                          3. Is there any chance this could be a hardware problem? I'm bringing this up since there's some evidence that a higher ambient temperature causes more crashes. How do I determine that this is not a hardware malfunction? I cant tell with the information we have, please refer to my comment above on question number 2

                           

                          Regards,

                          Ronny G

                           

                           

                          • 10. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                            cville

                            Hi Ronny,

                             

                            See my answers below:

                             

                            1. You're using Ubuntu 16.04 while I'm using Ubuntu 17.04. Is there any reason you installed one over the other? No specific reason, it was the version I had available to test. I see no reason for version 17.04 to behave differently.

                                I thought so too; thanks for the confirmation.

                             

                            2. The problem I reported seems to occur or occur much more frequently when using Bluetooth. I was pinging a BT device every minute until I stopped this practice 5 days ago because it was freezing the system. The system has been stable since then but this also means it's useless to me since I need the BT. I didnt test this combination thoroughly, meaning that I made it run and put a Bluetooth headset to work for a while and I had no issues but I didnt test it intensively, we offer very limited support for this combination as we dont validate Linux* Operating System on NUC. Can you tell me what is the usage model and if by any chance you are able to test Windows 10? I understand that this is probably not an option for you but it will tell us if we have a driver issue.

                             

                            • On Windows 10: unfortunately I don't have a license. Also it would be very difficult to create a comparable test environment.
                            • The primary usage model for this NUC is as follows. Currently I'm using 6 Raspberry Pi's to read sensors, capture PV (solar energy) and other energy data as well as some other home automation related tasks. Because Raspberry Pi storage is notoriously unreliable - SD cards - I want to replace the majority of the RPI's with the NUC.
                            • The current usage model and de-facto test environment is much more limited and has to remain so until the NUC has proven itself. I'm running SBFspot which queries the SMA PV inverter via Bluetooth ever 5 minutes and stores the resulting data in a database.  The active period is 10 seconds per 5 min with the BT connection terminating (by design) every 5 min.. Note that this is very different from connecting a BT headset to the NUC. The system is currently connected via Ethernet which means that BT is the only wireless networking component that gets actual use. The same application has been running flawlessly for over 3 years on an RPI. Unfortunately the RPI eats (as in physically destroys) 1 SD card every 6-9 months, resulting in data loss and the need to restore from backup to a fresh SD card, which is why I want to move to a NUC with its much more reliable SSD/HD storage. In addition to SBFspot, the system is running Monit (monitoring) and Mosquitto (MQTT broker).

                             

                            3. Is there any chance this could be a hardware problem? I'm bringing this up since there's some evidence that a higher ambient temperature causes more crashes. How do I determine that this is not a hardware malfunction? I cant tell with the information we have, please refer to my comment above on question number 2

                             

                            I hope this answers your questions.

                             

                            Best regards, Alfred

                            • 11. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                              cville

                              2017-08-07 - UPDATE with new diagnostic information

                               

                              After running SBFspot without problems for almost 2 weeks, I lost the Bluetooth connection again. This time I was very careful to capture as much info as possible before trying te fix the problem. This is what I found:

                              • The SBFspot log shows:

                              Mon Aug  7 12:05:18 2017: CRITICAL: bthConnect() returned -1

                              • DMESG: no BT related data
                              • The BT daemon was still running:

                              pi@eon:[~/smadata/log]: !1366

                              sudo service bluetooth status

                              ● bluetooth.service - Bluetooth service

                                 Loaded: loaded (/lib/systemd/system/bluetooth.service; enabled; vendor preset: enabled)

                                 Active: active (running)

                                   Docs: man:bluetoothd(8)

                              Main PID: 1146 (bluetoothd)

                                 Status: "Running"

                                  Tasks: 1 (limit: 4915)

                                 Memory: 1.9M

                                    CPU: 5.445s

                                 CGroup: /system.slice/bluetooth.service

                                         └─1146 /usr/lib/bluetooth/bluetoothd --noplugin=sap

                              • BT was not blocked:

                              pi@eon:[~/scripts]: rfkill list

                              0: hci0: Bluetooth

                                      Soft blocked: no

                                      Hard blocked: no

                              1: phy0: Wireless LAN

                                      Soft blocked: no

                                      Hard blocked: no

                              pi@eon:[~/scripts]:

                              • bluetoothctl on the NUC shows:

                              pi@eon:[~/scripts]: sudo bluetoothctl

                              [NEW] Controller F8:63:3F:06:8A:14 eon [default]

                              • whereas a Raspberry Pi running the same software shows:

                              pi@zero:[~]: bluetoothctl

                              [NEW] Controller B8:27:EB:27:B9:80 zero [default]

                              [NEW] Device 00:80:25:A6:1C:A2 SMA003d SN: XXX

                              [CHG] Device 00:80:25:A6:1C:A2 Connected: no

                              [DEL] Device 00:80:25:A6:1C:A2 SMA003d SN: XXX

                              [bluetooth]#

                              • lsmod on the NUC:

                              bluetooth             557056  31 btrtl,hci_uart,btintel,btqca,bnep,btbcm,rfcomm,btusb

                              • And on Raspberry Pi:

                              bluetooth             374820  25 hci_uart,bnep,btbcm,rfcomm


                              This command fixed it:

                              sudo hciconfig hci0 up

                               

                              Apparently for unknown reasons the BT link goes down, leaving the BT daemon running. I can't find any traces of this in the syslog. I'd be happy to look somewhere else for additional information if someone tells me where to go.

                              • 12. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                                cville

                                I'm now convinced this problem is caused by the Bluetooth driver. Here's why:

                                • before every download from the PV inverter via Bluetooth I bring the BT link down and up again:

                                sudo /bin/hciconfig hci0 down

                                sudo /bin/hciconfig hci0 up

                                • with this (ugly) workaround the system has been stable, without a single crash, for almost 8 weeks.

                                 

                                Since bringing the BT link down and up again every 5min. makes the problem go away, I conclude that it's most likely that the driver is at fault. Does anyone agree? Should I report this apparent bug elsewhere?

                                • 13. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                                  alfinbi

                                  the problem occurs probably because you use the microcode as module that usually reside in /lib/firmware and you have many version of single device. if it is the case, just pick 1 version for your specific device and delete the others iwlwifi microcode. these boot messages indicate that you have many versions of your 8265 device :

                                   

                                  [  +0.004545] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-26.ucode failed with error -2

                                  [  +0.000019] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-25.ucode failed with error -2

                                  [  +0.000102] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-24.ucode failed with error -2

                                  [  +0.000013] iwlwifi 0000:3a:00.0: Direct firmware load for iwlwifi-8265-23.ucode failed with error -2

                                  • 14. Re: NUC7I3BNH freezes - iwlwifi Microcode SW error detected
                                    cville

                                    This is supposed to be the way things normally work. Each driver is test until the correct device is found.

                                    1 2 Previous Next