GL-AX1800 Flint Crashing / Rebooting in less than a day

I have recently purchased a Flint router. I have noticed it reboot unexpectedly each of the last few days. The last crash dump I have included below. It is running the latest firmware 4.6.8.

From the crash dump, is there any way to determine what is going wrong? Are these routers generally stable for many days / weeks under normal use? I don't think the router is under heavy load. VPN server is enabled by nothing using it. SMB / DNLA enabled, but not used at the time.

Oops#1 Part1
<4>[52637.119814] CPU: 2 PID: 0 Comm: swapper/2 Not tainted 4.4.60 #0
<4>[52637.120168] Hardware name: Generic DT based system
<4>[52637.125897] task: 9e45b600 ti: 9e48c000 task.ti: 9e48c000
<4>[52637.130808] PC is at trace_xhci_dbg_ring_expansion+0x57c/0xe3c [xhci_hcd]
<4>[52637.136246] LR is at xhci_stop_endpoint_command_watchdog+0x16c/0x288 [xhci_hcd]
<4>[52637.143003] pc : [<7f0eb7bc>]    lr : [<7f0ec4f0>]    psr: 80000193
<4>[52637.143003] sp : 9e48de70  ip : 9e48de84  fp : 9d50926c
<4>[52637.150127] r10: 00000074  r9 : 00000001  r8 : 9d4ed0e8
<4>[52637.161574] r7 : 9d509160  r6 : 00000020  r5 : 00000000  r4 : 00000001
<4>[52637.166784] r3 : 40000000  r2 : 9d412c00  r1 : 00000000  r0 : 9d509160
<4>[52637.173382] Flags: Nzcv  IRQs off  FIQs on  Mode SVC_32  ISA ARM  Segment kernel
<4>[52637.179894] Control: 10c0383d  Table: 5082006a  DAC: 00000055
<0>[52637.187443] Process swapper/2 (pid: 0, stack limit = 0x9e48c210)
<0>[52637.193086] Stack: (0x9e48de70 to 0x9e48e000)
<0>[52637.199161] de60:                                     00000001 9d509160 00000002 9d4ed000
<0>[52637.203423] de80: 9d4ed0e8 7f0ec4f0 00000002 00000001 9d509190 60000113 82e1263e 9d4ed11c
<0>[52637.211583] dea0: 00000101 7f0ec384 9d4ed0fc 00000000 00000200 80836100 80836080 8007ae58
<0>[52637.219742] dec0: 00000000 00000000 9d4ed11c 9eb55380 7f0ec384 8007b054 808364a8 9d5091e0
<0>[52637.227901] dee0: 00000020 00000001 80836084 80836080 40000001 9e48c000 00000101 8002e63c
<0>[52637.236061] df00: 00000001 8083026c 9e48df00 80889900 0000000a 004fdc27 80836100 00200040
<0>[52637.244221] df20: 808364a8 8083026c 00000000 00000000 00000001 9e408000 9f803000 9e48dfe0
<0>[52637.252381] df40: 808364a8 8002ea50 8083026c 8006ca4c 80856038 808367ec 9f80200c 9e48df88
<0>[52637.260540] df60: 9f802000 80009390 80019398 60000013 ffffffff 9e48dfbc 00000000 8082f30c
<0>[52637.268701] df80: 9e48dfe0 8000a4c0 00000001 00000000 00000000 8000b400 9e48c000 808364a0
<0>[52637.276860] dfa0: 806d2f48 00000000 00000000 8082f30c 9e48dfe0 808364a8 00000000 9e48dfd8
<0>[52637.285020] dfc0: 80019394 80019398 60000013 ffffffff 9e48c000 806d2f48 9e48c000 80063664
<0>[52637.293180] dfe0: 808313d8 8083a552 5f46c06a 808892e8 00000000 4100946c dffbfbf9 d7bfbc5f
<4>[52637.301399] [<7f0eb7bc>] (trace_xhci_dbg_ring_expansion [xhci_hcd]) from [<7f0ec4f0>] (xhci_stop_endpoint_command_watchdog+0x16c/0x288 [xhci_hcd])
<4>[52637.309533] [<7f0ec4f0>] (xhci_stop_endpoint_command_watchdog [xhci_hcd]) from [<8007ae58>] (call_timer_fn+0x24/0x94)
<4>[52637.322529] [<8007ae58>] (call_timer_fn) from [<8007b054>] (run_timer_softirq+0x18c/0x1d8)
<4>[52637.333199] [<8007b054>] (run_timer_softirq) from [<8002e63c>] (__do_softirq+0xe4/0x238)
<4>[52637.341357] [<8002e63c>] (__do_softirq) from [<8002ea50>] (irq_exit+0xac/0x11c)
<4>[52637.349603] [<8002ea50>] (irq_exit) from [<8006ca4c>] (__handle_domain_irq+0x9c/0xac)
<4>[52637.356636] [<8006ca4c>] (__handle_domain_irq) from [<80009390>] (gic_handle_irq+0x54/0x94)
<4>[52637.364622] [<80009390>] (gic_handle_irq) from [<8000a4c0>] (__irq_svc+0x40/0x74)
<4>[52637.372776] Exception stack(0x9e48df88 to 0x9e48dfd0)
<4>[52637.380412] df80:                   00000001 00000000 00000000 8000b400 9e48c000 808364a0
<4>[52637.385453] dfa0: 806d2f48 00000000 00000000 8082f30c 9e48dfe0 808364a8 00000000 9e48dfd8
<4>[52637.393612] dfc0: 80019394 80019398 60000013 ffffffff
<4>[52637.401769] [<8000a4c0>] (__irq_svc) from [<80019398>] (arch_cpu_idle+0x34/0x50)
<4>[52637.406809] [<80019398>] (arch_cpu_idle) from [<80063664>] (cpu_startup_entry+0x154/0x228)
<4>[52637.414274] [<80063664>] (cpu_startup_entry) from [<4100946c>] (0x4100946c)
<0>[52637.422344] Code: e58e4000 e583c008 e583c00c ebffffbe (e5953020) 
<4>[52637.429206] ---[ end trace a36400923af0dd0e ]---


Panic#2 Part1
 ip_set_bitmap_ip ip_set nfnetlink ip6t_NPT ip6t_MASQUERADE nf_nat_masquerade_ipv6 nf_nat nf_conntrack nf_log_ipv6 nf_log_common ip6table_mangle ip6table_filter ip6_tables ip6t_REJECT x_tables nf_reject_ipv6 bonding ifb ip6_udp_tunnel udp_tunnel qca_nss_drv qca_nss_dp udp_diag tcp_diag raw_diag inet_diag tun qca_ssdk vfat fat ntfs nls_utf8 nls_iso8859_1 nls_cp437 mtdoops uas usb_storage xhci_plat_hcd xhci_pci xhci_hcd dwc3 dwc3_qcom dbm uhci_hcd ohci_platform ohci_hcd phy_msm_ssusb_qmp phy_msm_qusb ehci_platform sd_mod scsi_mod kmwan ehci_hcd gpio_button_hotplug gl_sdk4_tertf gl_sdk4_black_white_list usbcore nls_base usb_common mii gl_sdk4_hw_info
<4>[52637.119814] CPU: 2 PID: 0 Comm: swapper/2 Not tainted 4.4.60 #0
<4>[52637.120168] Hardware name: Generic DT based system
<4>[52637.125897] task: 9e45b600 ti: 9e48c000 task.ti: 9e48c000
<4>[52637.130808] PC is at trace_xhci_dbg_ring_expansion+0x57c/0xe3c [xhci_hcd]
<4>[52637.136246] LR is at xhci_stop_endpoint_command_watchdog+0x16c/0x288 [xhci_hcd]
<4>[52637.143003] pc : [<7f0eb7bc>]    lr : [<7f0ec4f0>]    psr: 80000193
<4>[52637.143003] sp : 9e48de70  ip : 9e48de84  fp : 9d50926c
<4>[52637.150127] r10: 00000074  r9 : 00000001  r8 : 9d4ed0e8
<4>[52637.161574] r7 : 9d509160  r6 : 00000020  r5 : 00000000  r4 : 00000001
<4>[52637.166784] r3 : 40000000  r2 : 9d412c00  r1 : 00000000  r0 : 9d509160
<4>[52637.173382] Flags: Nzcv  IRQs off  FIQs on  Mode SVC_32  ISA ARM  Segment kernel
<4>[52637.179894] Control: 10c0383d  Table: 5082006a  DAC: 00000055
<0>[52637.187443] Process swapper/2 (pid: 0, stack limit = 0x9e48c210)
<0>[52637.193086] Stack: (0x9e48de70 to 0x9e48e000)
<0>[52637.199161] de60:                                     00000001 9d509160 00000002 9d4ed000
<0>[52637.203423] de80: 9d4ed0e8 7f0ec4f0 00000002 00000001 9d509190 60000113 82e1263e 9d4ed11c
<0>[52637.211583] dea0: 00000101 7f0ec384 9d4ed0fc 00000000 00000200 80836100 80836080 8007ae58
<0>[52637.219742] dec0: 00000000 00000000 9d4ed11c 9eb55380 7f0ec384 8007b054 808364a8 9d5091e0
<0>[52637.227901] dee0: 00000020 00000001 80836084 80836080 40000001 9e48c000 00000101 8002e63c
<0>[52637.236061] df00: 00000001 8083026c 9e48df00 80889900 0000000a 004fdc27 80836100 00200040
<0>[52637.244221] df20: 808364a8 8083026c 00000000 00000000 00000001 9e408000 9f803000 9e48dfe0
<0>[52637.252381] df40: 808364a8 8002ea50 8083026c 8006ca4c 80856038 808367ec 9f80200c 9e48df88
<0>[52637.260540] df60: 9f802000 80009390 80019398 60000013 ffffffff 9e48dfbc 00000000 8082f30c
<0>[52637.268701] df80: 9e48dfe0 8000a4c0 00000001 00000000 00000000 8000b400 9e48c000 808364a0
<0>[52637.276860] dfa0: 806d2f48 00000000 00000000 8082f30c 9e48dfe0 808364a8 00000000 9e48dfd8
<0>[52637.285020] dfc0: 80019394 80019398 60000013 ffffffff 9e48c000 806d2f48 9e48c000 80063664
<0>[52637.293180] dfe0: 808313d8 8083a552 5f46c06a 808892e8 00000000 4100946c dffbfbf9 d7bfbc5f
<4>[52637.301399] [<7f0eb7bc>] (trace_xhci_dbg_ring_expansion [xhci_hcd]) from [<7f0ec4f0>] (xhci_stop_endpoint_command_watchdog+0x16c/0x288 [xhci_hcd])
<4>[52637.309533] [<7f0ec4f0>] (xhci_stop_endpoint_command_watchdog [xhci_hcd]) from [<8007ae58>] (call_timer_fn+0x24/0x94)
<4>[52637.322529] [<8007ae58>] (call_timer_fn) from [<8007b054>] (run_timer_softirq+0x18c/0x1d8)
<4>[52637.333199] [<8007b054>] (run_timer_softirq) from [<8002e63c>] (__do_softirq+0xe4/0x238)
<4>[52637.341357] [<8002e63c>] (__do_softirq) from [<8002ea50>] (irq_exit+0xac/0x11c)
<4>[52637.349603] [<8002ea50>] (irq_exit) from [<8006ca4c>] (__handle_domain_irq+0x9c/0xac)
<4>[52637.356636] [<8006ca4c>] (__handle_domain_irq) from [<80009390>] (gic_handle_irq+0x54/0x94)
<4>[52637.364622] [<80009390>] (gic_handle_irq) from [<8000a4c0>] (__irq_svc+0x40/0x74)
<4>[52637.372776] Exception stack(0x9e48df88 to 0x9e48dfd0)
<4>[52637.380412] df80:                   00000001 00000000 00000000 8000b400 9e48c000 808364a0
<4>[52637.385453] dfa0: 806d2f48 00000000 00000000 8082f30c 9e48dfe0 808364a8 00000000 9e48dfd8
<4>[52637.393612] dfc0: 80019394 80019398 60000013 ffffffff
<4>[52637.401769] [<8000a4c0>] (__irq_svc) from [<80019398>] (arch_cpu_idle+0x34/0x50)
<4>[52637.406809] [<80019398>] (arch_cpu_idle) from [<80063664>] (cpu_startup_entry+0x154/0x228)
<4>[52637.414274] [<80063664>] (cpu_startup_entry) from [<4100946c>] (0x4100946c)
<0>[52637.422344] Code: e58e4000 e583c008 e583c00c ebffffbe (e5953020) 
<4>[52637.429206] ---[ end trace a36400923af0dd0e ]---
<0>[52637.438006] Kernel panic - not syncing: Fatal exception in interrupt
<2>[52637.440139] CPU1: stopping
<4>[52637.446469] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G      D         4.4.60 #0
<4>[52637.448989] Hardware name: Generic DT based system
<4>[52637.456199] [<8001f784>] (unwind_backtrace) from [<8001bad4>] (show_stack+0x10/0x14)
<4>[52637.460891] [<8001bad4>] (show_stack) from [<80260c10>] (dump_stack+0x8c/0xa0)
<4>[52637.468788] [<80260c10>] (dump_stack) from [<8001e6e0>] (handle_IPI+0x104/0x190)
<4>[52637.475819] [<8001e6e0>] (handle_IPI) from [<800093bc>] (gic_handle_irq+0x80/0x94)
<4>[52637.483370] [<800093bc>] (gic_handle_irq) from [<8000a4c0>] (__irq_svc+0x40/0x74)
<4>[52637.490745] Exception stack(0x9e48bf88 to 0x9e48bfd0)
<4>[52637.498295] bf80:                   00000001 00000000 00000000 8000b400 9e48a000 808364a0
<4>[52637.503336] bfa0: 806d2f48 00000000 00000000 8082f30c 9e48bfe0 808364a8 60000013 9e48bfd8
<4>[52637.511493] bfc0: 80019394 80019398 60000013 ffffffff
<4>[52637.519650] [<8000a4c0>] (__irq_svc) from [<80019398>] (arch_cpu_idle+0x34/0x50)
<4>[52637.524690] [<80019398>] (arch_cpu_idle) from [<80063664>] (cpu_startup_entry+0x154/0x228)
<4>[52637.532156] [<80063664>] (cpu_startup_entry) from [<4100946c>] (0x4100946c)
<2>[52637.540223] CPU0: stopping
<4>[52637.547075] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G      D         4.4.60 #0
<4>[52637.549859] Hardware name: Generic DT based system
<4>[52637.557065] [<8001f784>] (unwind_backtrace) from [<8001bad4>] (show_stack+0x10/0x14)
<4>[52637.561756] [<8001bad4>] (show_stack) from [<80260c10>] (dump_stack+0x8c/0xa0)
<4>[52637.569655] [<80260c10>] (dump_stack) from [<8001e6e0>] (handle_IPI+0x104/0x190)
<4>[52637.576687] [<8001e6e0>] (handle_IPI) from [<800093bc>] (gic_handle_irq+0x80/0x94)
<4>[52637.584238] [<800093bc>] (gic_handle_irq) from [<8000a4c0>] (__irq_svc+0x40/0x74)
<4>[52637.591612] Exception stack(0x80835f50 to 0x80835f98)
<4>[52637.599161] 5f40:                                     00000001 00000000 00000000 8000b400
<4>[52637.604204] 5f60: 80834000 808364a0 806d2f48 00000000 00000000 8082f30c 80835fa8 808364a8
<4>[52637.612362] 5f80: 00000000 80835fa0 80019394 80019398 60000013 ffffffff
<4>[52637.620520] [<8000a4c0>] (__irq_svc) from [<80019398>] (arch_cpu_idle+0x34/0x50)
<4>[52637.626947] [<80019398>] (arch_cpu_idle) from [<80063664>] (cpu_startup_entry+0x154/0x228)
<4>[52637.634589] [<80063664>] (cpu_startup_entry) from [<807f7d28>] (start_kernel+0x430/0x43c)
<2>[52637.642656] CPU3: stopping
<4>[52637.650894] CPU: 3 PID: 0 Comm: swapper/3 Tainted: G      D         4.4.60 #0
<4>[52637.653504] Hardware name: Generic DT based system
<4>[52637.660710] [<8001f784>] (unwind_backtrace) from [<8001bad4>] (show_stack+0x10/0x14)
<4>[52637.665402] [<8001bad4>] (show_stack) from [<80260c10>] (dump_stack+0x8c/0xa0)
<4>[52637.673300] [<80260c10>] (dump_stack) from [<8001e6e0>] (handle_IPI+0x104/0x190)
<4>[52637.680332] [<8001e6e0>] (handle_IPI) from [<800093bc>] (gic_handle_irq+0x80/0x94)
<4>[52637.687883] [<800093bc>] (gic_handle_irq) from [<8000a4c0>] (__irq_svc+0x40/0x74)
<4>[52637.695259] Exception stack(0x9e48ff88 to 0x9e48ffd0)
<4>[52637.702808] ff80:                   00000001 00000000 00000000 8000b400 9e48e000 808364a0
<4>[52637.707849] ffa0: 806d2f48 00000000 00000000 8082f30c 9e48ffe0 808364a8 9eb60f01 9e48ffd8
<4>[52637.716007] ffc0: 80019394 80019398 60000013 ffffffff
<4>[52637.724163] [<8000a4c0>] (__irq_svc) from [<80019398>] (arch_cpu_idle+0x34/0x50)
<4>[52637.729203] [<80019398>] (arch_cpu_idle) from [<80063664>] (cpu_startup_entry+0x154/0x228)
<4>[52637.736669] [<80063664>] (cpu_startup_entry) from [<4100946c>] (0x4100946c)
<0>[52637.744742] The reading for sensor 4 is 0x0020423b
<0>[52637.751639] Couldn't get reading for sensor 5
<0>[52637.756453] The reading for sensor 7 is 0x0020723b
<0>[52637.760879] The reading for sensor 8 is 0x00208238
<0>[52637.765567] The reading for sensor 13 is 0x0022d245
<0>[52637.770341] The reading for sensor 14 is 0x0020e238
<0>[52637.775164] Couldn't get reading for sensor 15

ChatGPT says this may be related to USB.

It looks like you're dealing with a kernel panic report from a Linux-based system, specifically related to USB handling, given the references to the xhci_hcd driver (which is for USB 3.0). The logs show that the kernel encountered a fatal exception in the interrupt handler, leading to the panic.

Key Points:

  1. Crash Location: The panic occurs in the trace_xhci_dbg_ring_expansion function, suggesting issues with USB endpoint handling.
  2. Panic Cause: The message indicates a "Fatal exception in interrupt," which often implies a critical issue with interrupt handling or driver misbehavior.
  3. System State: The stack trace shows various function calls leading up to the crash, with a focus on softirq and timer functions.

@genki Thanks for that - I do have a NVME USB drive connected. I wonder if the drive could have gone to sleep and then the router tries to access it for something and it panics if it doesn't waken in time. I will keep and eye on it and try running without the drive connected for a while.

I managed to lunk this crash to a samba share. I think the USB drive is asleep or in a low power state, as the light on it is off.

Then I start my laptop (Ubuntu) which had the SMB share open. It tries to connect to it and the router immediately crashes / reboots.

It would be an interesting test to keep the drive awake, perhaps with a cron task to read from it every minute or so and see if the crash still occurs when the laptop is started. I have not tried that yet.