I’ve filed a bug report for this (request 38426) issue but am posting here for visibility in case anyone else has experienced anything similar.
We are seeing multiple of our ML2 devices spontaneously rebooting. It has been observed on the latest OS 1.12.0 as well as 1.11.0 for sure. I’m currently rolling one of the devices back to 1.10.0 to see if it is occurring there as well.
Anecdotally it is happening some time around when the device is losing / regaining head pose tracking but it’s not a direct reliably reproducible trigger. It could also just be encouraging the issue to happen sooner.
Bug report logs extracted after the reboot reveal a kernel panic related to the amd gpu driver.
I have attached a copy of one of these logs.
Please advise!
kpanic_dmesg_logs.txt (238.9 KB)
Do you notice this issue when running a specific application or service? Have you found that the device need to loose and regain head pose a few times within a short period for it to occur more frequently - or is loosing and regaining head pose once typically enough? Are you using an application that is resource intensive ?
We primarily notice this when running our own app just due to the fact that we are testing our own app most of the time, however one of our engineers has observed the reboot behavior once when running the Magic Leap provided Spaces app (our app was not running in the background).
Generally it seems that head pose needs to be lost and regained multiple times. Once is never enough to trigger this behavior.
One more bit of info for you is that we typically set our battery setting to Maximum Performance and we disable the Standby modes. Other than that, we are using the ML2 devices with default settings.
Final clues: we observed this issue sporadically when testing in our primarily indoors office environment, but then a week ago we had a demo at a client in a semi-outdoor space where this rebooting issue was encountered multiple times on multiple devices through the day. Ambient temperature was in the 60s F.
1 Like
[Part of same team]
I got it to occur by covering the cameras for 5 seconds and then uncovering. Did this multiple times while logging memory usage through adb. It didn’t reboot until several minutes later when I was entering the info and was wearing the device idle. Available system memory stayed stable at about 3.5gb. The reboots seem linked to a kernel panic with gpu driver. My case was something about cpu #6 accessing memory it couldn’t or something to that effect. System temperatures averaged around 32-55. 60 max for some items using our app.
1 Like
Got two reboots today. Our app was running and idling on desk. I don’t remember if the second reboot was running our app.
Side note: I had crashes on my pc with the newest intel cpu when doing some ai tasks. It was related to one cpu core trying to access memory in another and the bios blocks that. A firmware update (when it came out) fixed that. I don’t know if it has any relation to this situation though.
panic log snippet:
<4>[ 2260.367474] audit: audit_lost=332222 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2260.367480] audit: rate limit exceeded
<4>[ 2261.370940] audit: audit_lost=332457 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2261.370958] audit: rate limit exceeded
<4>[ 2262.380239] audit: audit_lost=332694 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2262.380243] audit: rate limit exceeded
<4>[ 2263.388806] audit: audit_lost=332931 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2263.388811] audit: rate limit exceeded
<4>[ 2264.400957] audit: audit_lost=333168 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2264.400962] audit: rate limit exceeded
<4>[ 2265.409705] audit: audit_lost=333405 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2265.409709] audit: rate limit exceeded
<4>[ 2266.421424] audit: audit_lost=333642 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2266.421429] audit: rate limit exceeded
<4>[ 2267.430551] audit: audit_lost=333879 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2267.430554] audit: rate limit exceeded
<4>[ 2268.441467] audit: audit_lost=334116 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2268.441472] audit: rate limit exceeded
<4>[ 2269.450730] audit: audit_lost=334353 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2269.450733] audit: rate limit exceeded
<4>[ 2270.462211] audit: audit_lost=334590 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2270.462219] audit: rate limit exceeded
<4>[ 2271.463358] audit: audit_lost=334825 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2271.463361] audit: rate limit exceeded
<4>[ 2272.471891] audit: audit_lost=335062 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2272.471895] audit: rate limit exceeded
<4>[ 2273.483607] audit: audit_lost=335299 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2273.483611] audit: rate limit exceeded
<4>[ 2274.491732] audit: audit_lost=335536 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2274.491736] audit: rate limit exceeded
<7>[ 2274.574719] mlmux-power MXPM1111:00: num status msgs received in last 20000 ms: 20
<4>[ 2275.495608] audit: audit_lost=335771 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2275.495611] audit: rate limit exceeded
<4>[ 2276.505798] audit: audit_lost=336008 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2276.505837] audit: rate limit exceeded
<4>[ 2277.507896] audit: audit_lost=336243 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2277.507899] audit: rate limit exceeded
<4>[ 2278.514763] audit: audit_lost=336480 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2278.514766] audit: rate limit exceeded
<4>[ 2279.528467] audit: audit_lost=336717 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2279.528471] audit: rate limit exceeded
<4>[ 2280.537660] audit: audit_lost=336954 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2280.537664] audit: rate limit exceeded
<4>[ 2281.539170] audit: audit_lost=337189 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2281.539173] audit: rate limit exceeded
<4>[ 2282.550565] audit: audit_lost=337426 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2282.550569] audit: rate limit exceeded
<4>[ 2283.552536] audit: audit_lost=337657 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2283.552539] audit: rate limit exceeded
<4>[ 2284.560909] audit: audit_lost=337894 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2284.560914] audit: rate limit exceeded
<4>[ 2285.564870] audit: audit_lost=338129 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2285.564874] audit: rate limit exceeded
<12>[ 2286.554195] healthd: battery l=100 v=8785 t=31.9 h=2 st=5 c=0 fc=5409000 cc=33 chg=a
<4>[ 2286.575359] audit: audit_lost=338366 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2286.575363] audit: rate limit exceeded
<4>[ 2287.582112] audit: audit_lost=338603 audit_rate_limit=5 audit_backlog_limit=64
<6>[ 5401.522532] mero_notifier: PID(3368) signal_cross_core: xpc_notification_send failed, xpc_target_mask=0x100, errno=-11
<6>[ 5401.534550] mero_notifier: PID(3368) signal_cross_core: xpc_notification_send failed, xpc_target_mask=0x100, errno=-11
<6>[ 5401.561594] mero_notifier: PID(3368) signal_cross_core: xpc_notification_send failed, xpc_target_mask=0x100, errno=-11
<6>[ 5401.599294] mero_notifier: PID(6038) signal_cross_core: xpc_notification_send failed, xpc_target_mask=0x100, errno=-11
<4>[ 5401.735620] RIP: 0010:0xffffb5b60020ccd0
<4>[ 5401.735627] Code: ff ff 00 b5 4f 33 ea ec ac 56 00 a8 01 e7 4d 8e ff ff fd ff ff ff 00 00 00 00 38 a9 01 e7 4d 8e ff ff 00 48 bf e0 4d 8e ff ff <01> 00 00 00 00 00 00 00 50 cd 20 00 b6 b5 ff ff 88 d3 29 c0 ff ff
<4>[ 5401.735632] RSP: 0018:ffffb5b6001e0ea8 EFLAGS: 00010282
<4>[ 5401.735639] RAX: ffff8e4dde4dad10 RBX: 0000000000000000 RCX: ffff8e4dde4daa10
<4>[ 5401.735647] RDX: 0000000000000101 RSI: 0000000000000000 RDI: ffff8e4dde4dad10
<4>[ 5401.735652] RBP: ffffb5b6001e0f50 R08: ffff8e4b086a09c0 R09: ffff8e4af2f77540
<4>[ 5401.735657] R10: ffff8e4af2f77240 R11: ffffb5b60020ccd0 R12: ffff8e4df317c1c0
<4>[ 5401.735664] R13: ffff8e4df56e9650 R14: ffff8e4df56e9600 R15: 000000000000000a
<4>[ 5401.735669] FS: 0000000000000000(0000) GS:ffff8e4df56c0000(0000) knlGS:0000000000000000
<4>[ 5401.735678] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 5401.735684] CR2: ffffb5b60020ccd0 CR3: 000000041cc16000 CR4: 0000000000340ee0
<0>[ 5401.735692] Kernel panic - not syncing: Fatal exception in interrupt
<0>[ 5401.736004] Kernel Offset: 0x2f400000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
<3>[ 2287.582115] audit: rate limit exceeded
<4>[ 2288.585999] audit: audit_lost=338838 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2288.586003] audit: rate limit exceeded
<4>[ 2289.588681] audit: audit_lost=339073 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2289.588684] audit: rate limit exceeded
<4>[ 2290.601208] audit: audit_lost=339310 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2290.601213] audit: rate limit exceeded
<4>[ 2291.604984] audit: audit_lost=339545 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2291.604989] audit: rate limit exceeded
<4>[ 2292.609621] audit: audit_lost=339782 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2292.609625] audit: rate limit exceeded
<4>[ 2293.622113] audit: audit_lost=340019 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2293.622116] audit: rate limit exceeded
<4>[ 2294.623134] audit: audit_lost=340254 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2294.623140] audit: rate limit exceeded
<7>[ 2295.054738] mlmux-power MXPM1111:00: num status msgs received in last 20000 ms: 21
<4>[ 2295.632153] audit: audit_lost=340491 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2295.632158] audit: rate limit exceeded
<4>[ 2296.643418] audit: audit_lost=340728 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2296.643422] audit: rate limit exceeded
<4>[ 2297.650884] audit: audit_lost=340965 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2297.650887] audit: rate limit exceeded
<4>[ 2298.664680] audit: audit_lost=341202 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2298.664696] audit: rate limit exceeded
<4>[ 2299.672377] audit: audit_lost=341439 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2299.672382] audit: rate limit exceeded
<4>[ 2300.684332] audit: audit_lost=341676 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2300.684336] audit: rate limit exceeded
<4>[ 2301.693046] audit: audit_lost=341913 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2301.693049] audit: rate limit exceeded
<4>[ 2302.695058] audit: audit_lost=342148 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2302.695061] audit: rate limit exceeded
<4>[ 2303.706877] audit: audit_lost=342385 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2303.706880] audit: rate limit exceeded
<4>[ 2304.714457] audit: audit_lost=342622 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2304.714460] audit: rate limit exceeded
<4>[ 2305.715576] audit: audit_lost=342857 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2305.715580] audit: rate limit exceeded
<4>[ 2306.719340] audit: audit_lost=343092 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2306.719342] audit: rate limit exceeded
<4>[ 2307.731312] audit: audit_lost=343329 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2307.731316] audit: rate limit exceeded
<4>[ 2308.740113] audit: audit_lost=343566 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2308.740116] audit: rate limit exceeded
<4>[ 2309.751518] audit: audit_lost=343803 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2309.751521] audit: rate limit exceeded
<4>[ 2310.759803] audit: audit_lost=344040 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2310.759806] audit: rate limit exceeded
<4>[ 2311.772454] audit: audit_lost=344277 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2311.772457] audit: rate limit exceeded
<4>[ 2312.779727] audit: audit_lost=344514 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2312.779731] audit: rate limit exceeded
<4>[ 2313.792859] audit: audit_lost=344751 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2313.792863] audit: rate limit exceeded
<4>[ 2314.801646] audit: audit_lost=344988 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2314.801649] audit: rate limit exceeded
<7>[ 2315.534764] mlmux-power MXPM1111:00: num status msgs received in last 20000 ms: 20
<4>[ 2315.813083] audit: audit_lost=345221 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2315.813088] audit: rate limit exceeded
<4>[ 2316.820692] audit: audit_lost=345458 audit_rate_limit=5 audit_backlog_limit=64
<3>[ 2316.820695] audit: rate limit exceeded
<4>[ 2317.833592] audit: audit_lost=345695 audit_rate_limit=5 audit_backlog_limit=64```