There is a nebulous problem that has been impacting some of our recent devices. Through a lot of experimentation, it seems there is a relationship between capturing images from an OAK-D Pro POE camera and the computer freezing. The failures are not very reproducible so its hard to create a failure case.
The computer has been consistently freezing after 1-12 hours of operation. If i disable the link to the camera (and stop any software related to the camera), the computer stays up indefinitely. However, when the computer freezes, there aren't many clues to why its freezing.
Oct 31 12:07:04 SCN-011 kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
Oct 31 12:07:04 SCN-011 kernel: CPU: 3 PID: 1015 Comm: EventRead00Thr Not tainted 5.15.0-73-lowlatency #80-Ubuntu
Oct 31 12:07:04 SCN-011 kernel: Hardware name: SYSTEM_MANUFACTURER SYSTEM_PRODUCT_NAME/Default string, BIOS 5.19 12/28/2023
Oct 31 12:07:04 SCN-011 kernel: RIP: 0010:__skb_datagram_iter+0x1a9/0x2f0
Oct 31 12:07:04 SCN-011 kernel: Code: c6 75 53 48 29 d6 48 8b 55 10 48 01 f7 4c 89 c6 48 01 cf 48 8b 4d b8 e8 15 fe ff ff 44 8b 5d d0 41 01 c4 44 39 f0 75 59 29 c3 <0e> 84 ee fe ff ff 48 8b 55 a8 8b 82 bc 00 00>
Oct 31 12:07:04 SCN-011 kernel: RSP: 0018:ffffbc6681197ae0 EFLAGS: 00010206
Oct 31 12:07:04 SCN-011 kernel: RAX: 0000000000000400 RBX: 0000000000001c00 RCX: 00000000000b6d43
Oct 31 12:07:04 SCN-011 kernel: RDX: 0000000000000800 RSI: ffffbc6681197d48 RDI: ffffbc6681197d48
Oct 31 12:07:04 SCN-011 kernel: RBP: ffffbc6681197b40 R08: 0000000000000400 R09: ffffffffb82e8f30
Oct 31 12:07:04 SCN-011 kernel: R10: 0000000000000000 R11: 0000000000000800 R12: 0000000000000800
Oct 31 12:07:04 SCN-011 kernel: R13: 0000000000000400 R14: 0000000000000400 R15: 0000000000000000
Oct 31 12:07:04 SCN-011 kernel: FS: 00007f20de92a640(0000) GS:ffff97d677f80000(0000) knlGS:0000000000000000
Oct 31 12:07:04 SCN-011 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 31 12:07:04 SCN-011 kernel: CR2: 00007fcaf62d9c68 CR3: 0000000106c42000 CR4: 0000000000350ee0
Oct 31 12:07:04 SCN-011 kernel: Call Trace:
Oct 31 12:07:04 SCN-011 kernel: <TASK>
Oct 31 12:07:04 SCN-011 kernel: ? receiver_wake_function+0x30/0x30
Oct 31 12:07:04 SCN-011 kernel: skb_copy_datagram_iter+0x38/0xa0
Oct 31 12:07:04 SCN-011 kernel: tcp_recvmsg_locked+0x2a7/0x9e0
Oct 31 12:07:04 SCN-011 kernel: ? __tcp_send_ack.part.0+0xcf/0x1c0
Oct 31 12:07:04 SCN-011 kernel: tcp_recvmsg+0x79/0x1c0
Oct 31 12:07:04 SCN-011 kernel: ? _raw_spin_unlock_bh+0x1e/0x30
Oct 31 12:07:04 SCN-011 kernel: inet_recvmsg+0x5e/0x130
Oct 31 12:07:04 SCN-011 kernel: ? security_socket_recvmsg+0x3a/0x60
Oct 31 12:07:04 SCN-011 kernel: sock_recvmsg+0x71/0x80
Oct 31 12:07:04 SCN-011 kernel: __sys_recvfrom+0x1a2/0x1d0
Oct 31 12:07:04 SCN-011 kernel: ? rseq_get_rseq_cs.isra.0+0x1b/0x230
Oct 31 12:07:04 SCN-011 kernel: ? rseq_ip_fixup+0x72/0x1a0
Oct 31 12:07:04 SCN-011 kernel: ? do_futex+0x162/0x1f0
Oct 31 12:07:04 SCN-011 kernel: __x64_sys_recvfrom+0x24/0x30
Oct 31 12:07:04 SCN-011 kernel: do_syscall_64+0x59/0xc0
Oct 31 12:07:04 SCN-011 kernel: ? switch_fpu_return+0x4e/0xe0
Oct 31 12:07:04 SCN-011 kernel: ? exit_to_user_mode_prepare+0x96/0xb0
Oct 31 12:07:04 SCN-011 kernel: ? syscall_exit_to_user_mode+0x27/0x50
Oct 31 12:07:04 SCN-011 kernel: ? __x64_sys_recvfrom+0x24/0x30
Oct 31 12:07:04 SCN-011 kernel: ? do_syscall_64+0x69/0xc0
Oct 31 12:07:04 SCN-011 kernel: ? do_syscall_64+0x69/0xc0
Oct 31 12:07:04 SCN-011 kernel: ? exit_to_user_mode_prepare+0x96/0xb0
Oct 31 12:07:04 SCN-011 kernel: ? syscall_exit_to_user_mode+0x27/0x50
Oct 31 12:07:04 SCN-011 kernel: ? __x64_sys_recvfrom+0x24/0x30
Oct 31 12:07:04 SCN-011 kernel: ? do_syscall_64+0x69/0xc0
Oct 31 12:07:04 SCN-011 kernel: ? do_syscall_64+0x69/0xc0
Oct 31 12:07:04 SCN-011 kernel: entry_SYSCALL_64_after_hwframe+0x61/0xcb
Oct 31 12:07:04 SCN-011 kernel: RIP: 0033:0x7f210d2a76be
Oct 31 12:07:04 SCN-011 kernel: Code: 4c 24 1c e8 54 93 f6 ff 44 8b 54 24 1c 8b 3c 24 45 31 c9 41 89 c4 48 8b 54 24 10 48 8b 74 24 08 45 31 c0 b8 2d 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 32 44 89 e7 48 89 04 24>
Oct 31 12:07:04 SCN-011 kernel: RSP: 002b:00007f20de929c70 EFLAGS: 00000246 ORIG_RAX: 000000000000002d
Oct 31 12:07:04 SCN-011 kernel: RAX: ffffffffffffffda RBX: 000000000000000c RCX: 00007f210d2a76be
Oct 31 12:07:04 SCN-011 kernel: RDX: 00000000000b6d43 RSI: 00007f20bc0b7fc0 RDI: 000000000000000c
Oct 31 12:07:04 SCN-011 kernel: RBP: 00007f20bc0b7fc0 R08: 0000000000000000 R09: 0000000000000000
Oct 31 12:07:04 SCN-011 kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
Oct 31 12:07:04 SCN-011 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 00007f20e80fff68
Oct 31 12:07:04 SCN-011 kernel: </TASK>
Oct 31 12:07:04 SCN-011 kernel: Modules linked in: ccm binfmt_misc nls_iso8859_1 snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel sou>
Oct 31 12:07:04 SCN-011 kernel: blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear i915 ttm crct10dif_pc>
Oct 31 12:07:04 SCN-011 kernel: ---[ end trace 6c67e6131f7bcf60 ]---
Oct 31 12:07:04 SCN-011 kernel: RIP: 0010:__skb_datagram_iter+0x1a9/0x2f0
Oct 31 12:07:04 SCN-011 kernel: Code: c6 75 53 48 29 d6 48 8b 55 10 48 01 f7 4c 89 c6 48 01 cf 48 8b 4d b8 e8 15 fe ff ff 44 8b 5d d0 41 01 c4 44 39 f0 75 59 29 c3 <0f> 84 ee fe ff ff 48 8b 55 a8 8b 82 bc 00 00>
Oct 31 12:07:04 SCN-011 kernel: RSP: 0018:ffffbc6681197ae0 EFLAGS: 00010206
Oct 31 12:07:04 SCN-011 kernel: RAX: 0000000000000400 RBX: 0000000000001c00 RCX: 00000000000b6d43
Oct 31 12:07:04 SCN-011 kernel: RDX: 0000000000000800 RSI: ffffbc6681197d48 RDI: ffffbc6681197d48
Oct 31 12:07:04 SCN-011 kernel: RBP: ffffbc6681197b40 R08: 0000000000000400 R09: ffffffffb82e8f30
Oct 31 12:07:04 SCN-011 kernel: R10: 0000000000000000 R11: 0000000000000800 R12: 0000000000000800
Oct 31 12:07:04 SCN-011 kernel: R13: 0000000000000400 R14: 0000000000000400 R15: 0000000000000000
Oct 31 12:07:04 SCN-011 kernel: FS: 00007f20de92a640(0000) GS:ffff97d677f00000(0000) knlGS:0000000000000000
Oct 31 12:07:04 SCN-011 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 31 12:07:04 SCN-011 kernel: CR2: 00007f6a9c02d048 CR3: 0000000106c42000 CR4: 0000000000350ee0
Oct 31 12:07:13 SCN-011 kernel: igb 0000:03:00.0 enp3s0: igb: enp3s0 NIC Link is Down
I am not sure what other information is relevant that I can provide that would be helpful.