I recently purchased a luxonis camera for obstacle detection. I was thinking that I could possibly modify the YOLOv8 architecture so that it can take in a depth map as the fourth channel in addition to the 3 RGB channels (mostly to deal with objects that are similarly colored to the background). However, I haven't been able to find much mention of this online. Has anyone ever heard of something like this being done before? Is it possible, and would it be worth the effort?
Using depth as a fourth channel to input into YOLO