u111s No, I used a standard YOLO object detection model together with a second neural network for instance segmentation, such as the Segment Anything Model (SAM) or Google's MediaPipe MagicTouch. This combination worked well for my specific problem.
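
For anyone wondering how the two stages fit together, here is a minimal sketch of the idea: the detector proposes boxes, and each confident box is handed to the segmenter as a prompt. Everything in it is an assumption for illustration. `toy_detect` and `toy_segment_box` are hypothetical stand-ins, not the YOLO, Segment Anything, or MediaPipe APIs, and the image is just a nested list of grayscale values. In a real setup you would swap in the actual model calls.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[List[int]]   # grayscale image as rows of pixel values
Mask = List[List[bool]]   # same height and width as the image


@dataclass
class Box:
    x1: int
    y1: int
    x2: int  # exclusive
    y2: int  # exclusive
    label: str
    score: float


def segment_detections(
    image: Image,
    detect: Callable[[Image], List[Box]],
    segment_box: Callable[[Image, Box], Mask],
    min_score: float = 0.5,
) -> List[Tuple[Box, Mask]]:
    """Run the detector, then prompt the segmenter with each confident box."""
    results = []
    for box in detect(image):
        if box.score < min_score:
            continue  # skip low-confidence detections
        results.append((box, segment_box(image, box)))
    return results


def toy_detect(image: Image) -> List[Box]:
    """Hypothetical stand-in for a detector: one box around all non-zero pixels."""
    ys = [y for y, row in enumerate(image) if any(v > 0 for v in row)]
    xs = [x for row in image for x, v in enumerate(row) if v > 0]
    if not xs:
        return []
    return [Box(min(xs), min(ys), max(xs) + 1, max(ys) + 1, "object", 0.9)]


def toy_segment_box(image: Image, box: Box) -> Mask:
    """Hypothetical stand-in for a box-prompted segmenter: threshold inside the box."""
    height, width = len(image), len(image[0])
    return [
        [
            box.y1 <= y < box.y2 and box.x1 <= x < box.x2 and image[y][x] > 128
            for x in range(width)
        ]
        for y in range(height)
    ]


image = [
    [0, 0, 0, 0],
    [0, 200, 50, 0],
    [0, 255, 200, 0],
    [0, 0, 0, 0],
]

for box, mask in segment_detections(image, toy_detect, toy_segment_box):
    pixels = sum(v for row in mask for v in row)
    print(box.label, (box.x1, box.y1, box.x2, box.y2), "mask pixels:", pixels)
```

The point of keeping the two stages behind plain callables is that the detector and the segmenter can be replaced independently, which is what makes this kind of pairing easy to try on a specific problem.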