I have an idea I want to try on the OAK-D, but I'm not sure how to approach it. I want to take the bounding box from a face detection net, crop and resize the frame to it, and feed the result into a second neural net, all on the device without going through the host. I assume this will be much faster.
From here, it seems like a custom PyTorch net placed between the two could do it.
The problem is that I would have to link both the camera preview and the face detection bounding box output to this new net. Is that possible? What would the data format be? I checked the docs but couldn't find any leads.
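
To make the question concrete, here is a rough sketch in plain Python of the operation I'd want the in-between net to perform. It runs on the host and only illustrates the logic. It assumes the bounding box arrives as normalized (xmin, ymin, xmax, ymax) values in [0, 1], which I haven't confirmed. `crop_and_resize` and its parameters are names I made up for illustration and are not part of any DepthAI API.

```python
def crop_and_resize(frame, bbox, out_w, out_h):
    """Crop `frame` to a bbox and resize it with nearest-neighbor sampling.

    frame: list of rows, each row a list of pixel values (H x W).
    bbox:  (xmin, ymin, xmax, ymax), ASSUMED to be normalized to [0, 1].
    """
    h, w = len(frame), len(frame[0])

    # Clamp the box to the frame in case the detector overshoots the edges.
    xmin, ymin, xmax, ymax = (min(max(v, 0.0), 1.0) for v in bbox)

    # Convert normalized coordinates to pixel indices, keeping at least
    # one pixel in each dimension so the crop is never empty.
    x0 = min(int(xmin * w), w - 1)
    y0 = min(int(ymin * h), h - 1)
    x1 = min(max(int(xmax * w), x0 + 1), w)
    y1 = min(max(int(ymax * h), y0 + 1), h)
    crop_w, crop_h = x1 - x0, y1 - y0

    # Nearest-neighbor resize: map each output pixel back into the crop.
    return [
        [frame[y0 + (j * crop_h) // out_h][x0 + (i * crop_w) // out_w]
         for i in range(out_w)]
        for j in range(out_h)
    ]


# Tiny 4x4 "frame" whose pixel value encodes its position (row * 4 + col).
frame = [[row * 4 + col for col in range(4)] for row in range(4)]
# Crop the bottom-right quadrant and scale it back up to 4x4.
patch = crop_and_resize(frame, (0.5, 0.5, 1.0, 1.0), 4, 4)
```

What I don't know is how to express this inside a net with a fixed output shape, or how the two inputs (frame and box) would need to be laid out on the device.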