TedHerman The Pi 5 is significantly faster than the Pi 4, which I'd used previously. It may be possible to use the OAK to detect persons and get distances, plus feed a subset of the same images to a neural net on the Pi 5 and somehow merge the results (yes, of course timing will degrade the fusion).
Yes. It's fully possible, however as you have said, timing will be off by a second (depending on the complexity of the model).
TedHerman . I also tried their Yolo v8 pose model - it should be able to run on the OAK, yes?
Yes, but decoding is currently not supported so you will have to do it yourself.
TedHerman I don't have any idea how to debug this. I think the openvino multipose is better than Yolo, if it could work.
I can't really help if you are running the model on the CPU, also I am not familiar with the 2.0API. Likely best to use GPT for this task as it is probably just some bug in the code.
Thanks,
Jaka