TheHiddenWaffle I made the changes I outlined in my edit. My average for the double-input blob came out to 120 ms, which is still 16% faster on a per-detection basis, but the cost is a 73% increase in latency and, of course, a frame rate cut almost in half (vs. the single-body model) when only 1 body is present to process. Here's the model if anyone on this thread is interested, though I don't expect many would be. I also tested 3 inputs, but there was no meaningful improvement in any performance metric, and the latency jumped another 60 ms.
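For anyone who wants to sanity-check those percentages, here is a rough back-of-envelope sketch. The single-input latency is not stated above, so it is inferred here on the assumption that the 73% increase is measured against it. The 3-input latency is likewise assumed to be 120 ms plus the extra 60 ms. All variable names are just for illustration.

```python
# Back-of-envelope check of the quoted figures (a sketch, not a benchmark).
double_latency_ms = 120.0  # quoted: average time for one 2-input blob
latency_increase = 0.73    # quoted: +73% latency vs. the single-input model

# Assumption: the 73% is relative to the single-input latency,
# which lets us back out an implied baseline.
single_latency_ms = double_latency_ms / (1 + latency_increase)

# Per-detection cost when both inputs of the blob are actually used.
per_detection_ms = double_latency_ms / 2
speedup = single_latency_ms / per_detection_ms - 1

# With only 1 body in frame, the frame rate scales with 1 / latency.
fps_ratio = single_latency_ms / double_latency_ms

# Assumption: the 3-input run took 120 ms + 60 ms per blob.
triple_latency_ms = double_latency_ms + 60.0
triple_per_detection_ms = triple_latency_ms / 3

print(f"implied single-input latency: {single_latency_ms:.1f} ms")
print(f"per-detection speedup (2 inputs): {speedup:.1%}")
print(f"frame rate vs. single model (1 body): {fps_ratio:.2f}x")
print(f"per-detection time (3 inputs): {triple_per_detection_ms:.1f} ms")
```

Under those assumptions the implied single-input baseline is a bit under 70 ms, and the 3-input blob works out to the same per-detection time as the 2-input one, which would line up with seeing no gain from the third input.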