nior Another option is to just discard other detections based on labels, and only keep person label detections. NN inference time would be exactly the same for 1 label or 80 of them (for the same model architecture of course), but postprocessing (NN result decoding like NMS for Yolo, then perhaps object tracking etc.) might be a bit faster.
Thanks, Erik