I am training a model which predicts multiple labels for same object. Example if it is a red car, I need two labels red and car. I trained it by giving different class numbers for the same bounding box in labels. While detecting on test images locally, I get to see multiple labels. However when detecting live on my OAK-D Pro, I always see a single label. I was wondering why I am not able to detect multi labels on the camera, whereas I am able to detect locally. Please help me with it.
Thanks.