Hi @yishu_corpex
In case of SpatialLocationCalculator, the BBOX you see in the examples is the group of points that are taken into account when calculating the depth. The depth is given by an averaging method (usually MEDIAN, but can be MIN/MAX, average, ...).
Same thing goes for SpatialDetectionNetworks, the detected bounding box of an object is first scaled by BBoxScalingFactor
, then the points inside that new BBOX are used to calculate the depth across those pixels. X and Y are taken from the centroid of that BBOX.
Thanks,
Jaka