Hi All,
- I have run into an issue while trying to detect an object and get the distance from the camera to the object. According to the documentation (https://github.com/google/mediapipe/blob/master/docs/solutions/objectron.md#camera-coordinate), the rotation and translation matrices are with respect to the camera coordinate frame (see figure 1) but when I physically measure the distance between the camera and object, the distance/vector does not add up.
For example, if the camera is mounted at some distance from in space as shown in Figure 2, I would like to know the rotation and translation of the center of the bounding box to the camera frame. As per the documentation and my understanding, the rotation and translation of the object are given in terms of the camera coordinates. But if I compute the vector (sqrt(tx2+ty2+tz2)) from this translation vector, I am not getting the distance marked in black in my sketch. Would you guys please point out what I am missing here?
Figure 1.
Figure 2
- Lastly, Can you point me in the right direction to annotate and train 3D objects bounding boxes?