It's fine to write my own code for calculating spatials. But I am wondering if there is a more straightforward way of accessing the xyz spatial data?
For example, let's say:
Let L=(h, v) where h, v refers to the (horizontal, vertical) pixel location value. So on a 720p resolution image, (1280, 720) would be bottom right corner.
Using Oak-D and google mediapipe (running on my computer), my code will locate where the 33 landmarks are. L1, ... , L33 = (h1, v1), ... (h33, v33).
Then I just want it to look up those coordinates on the Oak-D spatial map (depth map?), look up the xyz spatial coordinates, and return those values. So if L15 = (920, 385), then I get something like (x=1m, y=2.5m, z=2m)
If I can accomplish this, then I think I can also write the code to test out different ROI's and average them out on the computer.
What would be the most direct way of just getting those spatial values?