Hi everyone - I'm working on a vehicle that will steer autonomously in an agricultural setting. I want to use an oak-d-lite to determine where a furrow is (the path between the crops, ie where a tractor drives) in a field and use that information to drive in the center of the furrow. After looking at some existing ML model databases, I think that I'm going to have to develop my own model to do this.
As I understand it, the steps to do this are
- use an oak-d-lite to take thousands of photos of furrows
- plug these photos into a ML framework like tensorflow to create a model
- compile the model and run it on the oak-d-lite
So my questions are -
1) Does all of the above seem correct?
2) Is there a preferred format for using depth and RGB data in a training set? Should I preprocess the data in some way?
3) Is there a preferred software stack for dealing with depth data or one that is especially easy to use for these kinds of use cases? I was going to use tensorflow just because it was the only one I had heard of before I starting this project.
As you can probably tell, I'm brand new to the world of computer vision and AI so any help would be much appreciated. Thanks!