Hi dhunjoshi ,
So there are two parts to our OCR system:
- Text detection. We used EAST, as it allows text to be oriented at odd angles/etc. I am pinging our engineer on this on if he did retraining. Anyway, here is one example of retraining it in TensorFlow, which then should be compatible with our platform, as OpenVINO supports TensorFlow.
- OCR. This takes the region found from the text detection, and runs the actual OCR on it. For this network, we use the OCR model from Intel: https://docs.openvinotoolkit.org/2019_R1/_text_recognition_0012_description_text_recognition_0012.html
I do not at this time know if Intel gives a reference on how to retrain that. For some of their networks, they do give retraining. Looking quickly. Not immediately seeing it.
From the notes on it, "VGG16-like backbone and bidirectional LSTM encoder-decoder". So likely any network that is similar to this could be used instead, as long as it is on a similar backbone (or one of these), or uses neural operations supported by OpenVINO for the VPU (OAK-D is the VPU in this context), see here.